COMMITS
/ README.md September 20, 2024
O
docs: Add cuda 12.5 to README.md (#1750)
Olivier DEBAUCHE committed
September 18, 2024
O
feat(ci): Speed up CI workflows using `uv`, add support for CUDA 12.5 wheels
Olivier DEBAUCHE committed
August 29, 2024
A
docs: Update README
Andrei Betlen committed
A
docs: Update README
Andrei Betlen committed
A
docs: Update README
Andrei Betlen committed
A
docs: Add MiniCPM-V-2.6 to multi-modal model list
Andrei Betlen committed
A
docs: center icon and resize
Andrei Betlen committed
A
docs: Add project icon courtesy of 🤗
Andrei Betlen committed
July 17, 2024
E
fix(docs): Update README.md typo (#1589)
Eric Curtin committed
July 9, 2024
A
fix(docs): Remove kompute backend references
Andrei Betlen committed
A
fix(docs): Update LLAMA_ flags to GGML_ flags
Andrei Betlen committed
June 21, 2024
J
docs: Update readme examples to use newer Qwen2 model (#1544)
Jon Craton committed
June 20, 2024
A
docs: Remove references to deprecated opencl backend. Closes #1512
Andrei Betlen committed
June 4, 2024
N
feat: adding `rpc_servers` parameter to `Llama` class (#1477)
nullname committed
May 24, 2024
A
docs: Fix table formatting
Andrei Betlen committed
A
fix(docs): Fix link typo
Andrei Betlen committed
A
docs: Update multi-modal model section
Andrei Betlen committed
May 10, 2024
P
docs: Fix typo in README.md (#1444)
Peng Yu committed
May 8, 2024
I
docs: update README.md (#1432)
Ikko Eltociear Ashimine committed
April 30, 2024
A
docs: Change all examples from interpreter style to script style.
Andrei Betlen committed
A
docs: Update README.md
Andrei Betlen committed
A
docs: Update README to include CUDA 12.4 wheels
Andrei Betlen committed
April 28, 2024
J
fix: Functionary bug fixes (#1385)
Jeffrey Fong committed
April 26, 2024
D
feat: Allow for possibly non-pooled embeddings (#1380)
Douglas Hanley committed
April 5, 2024
S
fix(docs): incorrect tool_choice example (#1330)
Sigbjørn Skjæret committed
April 4, 2024
A
docs: Rename cuBLAS section to CUDA
Andrei Betlen committed
A
docs: Add docs explaining how to install pre-built wheels.
Andrei Betlen committed
A
docs: LLAMA_CUBLAS -> LLAMA_CUDA
Andrei Betlen committed
April 3, 2024
A
feat: Binary wheels for CPU, CUDA (12.1 - 12.3), Metal (#1247)
Andrei committed
April 1, 2024
L
fix: Changed local API doc references to hosted (#1317)
lawfordp2017 committed
March 3, 2024
K
docs: fix small typo in README: 'model know how' -> 'model knows how' (#1244)
Kenneth Hoste committed
March 1, 2024
A
docs: Add information re: auto chat formats. Closes #1236
Andrei Betlen committed
February 29, 2024
D
docs: fix typo in README.md embeddings example. (#1232)
Douglas Hanley committed
February 26, 2024
A
A
docs: Add example of local image loading to README
Andrei Betlen committed
February 23, 2024
A
docs: Fix functionary repo_id
Andrei Betlen committed
A
docs: Restore functionary docs in README
Andrei Betlen committed
A
docs: Update README.md to fix pip install llama cpp server (#1187)
Aditya Purandare committed
A
docs: Fix typo
Andrei Betlen committed
February 22, 2024
A
docs: fix low-level api example
Andrei Betlen committed
A
docs: Update README
Andrei Betlen committed
A
docs: Update README
Andrei Betlen committed
A
docs: Update README
Andrei Betlen committed
A
docs: fix indentation for mkdocs-material
Andrei Betlen committed
February 21, 2024
A
feat: Pull models directly from huggingface (#1206)
Andrei committed
February 16, 2024
A
docs: Add embeddings section
Andrei Betlen committed
February 13, 2024
A
docs: Fix typo
Andrei Betlen committed
February 12, 2024
A
docs: Temporarily revert function calling docs
Andrei Betlen committed
February 11, 2024
A
docs: Set the correct command for compiling with syscl support (#1172)
Akarshan Biswas committed