COMMITS
April 6, 2024
chore: Bump version
Andrei Betlen committed
feat: Update llama.cpp
Andrei Betlen committed
fix: Always embed metal library. Closes #1332
Andrei Betlen committed
April 5, 2024
feat: Update llama.cpp
Andrei Betlen committed
Merge branch 'main' of https://github.com/abetlen/llama-cpp-python into main
Andrei Betlen committed
feat: Update llama.cpp
Andrei Betlen committed
fix(docs): incorrect tool_choice example (#1330)
Sigbjørn Skjæret committed
April 4, 2024
docs: Rename cuBLAS section to CUDA
Andrei Betlen committed
docs: Add docs explaining how to install pre-built wheels.
Andrei Betlen committed
docs: LLAMA_CUBLAS -> LLAMA_CUDA
Andrei Betlen committed
April 3, 2024
fix(ci): use correct script name
Andrei Betlen committed
chore: Bump version
Andrei Betlen committed
Merge branch 'main' of https://github.com/abetlen/llama-cpp-python into main
Andrei Betlen committed
feat: Update llama.cpp
Andrei Betlen committed
feat: Binary wheels for CPU, CUDA (12.1 - 12.3), Metal (#1247)
Andrei committed
fix: segfault when logits_all=False. Closes #1319
Andrei Betlen committed
Merge branch 'main' of https://github.com/abetlen/llama-cpp-python into main
Andrei Betlen committed
feat: Update llama.cpp
Andrei Betlen committed
April 1, 2024
fix: last tokens passing to sample_repetition_penalties function (#1295)
Yuri Mikhailov committed
chore: Bump version
Andrei Betlen committed
fix: Changed local API doc references to hosted (#1317)
lawfordp2017 committed
feat: add support for KV cache quantization options (#1307)
Limour committed
March 31, 2024
feat: Add logprobs support to chat completions (#1311)
windspirit95 committed
March 29, 2024
feat: Update llama.cpp
Andrei Betlen committed
feat: Update llama.cpp
Andrei Betlen committed
March 28, 2024
feat: Update llama.cpp
Andrei Betlen committed
March 27, 2024
feat: Update llama.cpp
Andrei Betlen committed
March 26, 2024
feat: Update llama.cpp
Andrei Betlen committed