COMMITS
April 3, 2024
A
Merge branch 'main' into binary-wheels
Andrei committed
A
fix: segfault when logits_all=False. Closes #1319
Andrei Betlen committed
A
Update workflow name
Andrei Betlen committed
A
Update generate index workflow
Andrei Betlen committed
A
Add workflows to build CUDA and Metal wheels
Andrei Betlen committed
A
Merge branch 'main' into binary-wheels
Andrei Betlen committed
A
Merge branch 'main' of https://github.com/abetlen/llama-cpp-python into main
Andrei Betlen committed
A
feat: Update llama.cpp
Andrei Betlen committed
April 1, 2024
Y
fix: last tokens passing to sample_repetition_penalties function (#1295)
Yuri Mikhailov committed
A
chore: Bump version
Andrei Betlen committed
L
fix: Changed local API doc references to hosted (#1317)
lawfordp2017 committed
L
feat: add support for KV cache quantization options (#1307)
Limour committed
March 31, 2024
W
feat: Add logprobs support to chat completions (#1311)
windspirit95 committed
March 29, 2024
A
feat: Update llama.cpp
Andrei Betlen committed
A
feat: Update llama.cpp
Andrei Betlen committed
March 28, 2024
A
feat: Update llama.cpp
Andrei Betlen committed
March 27, 2024
A
feat: Update llama.cpp
Andrei Betlen committed
March 26, 2024
A
feat: Update llama.cpp
Andrei Betlen committed
March 25, 2024
A
feat: Update llama.cpp
Andrei Betlen committed
March 24, 2024
A
feat: Update llama.cpp
Andrei Betlen committed
March 23, 2024
A
fix(server): minor type fixes
Andrei Betlen committed
A
fix: tool_call missing first token.
Andrei Betlen committed
A
feat: Update llama.cpp
Andrei Betlen committed
March 21, 2024
A
feat: Update llama.cpp
Andrei Betlen committed
March 20, 2024
A
feat: Update llama.cpp
Andrei Betlen committed
B
fix: set LLAMA_METAL_EMBED_LIBRARY=on on MacOS arm64 (#1289)
bretello committed
March 19, 2024
A
docs: Add chat examples to openapi ui
Andrei Betlen committed
A
A
feat: Update llama.cpp
Andrei Betlen committed
March 18, 2024
A
Merge branch 'main' of https://github.com/abetlen/llama-cpp-python into main
Andrei Betlen committed