COMMITS
April 18, 2024
A
chore: Bump version
Andrei Betlen committed
A
feat: Update llama.cpp
Andrei Betlen committed
L
feat: update grammar schema converter to match llama.cpp (#1353)
Lucca Zenóbio committed
April 17, 2024
A
Revert "feat: Update json to grammar (#1350)"
Andrei Betlen committed
L
feat: Update json to grammar (#1350)
Lucca Zenóbio committed
K
feat: add `disable_ping_events` flag (#1257)
khimaros committed
T
feat: Make saved state more compact on-disk (#1296)
tc-wolf committed
A
feat: Update llama.cpp
Andrei Betlen committed
D
feat: Use all available CPUs for batch processing (#1345)
ddh0 committed
April 14, 2024
A
feat: Update llama.cpp
Andrei Betlen committed
A
feat: Update llama.cpp
Andrei Betlen committed
April 13, 2024
A
feat: Update llama.cpp
Andrei Betlen committed
April 10, 2024
A
chore: Bump version
Andrei Betlen committed
A
fix: pass correct type to chat handlers for chat completion logprobs
Andrei Betlen committed
A
feat: Add support for yaml based configs
Andrei Betlen committed
A
feat: Add typechecking for ctypes structure attributes
Andrei Betlen committed
A
feat: Update llama.cpp
Andrei Betlen committed
April 9, 2024
A
feat: Update llama.cpp
Andrei Betlen committed
April 6, 2024
A
chore: Bump version
Andrei Betlen committed
A
feat: Update llama.cpp
Andrei Betlen committed
A
fix: Always embed metal library. Closes #1332
Andrei Betlen committed
April 5, 2024
A
feat: Update llama.cpp
Andrei Betlen committed
A
Merge branch 'main' of https://github.com/abetlen/llama-cpp-python into main
Andrei Betlen committed
A
A
A
feat: Update llama.cpp
Andrei Betlen committed
S
fix(docs): incorrect tool_choice example (#1330)
Sigbjørn Skjæret committed
April 4, 2024
A
docs: Rename cuBLAS section to CUDA
Andrei Betlen committed
A
docs: Add docs explaining how to install pre-built wheels.
Andrei Betlen committed
A
docs: LLAMA_CUBLAS -> LLAMA_CUDA
Andrei Betlen committed