COMMITS
May 2, 2024
chore: Bump version
Andrei Betlen committed
feat: Update llama.cpp
Andrei Betlen committed
feat: Add llama-3-vision-alpha chat format
Andrei Betlen committed
April 30, 2024
feat: Update llama.cpp
Andrei Betlen committed
Merge branch 'main' of github.com:abetlen/llama_cpp_python into main
Andrei Betlen committed
fix: UTF-8 handling with grammars (#1415)
Jonathan Soma committed
docs: Change all examples from interpreter style to script style.
Andrei Betlen committed
docs: Update README.md
Andrei Betlen committed
chore: Bump version
Andrei Betlen committed
fix: wrong parameter for flash attention in pickle __getstate__
Andrei Betlen committed
feat: Add option to enable `flash_attn` to Llama params and ModelSettings
Andrei Betlen committed
feat: Update llama.cpp
Andrei Betlen committed
fix(ci): Fix build-and-release.yaml (#1413)
Olivier DEBAUCHE committed
docs: Update README to include CUDA 12.4 wheels
Andrei Betlen committed
chore: Bump version
Andrei Betlen committed
fix: Ensure image renders before text in chat formats regardless of message content order.
Andrei Betlen committed
chore: Bump version
Andrei Betlen committed
feat: Update llama.cpp
Andrei Betlen committed
chore(deps): bump actions/upload-artifact from 3 to 4 (#1412)
dependabot[bot] committed
chore(deps): bump actions/configure-pages from 4 to 5 (#1411)
dependabot[bot] committed
chore(deps): bump softprops/action-gh-release from 1 to 2 (#1408)
dependabot[bot] committed
ci(fix): Workflow actions updates and fix arm64 wheels not included in release (#1392)
Olivier DEBAUCHE committed
April 29, 2024
fix(ci): Update generate wheel index script to include cu12.3 and cu12.4 Closes #1406
Andrei Betlen committed
April 28, 2024
ci: Add support for pre-built cuda 12.4.1 wheels (#1388)
Olivier DEBAUCHE committed
feat: Add support for str type kv_overrides
Andrei Betlen committed
feat: Update llama.cpp
Andrei Betlen committed