COMMITS
March 22, 2026
A
misc: Add Ruff formatting (#2148)
Andrei committed
A
fix(ci): Rename `huggingface-cli` to `hf` (#2149)
Andrei committed
August 15, 2025
A
chore: Bump version
Andrei Betlen committed
A
feat: Update llama.cpp
Andrei Betlen committed
August 7, 2025
A
chore: Bump version
Andrei Betlen committed
S
fix: rename op_offloat to op_offload in llama.py (#2046)
sergey21000 committed
A
feat: Add gpt-oss chat format support through strftime_now in chat format by @iamlemec
Andrei Betlen committed
A
misc: Add Python 3.13 classifier tag
Andrei Betlen committed
A
misc: Update pypi downloads badge
Andrei Betlen committed
A
feat: Update llama.cpp
Andrei Betlen committed
July 18, 2025
A
chore: Bump version
Andrei Betlen committed
July 16, 2025
A
feat: Update llama.cpp
Andrei Betlen committed
July 15, 2025
A
chore: Bump version
Andrei Betlen committed
A
fix: Better chat format for Qwen2.5-VL (#2040)
Alcoft committed
A
feat: Update llama.cpp
Andrei Betlen committed
July 6, 2025
A
fix(ci): Fix macos cpu builds
Andrei Betlen committed
A
chore: Bump version
Andrei Betlen committed
A
fix(ci): Temporarily disable windows cuda wheels
Andrei Betlen committed
A
feat: Update llama.cpp
Andrei Betlen committed
A
fix(ci): Update docker runner
Andrei Betlen committed
July 5, 2025
A
fix(ci): update runners for cpu builds
Andrei Betlen committed
A
chore: Bump version
Andrei Betlen committed
A
fix(ci): Remove macos-13 builds to fix cross compilation error
Andrei Betlen committed
A
fix(ci): Add git to package list
Andrei Betlen committed
A
fix(ci): Update cuda build action to use ubuntu 22.04
Andrei Betlen committed
A
fix: Update reference to in Llama.embed. Closes #2037
Andrei Betlen committed
July 3, 2025
A
chore: Bump version
Andrei Betlen committed
A
docs: Add Qwen2.5-VL to README
Andrei Betlen committed
A
fix: Use num_threads from llama model for mtmd
Andrei Betlen committed