COMMITS
llama_cpp/server/model.py

March 22, 2026
  misc: Add Ruff formatting (#2148) (Andrei)
July 3, 2025
  feat: Add support for new mtmd api, add Qwen2.5-VL chat handler (Andrei Betlen)
September 20, 2024
  feat: Add option to configure n_ubatch (Andrei Betlen)
August 29, 2024
  feat: Add server chat_format minicpm-v-2.6 for MiniCPMv26ChatHandler (Andrei Betlen)
July 17, 2024
  fix(server): Use split_mode from model settings (#1594) (Grider)
June 13, 2024
  feat: Add `.close()` method to `Llama` class to explicitly free model from memory (#1513) (Junpei Kawamoto)
June 4, 2024
  feat: adding `rpc_servers` parameter to `Llama` class (#1477) (nullname)
May 29, 2024
  fix: fix string value kv_overrides. Closes #1487 (Andrei Betlen)
May 3, 2024
  fix(server): Propagate `flash_attn` to model load. (#1424) (Daniel Thuerck)
May 2, 2024
  feat: Add llama-3-vision-alpha chat format (Andrei Betlen)
April 30, 2024
April 1, 2024
  feat: add support for KV cache quantization options (#1307) (Limour)
February 28, 2024
  misc: Format (Andrei Betlen)
February 26, 2024
February 8, 2024
  fix: broken import (Andrei Betlen)
January 31, 2024
  Add speculative decoding (#1120) (Andrei)
January 21, 2024
January 19, 2024
January 15, 2024
  Implement GGUF metadata KV overrides (#1011) (Phil H)
December 22, 2023
  [Feat] Multi model support (#931) (Dave)
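Several of the commits listed above add per-model settings that the server can read from a config file introduced by the multi model support change (#931). A minimal sketch of such a config, assuming the server's JSON config format with a `models` list of per-model settings; the file paths and aliases are hypothetical, and the field names (`chat_format`, `n_gpu_layers`, `flash_attn`) are taken from the commit messages above:

```json
{
  "host": "0.0.0.0",
  "port": 8000,
  "models": [
    {
      "model": "models/mistral-7b-instruct.Q4_K_M.gguf",
      "model_alias": "mistral",
      "chat_format": "chatml",
      "n_gpu_layers": -1
    },
    {
      "model": "models/minicpm-v-2.6.Q4_K_M.gguf",
      "model_alias": "minicpm-v",
      "chat_format": "minicpm-v-2.6",
      "flash_attn": true
    }
  ]
}
```

With a config like this, requests can select a model by its `model_alias`, and each entry carries its own load-time options rather than sharing one global set.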