COMMITS
May 5, 2024
feat(ci): Add docker checks and check deps more frequently (#1426)
Olivier DEBAUCHE committed
feat: Update llama.cpp
Andrei Betlen committed
May 4, 2024
feat: Implement streaming for Functionary v2 + Bug fixes (#1419)
Jeffrey Fong committed
May 3, 2024
Merge branch 'main' of github.com:abetlen/llama_cpp_python into main
Andrei Betlen committed
fix: Use memmove to copy str_value kv_override. Closes #1417
Andrei Betlen committed
feat(server): Remove temperature bounds checks for server. Closes #1384
Andrei Betlen committed
fix(server): Propagate `flash_attn` to model load. (#1424)
Daniel Thuerck committed
May 2, 2024
chore: Bump version
Andrei Betlen committed
feat: Update llama.cpp
Andrei Betlen committed
feat: Add llama-3-vision-alpha chat format
Andrei Betlen committed
April 30, 2024
feat: Update llama.cpp
Andrei Betlen committed
Merge branch 'main' of github.com:abetlen/llama_cpp_python into main
Andrei Betlen committed
fix: UTF-8 handling with grammars (#1415)
Jonathan Soma committed
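Correct UTF-8 handling in streaming contexts generally requires an incremental decoder, because a multi-byte character can be split across chunk (or token) boundaries. A generic sketch of that idea, not the actual fix in #1415:

```python
import codecs

# An incremental decoder buffers an incomplete multi-byte sequence
# instead of raising (or emitting replacement characters) mid-character.
decoder = codecs.getincrementaldecoder("utf-8")()

# "é" is two bytes (0xC3 0xA9); here it arrives split across two chunks.
chunks = [b"caf\xc3", b"\xa9"]
out = "".join(decoder.decode(chunk) for chunk in chunks)
print(out)  # café
```

Decoding each chunk independently with `bytes.decode("utf-8")` would fail on the first chunk; the incremental decoder holds the trailing `0xC3` until its continuation byte arrives.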
docs: Change all examples from interpreter style to script style.
Andrei Betlen committed
docs: Update README.md
Andrei Betlen committed
chore: Bump version
Andrei Betlen committed
fix: wrong parameter for flash attention in pickle __getstate__
Andrei Betlen committed
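The `__getstate__` fix above illustrates a classic hazard of hand-written pickle support: a mistyped or renamed key silently drops a setting on round-trip. A generic sketch of the pattern (the real `Llama.__getstate__` carries many more fields):

```python
import pickle

class Model:
    def __init__(self, flash_attn: bool = False):
        self.flash_attn = flash_attn

    def __getstate__(self):
        # Every constructor parameter must appear here under the exact
        # name __setstate__ expects, or it is silently lost on pickling.
        return {"flash_attn": self.flash_attn}

    def __setstate__(self, state):
        self.__init__(flash_attn=state["flash_attn"])

m = pickle.loads(pickle.dumps(Model(flash_attn=True)))
print(m.flash_attn)  # True
```

A round-trip test like the last two lines is the cheapest guard against this class of bug.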
feat: Add option to enable `flash_attn` to Llama params and ModelSettings
Andrei Betlen committed
feat: Update llama.cpp
Andrei Betlen committed
fix(ci): Fix build-and-release.yaml (#1413)
Olivier DEBAUCHE committed
docs: Update README to include CUDA 12.4 wheels
Andrei Betlen committed
chore: Bump version
Andrei Betlen committed
fix: Ensure image renders before text in chat formats regardless of message content order.
Andrei Betlen committed
chore: Bump version
Andrei Betlen committed
feat: Update llama.cpp
Andrei Betlen committed