COMMITS: llama_cpp/llama.py (as of April 22, 2024)

April 20, 2024
- feat: Use new llama_token_is_eog in create_completions (Andrei Betlen)

April 17, 2024
- feat: Make saved state more compact on-disk (#1296) (tc-wolf)
- feat: Use all available CPUs for batch processing (#1345) (ddh0)

April 10, 2024
- fix: pass correct type to chat handlers for chat completion logprobs (Andrei Betlen)

April 3, 2024
- fix: segfault when logits_all=False. Closes #1319 (Andrei Betlen)

April 1, 2024
- feat: add support for KV cache quantization options (#1307) (Limour)

March 31, 2024
- feat: Add logprobs support to chat completions (#1311) (windspirit95)

March 14, 2024
- fix: set default pooling type to unspecified (Andrei Betlen)
- fix: Set default pooling_type to mean, check for null pointer. (Andrei Betlen)

March 9, 2024
- feat: Switch embed to llama_get_embeddings_seq (#1263) (Douglas Hanley)

March 6, 2024
- Update llama.cpp (Andrei Betlen)

March 1, 2024
- docs: Add information re: auto chat formats. Closes #1236 (Andrei Betlen)
- feat: Update llama.cpp (Andrei Betlen)

February 28, 2024
- fix: eos/bos_token set correctly for Jinja2ChatFormatter and automatic chat formatter (#1230) (Sigbjørn Skjæret)

February 25, 2024
- feat: Update llama.cpp (Andrei Betlen)

February 23, 2024
- fix: LlamaHFTokenizer now receives pre_tokens (Andrei Betlen)
- fix: module 'llama_cpp.llama_cpp' has no attribute 'c_uint8' (Andrei Betlen)

February 22, 2024
- fix: Update from_pretrained defaults to match hf_hub_download (Andrei Betlen)

February 21, 2024
- feat: Pull models directly from huggingface (#1206) (Andrei)
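The #1206 feature lets a model be fetched by repo id plus a filename glob rather than a local path. The file-selection step can be sketched library-free: given a repo's file listing, match the requested pattern against it. This is a minimal sketch of the idea only; `pick_gguf_file` is a hypothetical helper, not the library's code, and the real `Llama.from_pretrained` also handles downloading and caching via huggingface_hub.

```python
from fnmatch import fnmatch

def pick_gguf_file(repo_files, filename_pattern):
    """Return the repo files whose names match a glob pattern
    (e.g. '*Q4_K_M.gguf'), the way a specific quantization can be
    chosen out of a multi-file model repo."""
    return [f for f in repo_files if fnmatch(f, filename_pattern)]

files = [
    "model-Q2_K.gguf",
    "model-Q4_K_M.gguf",
    "README.md",
]
print(pick_gguf_file(files, "*Q4_K_M.gguf"))  # ['model-Q4_K_M.gguf']
```

If the pattern matches zero or several files, a real implementation would raise an error rather than guess, which is one reason the follow-up commit above aligns the defaults with `hf_hub_download`.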
February 17, 2024
- fix: self.numa missing (Andrei Betlen)
- feat: Update llama.cpp (Andrei Betlen)

February 15, 2024
- fix: create_embedding broken response for input type str (Andrei Betlen)
- fix: Incorporate embedding pooling layer fixes (#1194) (Douglas Hanley)

February 14, 2024
- feat: Support batch embeddings (#1186) (Douglas Hanley)

February 13, 2024
- fix: sample idx off-by-one error for logit_processors (#1179) (Andrew Lapp)

February 12, 2024
- fix: Always set logits_all = True when using speculative decoding (Andrei Betlen)
- feat: Generic chatml Function Calling (#957) (Andrei)

February 9, 2024
- Merge branch 'main' of github.com:abetlen/llama_cpp_python into main (Andrei Betlen)
- fix: revert _create_completions. (Andrei Betlen)

February 8, 2024
- feat: Move tokenizer to own module (Andrei Betlen)

February 6, 2024
- fix: Use llama_log_callback to avoid suppress_stdout_stderr (Andrei Betlen)

January 31, 2024
- Add speculative decoding (#1120) (Andrei)
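The speculative decoding added in #1120 works through a pluggable draft model (assumption: exposed as a `draft_model=` argument on `Llama`, with a bundled prompt-lookup decoder in `llama_cpp.llama_speculative`). The core prompt-lookup idea, proposing draft tokens by finding where the trailing n-gram last occurred earlier in the sequence, can be sketched without the library; `prompt_lookup_draft` below is a hypothetical illustration, not the library's implementation.

```python
def prompt_lookup_draft(tokens, ngram_size=2, num_pred=4):
    """Propose up to num_pred draft tokens by matching the trailing
    ngram_size tokens against an earlier occurrence in the sequence
    and copying what followed it."""
    if len(tokens) < ngram_size:
        return []
    tail = tokens[-ngram_size:]
    # Scan right-to-left, excluding the trailing n-gram itself.
    for start in range(len(tokens) - ngram_size - 1, -1, -1):
        if tokens[start:start + ngram_size] == tail:
            draft = tokens[start + ngram_size:start + ngram_size + num_pred]
            if draft:
                return draft
    return []

print(prompt_lookup_draft([1, 2, 3, 4, 2, 3], ngram_size=2, num_pred=2))  # [4, 2]
```

The drafted tokens are then verified in a single batched forward pass by the full model, which is why the February 12 fix above forces `logits_all = True` when a draft model is active.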
January 29, 2024
- Automatically set chat format from gguf (#1110) (Andrei)

January 24, 2024
- fix: Check order (Andrei Betlen)
- fix: format (Andrei Betlen)
- fix: GGUF metadata KV overrides, re #1011 (#1116) (Phil H)

January 19, 2024
- feat: Expose gguf model metadata in metadata property (Andrei Betlen)
- Fix mirostat sampling (Andrei Betlen)

January 18, 2024
- Offload KQV by default (Andrei Betlen)

January 17, 2024
- Re-order classes in llama.py (Andrei Betlen)
- Move helper classes to _internals submodule (Andrei Betlen)
- Move cache classes to llama_cache submodule. (Andrei Betlen)

January 15, 2024
- Add split_mode option. Closes #1085 (Andrei Betlen)
- Implement GGUF metadata KV overrides (#1011) (Phil H)
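KV overrides (#1011, fixed up in #1116 above) let a caller replace individual GGUF metadata values at load time. Upstream llama.cpp encodes such overrides as `KEY=TYPE:VALUE` strings on the command line; the parsing step can be sketched as below. This is a hedged illustration of that string format only; `parse_kv_override` is a hypothetical helper, and the Python binding's actual `kv_overrides` parameter may accept already-typed values instead.

```python
def parse_kv_override(spec):
    """Parse a llama.cpp-style 'KEY=TYPE:VALUE' override string into a
    (key, typed_value) pair. Supported type tags here: int, float,
    bool, str."""
    key, _, typed = spec.partition("=")
    type_name, _, raw = typed.partition(":")
    casts = {
        "int": int,
        "float": float,
        "bool": lambda v: v.lower() == "true",
        "str": str,
    }
    return key, casts[type_name](raw)

print(parse_kv_override("tokenizer.ggml.add_bos_token=bool:false"))
```

Typed parsing matters because the override must land in the matching GGUF value slot (integer, float, boolean, or string) rather than always being stored as text.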
January 10, 2024
- Add ability to pass in penalize_nl param (#1068) (Stephen Hankinson)

December 22, 2023