COMMITS
/ llama_cpp/llama.py

January 18, 2024
Merge branch 'main' into batch-processing
Andrei Betlen committed

January 17, 2024
Re-order classes in llama.py
Andrei Betlen committed
Move helper classes to _internals submodule
Andrei Betlen committed
Move cache classes to llama_cache submodule.
Andrei Betlen committed

January 15, 2024
Merge branch 'main' into batch-processing
Andrei Betlen committed
Add split_mode option. Closes #1085
Andrei Betlen committed
Implement GGUF metadata KV overrides (#1011)
Phil H committed

January 10, 2024
Use sampling context
Andrei Betlen committed
Merge branch 'main' into batch-processing
Andrei Betlen committed
Add ability to pass in penalize_nl param (#1068)
Stephen Hankinson committed

January 5, 2024
Merge branch 'main' into batch-processing
Andrei Betlen committed

December 22, 2023
fix text_offset of multi-token characters (#1037)
twaka committed

December 18, 2023
fix: float32 is not JSON serializable when streaming logits.
Andrei Betlen committed
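The float32 serialization fix above addresses a common numpy pitfall: the stdlib `json` encoder rejects numpy scalar types, so logprob values must be converted to native Python floats before streaming. A minimal sketch of the failure mode and one common remedy (illustrative only, not necessarily the exact change in the commit):

```python
import json

import numpy as np

# numpy scalars are not handled by the stdlib json encoder:
# json.dumps(np.float32(0.5)) raises TypeError.
logprobs = np.array([-0.1, -2.3], dtype=np.float32)

# Remedy: convert numpy scalars to native Python floats before encoding.
payload = {"logprobs": [float(x) for x in logprobs]}
encoded = json.dumps(payload)  # now succeeds
```

An alternative is passing a `default=` hook to `json.dumps`, but eager conversion keeps the streamed chunks plain-Python throughout.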
Fix logits are not json serializable
Andrei Betlen committed
Merge branch 'main' into batch-processing
Andrei Betlen committed
Add offload_kqv option to llama and server
Andrei Betlen committed
Remove unused import
Andrei Betlen committed
perf: Don't convert logprobs arrays to lists (#1021)
kddubey committed
Bugfix: Remove f16_kv, add offload_kqv field (#1019)
Brandon Roberts committed

December 16, 2023
Bug fixed with n_ctx=0 (#1015)
Daniele Morotti committed
Fix logits_to_logprobs for 2-D and 3-D logits (#1002)
kddubey committed

December 12, 2023
Replace logits_to_logprobs implementation with numpy equivalent to llama.cpp (#991)
Tanner Hobson committed
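The `logits_to_logprobs` commits above (#991, #1002) concern converting raw logits to log-probabilities, i.e. a log-softmax. A vectorized numpy sketch of that technique, written over the last axis so it handles 1-D, 2-D, and 3-D inputs alike (the function name matches the library's, but this body is an illustration, not the library's exact implementation):

```python
import numpy as np

def logits_to_logprobs(logits: np.ndarray) -> np.ndarray:
    """Log-softmax over the last axis of `logits`."""
    # Subtract the per-row max before exponentiating for numerical stability.
    maxes = np.max(logits, axis=-1, keepdims=True)
    shifted = logits - maxes
    log_sum_exp = np.log(np.sum(np.exp(shifted), axis=-1, keepdims=True))
    return shifted - log_sum_exp

logprobs = logits_to_logprobs(np.array([[1.0, 2.0, 3.0]]))
# Exponentiating log-softmax output yields probabilities summing to 1
# along the last axis, for any input rank.
```

Using `axis=-1` rather than a hardcoded axis is what makes the same code correct for the 2-D and 3-D cases named in #1002.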
Merge branch 'main' into batch-processing
Andrei Betlen committed

December 11, 2023
Remove f16_kv
Andrei Betlen committed

November 30, 2023
Refactor _create_completion
Andrei Betlen committed

November 29, 2023
Fix #891 (#952)
kddubey committed

November 28, 2023
Add sampling context
Andrei Betlen committed
Clean up llama.py into separate modules.
Andrei Betlen committed

November 26, 2023
docs: Update Llama docs
Andrei Betlen committed

November 24, 2023
docs: Update completion and chat_completion parameter docstrings
Andrei Betlen committed

November 23, 2023
docs: Add Llama class example
Andrei Betlen committed

November 21, 2023
Fix: Add logit_bias to all completion api methods
Andrei Betlen committed
Add support for logit_bias outside of server api. Closes #827
Andrei Betlen committed
Added support for min_p (#921)
TK-Master committed
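The min_p commit above (#921) adds llama.cpp's min-p sampling, which discards tokens whose probability falls below `min_p` times the most likely token's probability. A hedged numpy sketch of the idea (function name and signature are illustrative, not the library's API):

```python
import numpy as np

def min_p_filter(probs: np.ndarray, min_p: float) -> np.ndarray:
    """Zero out tokens below min_p * max(probs), then renormalize.

    A sketch of the min-p sampling idea; the library applies it inside
    its sampling pipeline rather than as a standalone function.
    """
    threshold = min_p * probs.max()
    filtered = np.where(probs >= threshold, probs, 0.0)
    return filtered / filtered.sum()

p = min_p_filter(np.array([0.5, 0.3, 0.15, 0.05]), min_p=0.2)
# With min_p=0.2 the cutoff is 0.2 * 0.5 = 0.1, so the 0.05 token
# is removed and the rest are renormalized.
```

Because the cutoff scales with the top token's probability, min-p adapts to how peaked the distribution is, unlike a fixed top-p or top-k cutoff.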
November 10, 2023
Fix sampling bug when logits_all=False
Andrei Betlen committed
Potential bugfix for eval
Andrei Betlen committed
Fix: default max_tokens matches openai api (16 for completion, max length for chat completion)
Andrei Betlen committed

November 8, 2023
Add set_seed to Llama class
Andrei Betlen committed
Fix destructor NoneType is not callable error
Andrei Betlen committed
Add JSON mode support. Closes #881
Andrei Betlen committed
Add seed parameter support for completion and chat_completion requests. Closes #884
Andrei Betlen committed
Multimodal Support (Llava 1.5) (#821)
Damian Stewart committed

November 6, 2023
Fix type bug
Andrei Betlen committed
Refactor Llama class internals
Andrei Betlen committed

November 3, 2023
Clean up stdout / stderr suppression
Andrei Betlen committed