COMMITS / llama_cpp/llama.py

July 20, 2023
- Merge pull request #481 from c0sogi/main (Andrei)
- Now the last token is sent when `stream=True` (Carlos Tejada)
July 16, 2023
- Added `RouteErrorHandler` for server (c0sogi)
July 15, 2023
- Re-order Llama class params (Andrei Betlen)
- Merge branch 'main' into custom_rope (Andrei Betlen)
July 9, 2023
- Add bindings for custom_rope (randoentity)
- bugfix: truncate completion max_tokens to fit context length by default (Andrei Betlen)
July 8, 2023
- Merge branch 'main' into add_unlimited_max_tokens (Andrei)
- Merge pull request #453 from wu-qing-157/main (Andrei)
- bugfix: fix compatibility bug with OpenAI API on last token (Andrei Betlen)
- perf: convert pointer to byref (Andrei Betlen)
July 7, 2023
- perf: avoid allocating new buffers during sampling (Andrei Betlen)
- perf: assign to candidates data structure instead (Andrei Betlen)
- fix indexing of token_logprobs after sorting (wu-qing-157)
June 29, 2023
- Hotfix: logits_all bug (Andrei Betlen)
- Load logits directly into scores buffer (Andrei Betlen)
- Use pre-allocated buffers to store input_ids and scores (Andrei Betlen)
- Free model when llama is unloaded. Closes #434 (Andrei Betlen)
June 26, 2023
- Merge branch 'main' of github.com:abetlen/llama_cpp_python into main (Andrei Betlen)
- Update llama.cpp (Andrei Betlen)
- Update type signature (Andrei Betlen)
- Merge branch 'main' into fix-state-pickle (Andrei)
June 24, 2023
- Only concatenate after all batches are done (samfundev)
June 23, 2023
- Merge branch 'main' into fix-state-pickle (Andrei)
June 17, 2023
- Update docs. Closes #386 (Andrei Betlen)
June 16, 2023
- Update llama.py: report the number of input tokens in the ValueError exception (imaprogrammer)
June 15, 2023
- Add low_vram parameter (Andrei Betlen)
June 13, 2023
- fix: Make LlamaState picklable for disk cache (Okabintaro)
June 10, 2023
- Re-enable cache (Andrei Betlen)
June 9, 2023
- Add support for logit_bias and logit_bias_type parameters (Tanner Hobson)
- Temporarily disable cache until save state bug is fixed (Andrei Betlen)
- Truncate max_tokens if it exceeds context length (Andrei Betlen)
June 8, 2023
- Fix breaking changes in the cache implementation (Andrei Betlen)
June 6, 2023
- Merge pull request #289 from Maximilian-Winter/main (Andrei)
- Fix resize issue. Closes #330 (Andrei Betlen)
May 31, 2023
- Added both LlamaCache classes, Disk and RAM (Maximilian-Winter)
- Merge branch 'abetlen:main' into main (Maximilian Winter)
May 28, 2023
- Diskcache implementation for llama state (Maximilian-Winter)
May 27, 2023
- Align dtype to match C structs (Andrei Betlen)
- Merge branch 'main' into add-numpy-support (Andrei Betlen)
- Fix stop sequence performance bug (Andrei Betlen)
- Remove usage of eval_tokens for cache check (Andrei Betlen)
- Replace eval_logits and eval_tokens with numpy arrays (Andrei Betlen)
May 26, 2023
- Add support for numpy (Andrei Betlen)
- Bugfix for logits_processor and stopping_criteria (Andrei Betlen)
- Add extra logits_processor and stopping_criteria (Andrei Betlen)
- Fix streaming hang on last token when cache is on (Andrei Betlen)