COMMITS: llama_cpp/llama.py

October 15, 2023
- Add validation for tensor_split size exceeding LLAMA_MAX_DEVICES (#820) (Eric Liu)
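The check this commit describes can be sketched as below. This is illustrative only: `validate_tensor_split` is a hypothetical helper, and `LLAMA_MAX_DEVICES` is a placeholder here (the real value is a constant exported by the compiled llama.cpp library).

```python
# Sketch: reject a tensor_split list with more entries than the number of
# devices the library supports.
LLAMA_MAX_DEVICES = 2  # placeholder; the real value comes from llama.cpp

def validate_tensor_split(tensor_split):
    if tensor_split is not None and len(tensor_split) > LLAMA_MAX_DEVICES:
        raise ValueError(
            f"tensor_split has {len(tensor_split)} entries but "
            f"LLAMA_MAX_DEVICES is {LLAMA_MAX_DEVICES}"
        )
    return tensor_split

validate_tensor_split([0.5, 0.5])  # fine: fits within the device limit
```

Failing fast here turns a confusing native-side crash into a clear Python exception.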
September 30, 2023
- Fix logits_all bug (Andrei Betlen)
- Fix bug in embedding (Andrei Betlen)
September 29, 2023
- Configurable Chat Formats (#711) (Andrei)
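A configurable chat format amounts to a registry mapping a format name to a function that renders a message list into a prompt string. The sketch below shows the pattern only; the names `register_chat_format` and `format_simple` are assumptions, not the library's actual API.

```python
# Sketch of a chat-format registry: each named format is a function that
# turns a list of {role, content} messages into a single prompt string.
CHAT_FORMATS = {}

def register_chat_format(name):
    def decorator(fn):
        CHAT_FORMATS[name] = fn
        return fn
    return decorator

@register_chat_format("simple")
def format_simple(messages):
    # Render each message as "role: content", then cue the assistant turn.
    lines = [f"{m['role']}: {m['content']}" for m in messages]
    return "\n".join(lines) + "\nassistant:"

prompt = CHAT_FORMATS["simple"]([{"role": "user", "content": "Hi"}])
```

A registry like this lets callers select a format by name at model-load time instead of hard-coding one prompt template.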
- Fix rope scaling defaults (#767) (Josh XT)
- Update llama.cpp (Andrei Betlen)
September 18, 2023
- Update llama.cpp (Andrei Betlen)
September 14, 2023
- Reorder init params to match llama.cpp order (Andrei Betlen)
- Explicitly make all init params other than model_path into keyword-only params (Andrei Betlen)
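Making everything after `model_path` keyword-only is a one-character change in the signature: a bare `*` separator. A simplified sketch (the parameter names shown are illustrative, not the full real signature):

```python
class Llama:
    # The bare "*" forces every parameter after model_path to be passed by
    # keyword, so positional call sites can't silently break when the
    # parameter order is later rearranged to match llama.cpp.
    def __init__(self, model_path, *, n_ctx=512, n_gpu_layers=0, **kwargs):
        self.model_path = model_path
        self.n_ctx = n_ctx
        self.n_gpu_layers = n_gpu_layers
        self.extra = kwargs  # catches unrecognized params instead of erroring

llm = Llama("model.gguf", n_ctx=2048)   # OK: keyword argument
# Llama("model.gguf", 2048)             # TypeError: positional not allowed
```

The `**kwargs` catch-all mirrors the "catch extra params" commit below: unknown arguments are absorbed rather than raising immediately.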
- Add kwargs to init to catch extra params (Andrei Betlen)
- Remove print (Andrei Betlen)
- Convert missed llama.cpp constants into standard Python types (Andrei Betlen)
- Fix tensor_split CLI option (Andrei Betlen)
September 12, 2023
- Merge branch 'main' into v0.2-wip (Andrei Betlen)
August 29, 2023
- CJK PR minor cleanup (Andrei Betlen)
- Merge pull request #309 from MeouSker77/fix-CJK (Andrei)
August 27, 2023
- Update llama.cpp (Andrei Betlen)
- Update llama.cpp (Andrei Betlen)
August 25, 2023
- Merge branch 'main' into v0.2-wip (Andrei Betlen)
- Use _with_model variants for tokenization (Andrei Betlen)
- Strip leading space when de-tokenizing (Andrei Betlen)
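SentencePiece-style tokenizers (as used by Llama models) encode a leading space into the first token, so a naive round trip turns "Hello" into " Hello". The stripping logic amounts to removing exactly one leading space; `strip_leading_space` below is a hypothetical helper name:

```python
def strip_leading_space(raw: str) -> str:
    # Remove exactly one leading space, if present, so detokenizing a
    # just-tokenized string returns the original text unchanged.
    return raw[1:] if raw.startswith(" ") else raw
```

Removing only one space (rather than calling `lstrip`) matters: genuine leading whitespace beyond the tokenizer's artifact must survive.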
August 24, 2023
- Remove deprecated params (Andrei Betlen)
- Merge branch 'main' into v0.2-wip (Andrei Betlen)
- Update llama.cpp (Andrei Betlen)
August 15, 2023
- Merge branch 'main' of github.com:abetlen/llama_cpp_python into main (Andrei Betlen)
- Remove unused import (Andrei Betlen)
August 13, 2023
- Make n_gpu_layers=-1 offload all layers (Billy Cao)
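The convention here is that a negative `n_gpu_layers` means "offload everything". One way to implement it is to clamp the requested count to the model's actual layer count; `resolve_n_gpu_layers` is a hypothetical helper sketching that idea, not the library's implementation:

```python
def resolve_n_gpu_layers(n_gpu_layers: int, n_model_layers: int) -> int:
    # Any negative value means "offload all layers to the GPU".
    if n_gpu_layers < 0:
        return n_model_layers
    # Requesting more layers than the model has is treated as "all".
    return min(n_gpu_layers, n_model_layers)
```

This spares users from having to look up a model's layer count before enabling full GPU offload.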
August 12, 2023
- Add docstring for n_gpu_layers argument (Billy Cao)
August 9, 2023
- Fix CJK output again (MeouSker77)
August 8, 2023
- Move grammar to function call argument (Andrei Betlen)
- Merge branch 'main' into c0sogi/main (Andrei Betlen)
- Add mul_mat_q option (Andrei Betlen)
August 7, 2023
- Reset grammar for every generation (c0sogi)
August 6, 2023
- Added grammar-based sampling (c0sogi)
July 28, 2023
- Suppress llama.cpp output when loading model (Andrei Betlen)
- Fix: annoying bug where attribute exceptions were drowning out file-not-found exceptions (Andrei Betlen)
July 25, 2023
- Change tensor_split from array to pointer (Shouyi Wang)
July 24, 2023
- Merge branch 'main' into v0.2-wip (Andrei Betlen)
- Add temporary rms_norm_eps parameter (Andrei Betlen)
- Add rms_eps_norm (Andrei Betlen)
- Add support for llama2 70b (bretello)
July 20, 2023
- Merge branch 'main' into v0.2-wip (Andrei Betlen)
- Merge pull request #481 from c0sogi/main (Andrei)
- Now the last token is sent when `stream=True` (Carlos Tejada)
July 19, 2023
- Add functions parameters (Andrei Betlen)
July 18, 2023
- Use numpy arrays for logits_processors and stopping_criteria. Closes #491 (Andrei Betlen)
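Passing scores around as NumPy arrays lets each logits processor operate vectorized rather than element by element on Python lists. A sketch of the calling convention, assuming a processor takes `(input_ids, scores)` and returns modified scores (the `ban_token` factory is illustrative, not part of the library):

```python
import numpy as np

def ban_token(token_id):
    # A logits processor: given the prompt ids and the score array, return
    # a new array in which one token can never be selected.
    def processor(input_ids: np.ndarray, scores: np.ndarray) -> np.ndarray:
        scores = scores.copy()          # leave the caller's array untouched
        scores[token_id] = -np.inf      # -inf score => zero sampling probability
        return scores
    return processor

scores = np.array([0.1, 0.9, 0.5], dtype=np.float32)
processed = ban_token(1)(np.array([0]), scores)
# After banning token 1, the argmax moves to token 2.
```

Since a processor receives the whole score array at once, operations like top-k masking or repetition penalties become a few array expressions instead of a Python loop over the vocabulary.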
July 16, 2023
- Added `RouteErrorHandler` for server (c0sogi)