COMMITS: llama_cpp/llama.py
November 3, 2023
Migrate inference to llama_batch and llama_decode api (#795)
Andrei committed
November 2, 2023
Update llama.cpp
Andrei Betlen committed
fix: tokenization of special characters (#850)
Antoine Lizee committed
November 1, 2023
llama: fix exception in Llama.__del__ (#846)
cebtenzzre committed
October 24, 2023
Update llama.cpp
Andrei Betlen committed
October 19, 2023
Fix streaming doesn't return finish reason (#798)
gmcgoldr committed
Update llama.cpp
Andrei Betlen committed
October 15, 2023
Make use of suppress_stdout_stderr when freeing model (#803)
Pierre Alexandre SCHEMBRI committed
Add validation for tensor_split size exceeding LLAMA_MAX_DEVICES (#820)
Eric Liu committed
September 30, 2023
Fix logits_all bug
Andrei Betlen committed
Fix bug in embedding
Andrei Betlen committed
September 29, 2023
Configurable Chat Formats (#711)
Andrei committed
Fix rope scaling defaults (#767)
Josh XT committed
Update llama.cpp
Andrei Betlen committed
September 18, 2023
Update llama.cpp
Andrei Betlen committed
September 14, 2023
Reorder init params to match llama.cpp order
Andrei Betlen committed
Explicitly make all init params other than model_path into keyword only params
Andrei Betlen committed
Add kwargs to init to catch extra params
Andrei Betlen committed
Remove print
Andrei Betlen committed
Convert missed llama.cpp constants into standard python types
Andrei Betlen committed
Fix tensor_split cli option
Andrei Betlen committed
September 12, 2023
Merge branch 'main' into v0.2-wip
Andrei Betlen committed
August 29, 2023
CJK PR minor cleanup
Andrei Betlen committed
Merge pull request #309 from MeouSker77/fix-CJK
Andrei committed
August 27, 2023
Update llama.cpp
Andrei Betlen committed
Update llama.cpp
Andrei Betlen committed
August 25, 2023
Merge branch 'main' into v0.2-wip
Andrei Betlen committed
Use _with_model variants for tokenization
Andrei Betlen committed
Strip leading space when de-tokenizing.
Andrei Betlen committed
August 24, 2023
Remove deprecated params
Andrei Betlen committed
Merge branch 'main' into v0.2-wip
Andrei Betlen committed
Update llama.cpp
Andrei Betlen committed
August 15, 2023
Merge branch 'main' of github.com:abetlen/llama_cpp_python into main
Andrei Betlen committed
Remove unused import
Andrei Betlen committed
August 13, 2023
make n_gpu_layers=-1 offload all layers
Billy Cao committed
August 12, 2023
Add doc string for n_gpu_layers argument
Billy Cao committed
August 9, 2023
Fix CJK output again
MeouSker77 committed
August 8, 2023
Move grammar to function call argument
Andrei Betlen committed
Merge branch 'main' into c0sogi/main
Andrei Betlen committed
Add mul_mat_q option
Andrei Betlen committed
August 7, 2023
Reset grammar for every generation
c0sogi committed
August 6, 2023
Added grammar based sampling
c0sogi committed
July 28, 2023
Suppress llama.cpp output when loading model.
Andrei Betlen committed
fix: annoying bug where attribute exceptions were drowning out file-not-found exceptions
Andrei Betlen committed
July 25, 2023
Change tensor_split from array to pointer
Shouyi Wang committed
July 24, 2023
Merge branch 'main' into v0.2-wip
Andrei Betlen committed