Commit Graph

  • 7f52335c50 feat: Update llama.cpp Andrei Betlen 2024-04-25 21:21:29 -04:00
  • 266abfc1a3 fix(ci): Fix metal tests as well Andrei Betlen 2024-04-25 03:09:46 -04:00
  • de37420fcf fix(ci): Fix python macos test runners issue Andrei Betlen 2024-04-25 03:08:32 -04:00
  • 2a9979fce1 feat: Update llama.cpp Andrei Betlen 2024-04-25 02:48:26 -04:00
  • c50d3300d2 chore: Bump version v0.2.64-metal v0.2.64-cu123 v0.2.64-cu122 v0.2.64-cu121 v0.2.64 Andrei Betlen 2024-04-23 02:53:20 -04:00
  • 611781f531 ci: Build arm64 wheels. Closes #1342 Andrei Betlen 2024-04-23 02:48:09 -04:00
  • 53ebcc8bb5 feat(server): Provide ability to dynamically allocate all threads if desired using -1 (#1364) Sean Bailey 2024-04-23 02:35:38 -04:00
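    The commit above lets the server treat `-1` as "use every available CPU". The resolution logic presumably amounts to mapping the sentinel value to the host's CPU count; a minimal sketch of that idea (the function name here is hypothetical, not the server's actual helper):

    ```python
    import multiprocessing

    def resolve_thread_count(n_threads: int) -> int:
        """Map the sentinel value -1 to all available CPUs; pass other values through."""
        if n_threads == -1:
            return multiprocessing.cpu_count()
        return n_threads
    ```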
  • 507c1da066 fix: Update scikit-build-core build dependency avoid bug in 0.9.1 (#1370) Geza Velkey 2024-04-23 08:34:15 +02:00
  • 8559e8ce88 feat: Add Llama-3 chat format (#1371) abk16 2024-04-23 06:33:29 +00:00
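    For context on the Llama-3 chat format added above: Llama 3 instruct models use a header/`<|eot_id|>` token scheme rather than the older `[INST]` markers. A hedged sketch of what rendering messages into that template looks like (this is an illustration of the prompt layout, not the library's registered formatter):

    ```python
    def format_llama3_prompt(messages: list[dict]) -> str:
        """Render chat messages in the Llama 3 instruct layout:
        <|start_header_id|>role<|end_header_id|>\n\ncontent<|eot_id|>,
        ending with an open assistant header for the model to complete."""
        prompt = "<|begin_of_text|>"
        for msg in messages:
            prompt += (
                f"<|start_header_id|>{msg['role']}<|end_header_id|>\n\n"
                f"{msg['content']}<|eot_id|>"
            )
        prompt += "<|start_header_id|>assistant<|end_header_id|>\n\n"
        return prompt
    ```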
  • 617d536e1c feat: Update llama.cpp Andrei Betlen 2024-04-23 02:31:40 -04:00
  • d40a250ef3 feat: Use new llama_token_is_eog in create_completions Andrei Betlen 2024-04-22 00:35:47 -04:00
  • b21ba0e2ac Merge branch 'main' of https://github.com/abetlen/llama-cpp-python into main Andrei Betlen 2024-04-21 20:46:42 -04:00
  • 159cc4e5d9 feat: Update llama.cpp Andrei Betlen 2024-04-21 20:46:40 -04:00
  • 0281214863 chore: Bump version v0.2.63-metal v0.2.63-cu123 v0.2.63-cu122 v0.2.63-cu121 v0.2.63 Andrei Betlen 2024-04-20 00:09:37 -04:00
  • cc81afebf0 feat: Add stopping_criteria to ChatFormatter, allow stopping on arbitrary token ids, fixes llama3 instruct Andrei Betlen 2024-04-20 00:00:53 -04:00
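    Stopping on arbitrary token ids, as added above, matters for Llama 3 instruct because generation should halt on its end-of-turn token rather than only on the classic EOS. A minimal sketch of such a criterion as a callable over the tokens generated so far (the factory name and signature here are illustrative, not the library's `StoppingCriteria` API):

    ```python
    from typing import Callable, List, Optional, Set

    def make_token_stopper(stop_token_ids: List[int]) -> Callable[..., bool]:
        """Build a stopping criterion that fires when the most recently
        generated token is one of the given stop ids."""
        stop_set: Set[int] = set(stop_token_ids)

        def should_stop(tokens: List[int], logits: Optional[list] = None) -> bool:
            return bool(tokens) and tokens[-1] in stop_set

        return should_stop
    ```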
  • d17c1887a3 feat: Update llama.cpp Andrei Betlen 2024-04-19 23:58:16 -04:00
  • 893a27a736 chore: Bump version v0.2.62-metal v0.2.62-cu123 v0.2.62-cu122 v0.2.62-cu121 v0.2.62 Andrei Betlen 2024-04-18 01:43:39 -04:00
  • a128c80500 feat: Update llama.cpp Andrei Betlen 2024-04-18 01:39:45 -04:00
  • 4f42664955 feat: update grammar schema converter to match llama.cpp (#1353) Lucca Zenóbio 2024-04-18 02:36:25 -03:00
  • fa4bb0cf81 Revert "feat: Update json to grammar (#1350)" Andrei Betlen 2024-04-17 16:18:16 -04:00
  • 610a592f70 feat: Update json to grammar (#1350) Lucca Zenóbio 2024-04-17 11:10:21 -03:00
  • b73c73c0c6 feat: add disable_ping_events flag (#1257) khimaros 2024-04-17 14:08:19 +00:00
  • 4924455dec feat: Make saved state more compact on-disk (#1296) tc-wolf 2024-04-17 09:06:50 -05:00
  • 9842cbf99d feat: Update llama.cpp Andrei Betlen 2024-04-17 10:06:15 -04:00
  • c96b2daebf feat: Use all available CPUs for batch processing (#1345) ddh0 2024-04-17 09:04:33 -05:00
  • a420f9608b feat: Update llama.cpp Andrei Betlen 2024-04-14 19:14:09 -04:00
  • 90dceaba8a feat: Update llama.cpp Andrei Betlen 2024-04-14 11:35:57 -04:00
  • 2e9ffd28fd feat: Update llama.cpp Andrei Betlen 2024-04-12 21:09:12 -04:00
  • ef29235d45 chore: Bump version v0.2.61-metal v0.2.61-cu123 v0.2.61-cu122 v0.2.61-cu121 v0.2.61 Andrei Betlen 2024-04-10 03:44:46 -04:00
  • bb65b4d764 fix: pass correct type to chat handlers for chat completion logprobs Andrei Betlen 2024-04-10 03:41:55 -04:00
  • 060bfa64d5 feat: Add support for yaml based configs Andrei Betlen 2024-04-10 02:47:01 -04:00
  • 1347e1d050 feat: Add typechecking for ctypes structure attributes Andrei Betlen 2024-04-10 02:40:41 -04:00
  • 889d0e8981 feat: Update llama.cpp Andrei Betlen 2024-04-10 02:25:58 -04:00
  • 56071c956a feat: Update llama.cpp Andrei Betlen 2024-04-09 09:53:49 -04:00
  • 08b16afe11 chore: Bump version v0.2.60-metal v0.2.60-cu123 v0.2.60-cu122 v0.2.60-cu121 v0.2.60 Andrei Betlen 2024-04-06 01:53:38 -04:00
  • 7ca364c8bd feat: Update llama.cpp Andrei Betlen 2024-04-06 01:37:43 -04:00
  • b3bfea6dbf fix: Always embed metal library. Closes #1332 Andrei Betlen 2024-04-06 01:36:53 -04:00
  • f4092e6b46 feat: Update llama.cpp Andrei Betlen 2024-04-05 10:59:31 -04:00
  • 2760ef6156 Merge branch 'main' of https://github.com/abetlen/llama-cpp-python into main Andrei Betlen 2024-04-05 10:51:54 -04:00
  • 1ae3abbcc3 fix: missing logprobs in response, incorrect response type for functionary, minor type issues. Closes #1328 Closes #1314 Andrei Betlen 2024-04-05 10:50:49 -04:00
  • 49bc66bfa2 fix: missing logprobs in response, incorrect response type for functionary, minor type issues. Closes #1328 #1314 Andrei Betlen 2024-04-05 10:50:49 -04:00
  • 9111b6e03a feat: Update llama.cpp Andrei Betlen 2024-04-05 09:21:02 -04:00
  • 7265a5dc0e fix(docs): incorrect tool_choice example (#1330) Sigbjørn Skjæret 2024-04-05 15:14:03 +02:00
  • 909ef66951 docs: Rename cuBLAS section to CUDA Andrei Betlen 2024-04-04 03:08:47 -04:00
  • 1db3b58fdc docs: Add docs explaining how to install pre-built wheels. Andrei Betlen 2024-04-04 02:57:06 -04:00
  • c50309e52a docs: LLAMA_CUBLAS -> LLAMA_CUDA Andrei Betlen 2024-04-04 02:49:19 -04:00
  • 612e78d322 fix(ci): use correct script name Andrei Betlen 2024-04-03 16:15:29 -04:00
  • 34081ddc5b chore: Bump version v0.2.59-metal v0.2.59-cu123 v0.2.59-cu122 v0.2.59-cu121 v0.2.59 Andrei Betlen 2024-04-03 15:38:27 -04:00
  • 368061c04a Merge branch 'main' of https://github.com/abetlen/llama-cpp-python into main Andrei Betlen 2024-04-03 15:35:30 -04:00
  • 5a5193636b feat: Update llama.cpp Andrei Betlen 2024-04-03 15:35:28 -04:00
  • 5a930ee9a1 feat: Binary wheels for CPU, CUDA (12.1 - 12.3), Metal (#1247) Andrei 2024-04-03 15:32:13 -04:00
  • b5374e9273 Merge branch 'main' into binary-wheels binary-wheels Andrei 2024-04-03 15:31:08 -04:00
  • 8649d7671b fix: segfault when logits_all=False. Closes #1319 Andrei Betlen 2024-04-03 15:30:31 -04:00
  • 6f72de1382 Update workflow name Andrei Betlen 2024-04-03 14:59:46 -04:00
  • 3fcfa8b13c Update generate index workflow Andrei Betlen 2024-04-03 14:58:55 -04:00
  • cdf7be7a44 Add workflows to build CUDA and Metal wheels Andrei Betlen 2024-04-03 02:10:07 -04:00
  • 79de49514a Merge branch 'main' into binary-wheels Andrei Betlen 2024-04-03 01:10:16 -04:00
  • f96de6d920 Merge branch 'main' of https://github.com/abetlen/llama-cpp-python into main Andrei Betlen 2024-04-03 00:55:21 -04:00
  • e465157804 feat: Update llama.cpp Andrei Betlen 2024-04-03 00:55:19 -04:00
  • 62aad610e1 fix: last tokens passing to sample_repetition_penalties function (#1295) Yuri Mikhailov 2024-04-02 04:25:43 +09:00
  • 45bf5ae582 chore: Bump version v0.2.58 Andrei Betlen 2024-04-01 10:28:22 -04:00
  • a0f373e310 fix: Changed local API doc references to hosted (#1317) lawfordp2017 2024-04-01 08:21:00 -06:00
  • f165048a69 feat: add support for KV cache quantization options (#1307) Limour 2024-04-01 22:19:28 +08:00
  • aa9f1ae011 feat: Add logprobs support to chat completions (#1311) windspirit95 2024-04-01 02:30:13 +09:00
  • 1e60dba082 feat: Update llama.cpp Andrei Betlen 2024-03-29 13:34:23 -04:00
  • dcbe57fcf8 feat: Update llama.cpp Andrei Betlen 2024-03-29 12:45:27 -04:00
  • 125b2358c9 feat: Update llama.cpp Andrei Betlen 2024-03-28 12:06:46 -04:00
  • 901fe02461 feat: Update llama.cpp Andrei Betlen 2024-03-26 22:58:53 -04:00
  • b64fa4e2c0 feat: Update llama.cpp Andrei Betlen 2024-03-25 23:09:07 -04:00
  • a93b9149f8 feat: Update llama.cpp Andrei Betlen 2024-03-25 11:10:14 -04:00
  • 364678bde5 feat: Update llama.cpp Andrei Betlen 2024-03-24 12:27:49 -04:00
  • d11ccc3036 fix(server): minor type fixes Andrei Betlen 2024-03-23 17:14:15 -04:00
  • c1325dcdfb fix: tool_call missing first token. Andrei Betlen 2024-03-22 23:44:04 -04:00
  • e325a831f0 feat: Update llama.cpp Andrei Betlen 2024-03-22 23:43:29 -04:00
  • c89be28ef9 feat: Update llama.cpp Andrei Betlen 2024-03-20 20:50:47 -04:00
  • 3db03b7302 feat: Update llama.cpp Andrei Betlen 2024-03-20 13:27:43 -04:00
  • 740f3f3812 fix: set LLAMA_METAL_EMBED_LIBRARY=on on MacOS arm64 (#1289) bretello 2024-03-20 17:46:09 +01:00
  • f7decc9562 docs: Add chat examples to openapi ui Andrei Betlen 2024-03-19 10:52:53 -04:00
  • 60d8498f21 feat: Add tools/functions variables to Jinja2ChatFormatter, add function response formatting for all simple chat formats (#1273) Andrei 2024-03-19 04:55:57 -04:00
  • 18d7ce918f feat: Update llama.cpp Andrei Betlen 2024-03-19 04:40:24 -04:00
  • 7d4a5ec59f Merge branch 'main' of https://github.com/abetlen/llama-cpp-python into main v0.2.57 Andrei Betlen 2024-03-18 11:37:33 -04:00
  • bf64752535 chore: Bump version Andrei Betlen 2024-03-18 11:37:30 -04:00
  • 8a60c7bc8c fix: Fix and optimize functionary chat handler (#1282) Jeffrey Fong 2024-03-18 22:40:57 +08:00
  • 8d298b4750 feat: Update llama.cpp Andrei Betlen 2024-03-18 10:26:36 -04:00
  • 6eb25231e4 feat: Update llama.cpp Andrei Betlen 2024-03-15 12:58:45 -04:00
  • 20e6815252 fix: json mode Andrei Betlen 2024-03-15 12:58:34 -04:00
  • e9d1b8d7be fallback to get_embeddings_ith fix-embeddings-for-non-embedding-models Andrei Betlen 2024-03-14 12:02:24 -04:00
  • 1a9b8af2dd feat: Update llama.cpp Andrei Betlen 2024-03-14 11:46:48 -04:00
  • 4084aabe86 fix: set default pooling type to unspecified Andrei Betlen 2024-03-14 10:04:57 -04:00
  • d318cc8b83 fix: Set default pooling_type to mean, check for null pointer. Andrei Betlen 2024-03-14 09:17:41 -04:00
  • dd0ee56217 feat: Update llama.cpp Andrei Betlen 2024-03-13 15:57:35 -04:00
  • 08e910f7a7 feat: Update llama.cpp Andrei Betlen 2024-03-10 23:45:05 -04:00
  • a7281994d8 chore: Bump version v0.2.56 Andrei Betlen 2024-03-08 21:14:44 -05:00
  • 919fca9f2b Merge branch 'main' of https://github.com/abetlen/llama-cpp-python into main Andrei Betlen 2024-03-08 21:10:56 -05:00
  • d02a9cf16f Fixed json strings grammar by blacklisting character control set. Closes #1259 Andrei Betlen 2024-03-08 21:10:53 -05:00
  • c139f8b5d5 feat: Add endpoints for tokenize, detokenize and count tokens (#1136) Felipe Lorenz 2024-03-08 21:09:00 -05:00
  • 1f3156d4f2 fix: Check for existence of clip model path (#1264) Kevin Cao 2024-03-08 21:00:10 -05:00
  • 2811014bae feat: Switch embed to llama_get_embeddings_seq (#1263) Douglas Hanley 2024-03-08 19:59:35 -06:00
  • 40c6b54f68 feat: Update llama.cpp Andrei Betlen 2024-03-08 20:58:50 -05:00
  • 93dc56ace8 Update llama.cpp Andrei Betlen 2024-03-06 01:32:00 -05:00