Commit Graph

  • 22bc1e8a36 Use official cibuildwheel action Andrei Betlen 2024-03-04 13:13:06 -05:00
  • af42fb7ab6 Update download label Andrei Betlen 2024-03-04 12:49:42 -05:00
  • 5c501c00b2 Add total release downloads badge Andrei Betlen 2024-03-04 12:42:03 -05:00
  • 3dd2616d60 Generate binary wheel index on release Andrei Betlen 2024-03-03 11:43:37 -05:00
  • 87a6e5797e feat: Update llama.cpp Andrei Betlen 2024-03-03 11:27:04 -05:00
  • 13177aae0f chore: Bump version v0.2.55 Andrei Betlen 2024-03-02 22:46:40 -05:00
  • 663659f730 docs: fix small typo in README: 'model know how' -> 'model knows how' (#1244) Kenneth Hoste 2024-03-03 04:20:41 +01:00
  • 0e70984fb6 feat: Update llama.cpp Andrei Betlen 2024-03-02 22:20:04 -05:00
  • d5df431278 chore: Bump version v0.2.54 Andrei Betlen 2024-03-01 13:15:16 -05:00
  • 97aa3a153d docs: Add information re: auto chat formats. Closes #1236 Andrei Betlen 2024-03-01 13:10:25 -05:00
  • f062a7f51d feat: Update llama.cpp Andrei Betlen 2024-03-01 12:57:16 -05:00
  • cf1fdd8a9a docs: fix typo in README.md embeddings example. (#1232) Douglas Hanley 2024-02-29 12:55:50 -06:00
  • 8c71725d53 fix: Remove deprecated cfg sampling functions Andrei Betlen 2024-02-28 14:37:07 -05:00
  • 727d60c28a misc: Format Andrei Betlen 2024-02-28 14:27:40 -05:00
  • 0d37ce52b1 feat: Update llama.cpp Andrei Betlen 2024-02-28 14:27:16 -05:00
  • ffcd4b2636 chore: Bump version v0.2.53 Andrei Betlen 2024-02-28 01:38:32 -05:00
  • c36ab15e68 fix: eos/bos_token set correctly for Jinja2ChatFormatter and automatic chat formatter (#1230) Sigbjørn Skjæret 2024-02-28 07:30:31 +01:00
  • fea33c9b94 feat: Update llama.cpp Andrei Betlen 2024-02-27 12:22:17 -05:00
  • 4d574bd765 feat(server): Add support for pulling models from Huggingface Hub (#1222) Andrei 2024-02-26 14:35:08 -05:00
  • b3e358dee4 docs: Add example of local image loading to README Andrei Betlen 2024-02-26 11:58:33 -05:00
  • afe1e445c9 chore: Bump version v0.2.52 Andrei Betlen 2024-02-26 11:43:24 -05:00
  • 9558ce7878 feat: Update llama.cpp Andrei Betlen 2024-02-26 11:40:58 -05:00
  • a57d5dff86 feat: Update llama.cpp Andrei Betlen 2024-02-26 11:37:43 -05:00
  • 79c649c2d1 docs: Update multimodal example Andrei Betlen 2024-02-26 11:34:45 -05:00
  • bf315ee7a9 docs: Update multimodal example Andrei Betlen 2024-02-26 11:32:11 -05:00
  • dbaba3059d fix: positional arguments only for low-level api Andrei Betlen 2024-02-26 11:31:11 -05:00
  • 78e536dcfe fix: typo Andrei Betlen 2024-02-26 11:14:26 -05:00
  • 44558cbd7a misc: llava_cpp use ctypes function decorator for binding Andrei Betlen 2024-02-26 11:07:33 -05:00
  • 8383a9e562 fix: llava this function takes at least 4 arguments (0 given) Andrei Betlen 2024-02-26 11:03:20 -05:00
  • 34111788fe feat: Update llama.cpp Andrei Betlen 2024-02-26 10:58:41 -05:00
  • 5fc4c1efb6 Merge branch 'main' of https://github.com/abetlen/llama-cpp-python into main v0.2.51 Andrei Betlen 2024-02-25 21:15:54 -05:00
  • 8e03fd9957 chore: Bump version Andrei Betlen 2024-02-25 21:14:01 -05:00
  • e857c133fb feat: Update llama.cpp Andrei Betlen 2024-02-25 21:14:01 -05:00
  • 252e1ff2b4 docs(examples): Add huggingface pull example Andrei Betlen 2024-02-25 21:09:41 -05:00
  • bd4ec2e612 docs(examples): Add gradio chat example Andrei Betlen 2024-02-25 21:09:13 -05:00
  • dcf38f6141 fix: remove prematurely committed change Andrei Betlen 2024-02-25 21:00:37 -05:00
  • cbbcd888af feat: Update llama.cpp Andrei Betlen 2024-02-25 20:52:14 -05:00
  • 19234aa0db fix: Restore type hints for low-level api Andrei Betlen 2024-02-25 16:54:37 -05:00
  • 2292af5796 feat: Update llama.cpp Andrei Betlen 2024-02-25 16:53:58 -05:00
  • 221edb9ef1 feat: Update llama.cpp Andrei Betlen 2024-02-24 23:47:29 -05:00
  • 20ea6fd7d6 chore: Bump version v0.2.50 Andrei Betlen 2024-02-23 12:38:36 -05:00
  • b681674bf2 docs: Fix functionary repo_id Andrei Betlen 2024-02-23 12:36:13 -05:00
  • f94faab686 Merge branch 'main' of https://github.com/abetlen/llama-cpp-python into main Andrei Betlen 2024-02-23 12:34:03 -05:00
  • 702306b381 docs: Restore functionary docs in README Andrei Betlen 2024-02-23 12:34:02 -05:00
  • bce6dc0ac2 docs: Update Functionary OpenAI Server Readme (#1193) Jeffrey Fong 2024-02-24 01:24:10 +08:00
  • 47bad30dd7 fix: LlamaHFTokenizer now receives pre_tokens Andrei Betlen 2024-02-23 12:23:24 -05:00
  • ded5d627a5 chore: Bump version v0.2.49 Andrei Betlen 2024-02-23 11:32:43 -05:00
  • 858496224e feat: Auto detect Mixtral's slightly different format (#1214) Luke Stanley 2024-02-23 16:27:38 +00:00
  • db776a885c fix: module 'llama_cpp.llama_cpp' has no attribute 'c_uint8' Andrei Betlen 2024-02-23 11:24:53 -05:00
  • 427d816ebf chore: Bump version v0.2.48 Andrei Betlen 2024-02-23 04:54:08 -05:00
  • 52d9d70076 docs: Update README.md to fix pip install llama cpp server (#1187) Aditya Purandare 2024-02-23 15:11:22 +05:30
  • 251a8a2cad feat: Add Google's Gemma formatting via chat_format="gemma" (#1210) Alvaro Bartolome 2024-02-23 18:40:52 +09:00
  • eebb102df7 feat: Update llama.cpp Andrei Betlen 2024-02-23 03:42:08 -05:00
  • 5f96621e92 misc: only search tests folder for tests Andrei Betlen 2024-02-23 03:40:25 -05:00
  • b9aca612af misc: use typesafe byref for internal classes Andrei Betlen 2024-02-23 03:40:07 -05:00
  • a0ce429dc0 misc: use decorator to bind low level api functions, fixes docs Andrei Betlen 2024-02-23 03:39:38 -05:00
  • 410e02da51 docs: Fix typo Andrei Betlen 2024-02-23 00:43:31 -05:00
  • eb56ce2e2a docs: fix low-level api example Andrei Betlen 2024-02-22 11:33:05 -05:00
  • 0f8cad6cb7 docs: Update README Andrei Betlen 2024-02-22 11:31:44 -05:00
  • 045cc12670 docs: Update README Andrei Betlen 2024-02-22 03:53:52 -05:00
  • e10af30cf1 fix: TypeAlias import error Andrei Betlen 2024-02-22 03:27:28 -05:00
  • 3561ebf536 Merge branch 'main' of https://github.com/abetlen/llama-cpp-python into main Andrei Betlen 2024-02-22 03:25:13 -05:00
  • 32efed7b07 docs: Update README Andrei Betlen 2024-02-22 03:25:11 -05:00
  • d80c5cf29d docs: fix indentation for mkdocs-material Andrei Betlen 2024-02-22 02:30:24 -05:00
  • aefcb8f71a misc: additional type annotations for low level api Andrei Betlen 2024-02-22 02:00:09 -05:00
  • 3921e10770 feat: support minItems/maxItems in JSON grammar converter (by @nopperl) Andrei Betlen 2024-02-22 00:17:06 -05:00
  • e6d6260a91 fix: Update from_pretrained defaults to match hf_hub_download Andrei Betlen 2024-02-22 00:10:23 -05:00
  • dd22010e85 fix: Raise exceptions when llama model or context fails to load Andrei Betlen 2024-02-22 00:09:45 -05:00
  • 3632241e98 chore: Bump version v0.2.47 Andrei Betlen 2024-02-21 23:09:13 -05:00
  • 0653e15c20 feat: Update llama.cpp Andrei Betlen 2024-02-21 23:04:52 -05:00
  • 7981e9ce1e chore: Bump version v0.2.46 Andrei Betlen 2024-02-21 16:30:59 -05:00
  • 7f3962e11c feat: Update llama.cpp Andrei Betlen 2024-02-21 16:27:56 -05:00
  • 14191e9036 docs: Add create_chat_completion_openai_v1 to api reference Andrei Betlen 2024-02-21 16:26:49 -05:00
  • fe5626cd40 misc: add .local pattern to gitignore Andrei Betlen 2024-02-21 16:26:30 -05:00
  • 7f51b6071f feat(low-level-api): Improve API static type-safety and performance (#1205) Andrei 2024-02-21 16:25:38 -05:00
  • 0f8aa4ab5c feat: Pull models directly from huggingface (#1206) Andrei 2024-02-21 16:25:10 -05:00
  • e42f62c247 chore: Bump version v0.2.45 Andrei Betlen 2024-02-21 11:09:40 -05:00
  • 4edde21b3d feat: Update llama.cpp Andrei Betlen 2024-02-21 11:05:58 -05:00
  • f57b01ac9b ci: add debug build to dev makefile Andrei Betlen 2024-02-21 11:04:30 -05:00
  • 04fe33b999 feat: Update llama.cpp Andrei Betlen 2024-02-20 02:59:02 -05:00
  • d122bd7858 feat: Update llama.cpp Andrei Betlen 2024-02-19 22:10:16 -05:00
  • 6225f027e5 feat: Update llama.cpp Andrei Betlen 2024-02-19 04:11:34 -05:00
  • 748c0ce057 feat: Update llama.cpp Andrei Betlen 2024-02-18 21:30:36 -05:00
  • 53f6f5f415 fix: self.numa missing Andrei Betlen 2024-02-17 01:02:33 -05:00
  • fdce078cb9 feat: Update llama.cpp Andrei Betlen 2024-02-17 00:37:51 -05:00
  • c2a234a086 docs: Add embeddings section Andrei Betlen 2024-02-15 23:15:50 -05:00
  • f736827b9b chore: Bump version v0.2.44 Andrei Betlen 2024-02-15 23:10:50 -05:00
  • 0ce66bc080 fix: create_embedding broken response for input type str Andrei Betlen 2024-02-15 16:09:48 -05:00
  • ea1f88dd29 fix: Use '\n' separator for EventSourceResponse (#1188) khimaros 2024-02-15 20:20:13 +00:00
  • a5cfeb7763 feat: Update llama.cpp Andrei Betlen 2024-02-15 15:17:30 -05:00
  • 7bb91f025f fix: Incorporate embedding pooling layer fixes (#1194) Douglas Hanley 2024-02-15 14:16:30 -06:00
  • ae71ad1a14 Bump version v0.2.43 Andrei Betlen 2024-02-14 04:31:42 -05:00
  • f300d4310a Merge branch 'main' of https://github.com/abetlen/llama-cpp-python into main Andrei Betlen 2024-02-14 04:27:33 -05:00
  • c336f78269 Update llama.cpp Andrei Betlen 2024-02-14 04:27:30 -05:00
  • d7a67917ba feat: Support batch embeddings (#1186) Douglas Hanley 2024-02-14 03:26:09 -06:00
  • 36b843228f misc: fix makefile build commands Andrei Betlen 2024-02-14 03:47:40 -05:00
  • 7b9960d1cb Update llama.cpp Andrei Betlen 2024-02-14 03:47:21 -05:00
  • 6943bab6d8 fix: destructor exception where internal classes are missing some uninitialized attributes Andrei Betlen 2024-02-14 03:38:41 -05:00
  • 07a783779a fix: Update openbuddy prompt format. Closes #1155 Andrei Betlen 2024-02-13 23:57:10 -05:00
  • 7a79e5ac49 Update llama.cpp Andrei Betlen 2024-02-13 23:54:05 -05:00