Commit Graph

  • 9e429c47e1 cmake : fix ARM feature verification (llama/17170) Adrien Gallouët 2025-11-17 21:37:29 +01:00
  • bb88c2545f ggml : add missing AVX512 feature checks (llama/17270) Adrien Gallouët 2025-11-17 12:12:00 +01:00
  • 418314941e ggml : remove dirty flag from version string (ggml/1391) Daniel Bevenius 2025-11-24 12:51:50 +01:00
  • 9f5ed26e43 go : Enable VAD for Go bindings (#3563) Josh Montoya 2025-12-10 03:31:36 -08:00
  • a8f45ab11d go : reset context.n in Process() (#3503) Josh Montoya 2025-12-08 08:33:07 -08:00
  • a88b93f85f vad : fix buffer overflow in sample reduction loop (#3558) Joseph Sellers 2025-12-06 11:28:32 +00:00
  • d566358a1d tests : update VAD tests to use Silero V6.2.0 (#3534) Daniel Bevenius 2025-12-06 10:58:58 +01:00
  • 19ceec8eac examples : fix typo in vad-speech-segments command [no ci] (#3535) Daniel Bevenius 2025-11-20 13:35:11 +01:00
  • 40e788a5d1 readme : minor (#3516) gzq 2025-11-20 19:57:55 +08:00
  • 961aec7384 metal : fix compile on macos 11 (#3533) YangLe 2025-11-20 19:54:54 +08:00
  • 7d79ef9fb0 Initial plan copilot/add-duplicate-text-removal copilot-swe-agent[bot] 2025-11-18 10:37:04 +00:00
  • b12abefa9b sync : llama.cpp Georgi Gerganov 2025-11-17 16:31:08 +02:00
  • 0e5deca8e2 sync : ggml Georgi Gerganov 2025-11-17 16:26:39 +02:00
  • 661567357c metal : support I32 -> I32 copy (llama/17317) Georgi Gerganov 2025-11-17 11:52:00 +02:00
  • 74bb8a8b23 metal : faster argsort (llama/17315) Georgi Gerganov 2025-11-17 11:51:48 +02:00
  • 57c0e6f8b6 metal : add cumsum (llama/17305) Georgi Gerganov 2025-11-17 11:51:13 +02:00
  • d3f5487464 CANN: Use smart pointers to manage ACL objects (llama/17238) hipudding 2025-11-17 08:43:59 +08:00
  • 9d95d9a1ee vulkan: add LOG operation support for F32 and F16 (llama/17183) Pavels Zaicenkovs 2025-11-16 22:50:09 +01:00
  • f571655e8e vulkan: fix MMQ quantize_y condition (llama/17301) Ruben Ortlam 2025-11-16 19:38:17 +01:00
  • 9549cc1051 metal : remove obsolete asserts (llama/17295) Georgi Gerganov 2025-11-16 09:50:26 +02:00
  • a75525cad0 opencl: fix rms_norm_mul (llama/17250) lhez 2025-11-15 17:40:14 -08:00
  • c78845bfa9 opencl: add kernel to handle mat mul in attention to improve encoding speed (llama/17181) shaofeiqi 2025-11-15 17:33:10 -08:00
  • 1fd63da9f2 sycl : unify unary kernels with a generic implementation and enable wide operator support (llama/17213) shani-f 2025-11-16 01:52:42 +02:00
  • ea3ebd8b0d vulkan: Fuse mul_mat_id+add_id+mul and mul_mat+add+add. (llama/17287) Jeff Bolz 2025-11-15 12:54:23 -06:00
  • 7caea54450 vulkan: Replace 16-bit unpack8 calls to work around legacy Windows AMD driver bug (llama/17285) Ruben Ortlam 2025-11-15 15:18:58 +01:00
  • 4c4e663da0 vulkan: implement ABS and NEG (llama/17245) Giuseppe Scrivano 2025-11-15 12:00:29 +01:00
  • e1846fc599 vulkan: Use ggml_vk_tensor_subbuffer in mul_mat_vec(id) paths (llama/17244) Jeff Bolz 2025-11-15 04:56:15 -06:00
  • 9614a56314 vulkan: skip all-negative-inf blocks in FA (llama/17186) Jeff Bolz 2025-11-15 03:37:25 -06:00
  • 37d4bba152 vulkan: change graph_compute to be async and enable get_tensor_async (llama/17158) Jeff Bolz 2025-11-15 02:06:41 -06:00
  • 523a6c27ea metal : support argsort for ne00 > 1024 (llama/17247) Georgi Gerganov 2025-11-14 09:36:06 +02:00
  • b4d7df3ba2 metal : make the FA extra sizes consistent (llama/17143) Georgi Gerganov 2025-11-14 09:13:34 +02:00
  • a81fbfc78e ggml-cpu: handle 3d tensors in repack mat_mul (llama/17241) Alberto Cabrera Pérez 2025-11-13 20:53:00 +00:00
  • 3e684f26c1 ggml : add ops SOFTPLUS, EXPM1, TRI, SOLVE_TRI, CUMSUM (llama/17063) Piotr Wilkin (ilintar) 2025-11-13 19:54:47 +01:00
  • e8e0004fe5 vulkan: remove shell call from vulkan-shaders-gen tool, revert file check (llama/17219) Ruben Ortlam 2025-11-13 14:51:21 +01:00
  • 210f0f860b sched : fix reserve ignoring user tensor assignments (llama/17232) Diego Devesa 2025-11-13 04:14:02 -08:00
  • 91fa5b5cac ggml-cpu : add RISC-V vector intrinsic support for silu and cvar operations (llama/17227) ixgbe 2025-11-13 20:13:32 +08:00
  • 265d326fa8 metal: accelerated conv2d (llama/17175) bagheera 2025-11-13 05:32:44 -06:00
  • 6a1d830dfd Revert "ggml-cpu: handle 3d tensors in repack mat_mul (llama/17030)" (llama/17233) Georgi Gerganov 2025-11-13 12:59:37 +02:00
  • 6a91780c3b ggml-cpu : use template for argsort (llama/17222) Diego Devesa 2025-11-13 00:59:05 -08:00
  • 726912d1cb CANN: Add cross_entropy_loss op support (llama/16886) TecJesh 2025-11-13 09:39:51 +08:00
  • 84275fc493 CUDA: fuse rope + set_rows (llama/16884) Aman Gupta 2025-11-13 08:50:01 +08:00
  • 566c4c4469 CUDA: static assert to prevent misuse of memcpy_1 (llama/17198) Johannes Gäßler 2025-11-12 23:13:55 +01:00
  • 3810a6180b ggml : use std::sort in ggml_argsort CPU implementation (llama/17211) Georgi Gerganov 2025-11-12 20:43:38 +02:00
  • 7df8515824 ggml-cpu: handle 3d tensors in repack mat_mul (llama/17030) Alberto Cabrera Pérez 2025-11-12 12:52:19 +00:00
  • e8b66d9f94 CANN: Add L2_NORM op support (llama/16856) TecJesh 2025-11-12 15:11:42 +08:00
  • 8388350c66 fix ci crash about SSM_CONV (llama/17169) Neo Zhang Jianyu 2025-11-12 14:44:29 +08:00
  • 6748d27f55 hexagon: various Op fixes (llama/17135) Max Krasnyansky 2025-11-11 15:25:04 -08:00
  • 559091005a disable rms norm mul rope for chips with no fp16 rte (llama/17134) Eve 2025-11-11 18:53:30 +00:00
  • cd8f64d1b5 ggml-cpu : add RISC-V RVV (Zvfh) optimization for FP16 to FP32 conversion (llama/17161) ixgbe 2025-11-11 19:41:51 +08:00
  • 1cefb03571 ggml-cpu: templateify ggml_compute_forward_rope_f32 and _f16 (llama/16805) duduta 2025-11-11 13:33:24 +02:00
  • 3920ecce3a kleidiai: add optimized per-channel kernels for Q8_0 (llama/16993) Charles Xu 2025-11-11 12:20:31 +01:00
  • c01bf73dd1 cmake : add version to all shared object files (llama/17091) Mike Abbott 2025-11-11 04:19:50 -07:00
  • 46615d74d3 opencl: add fastdiv and use it in set_rows, ported from cuda (llama/17090) lhez 2025-11-10 15:00:13 -08:00
  • ccf525baf0 cpu: skip NOPs to avoid barriers (llama/17133) Max Krasnyansky 2025-11-10 12:44:49 -08:00
  • 40aebfe8bf metal : cap threadgroups size of set_rows (llama/17146) Georgi Gerganov 2025-11-10 21:33:35 +02:00
  • 86be60093e ggml-cpu : inspect -march and -mcpu to find the CPU (llama/16333) Adrien Gallouët 2025-11-10 20:03:36 +01:00
  • ef71d83b76 vulkan: check glslc executable string (llama/17144) Ruben Ortlam 2025-11-10 16:59:26 +01:00
  • 43f2c1ff54 vulkan: fix validation issue introduced by #16868 (llama/17145) Ruben Ortlam 2025-11-10 16:59:10 +01:00
  • bb92c79f56 metal : enable tensor API for A19 (llama/17087) Georgi Gerganov 2025-11-10 15:38:42 +02:00
  • 4fea91f06e arm64: add i8mm route with SVE ggml_vec_dot_q4_K_q8_K and ggml_vec_dot_q6_K_… (#15277) fj-y-saito 2025-11-10 22:12:59 +09:00
  • 58a97d988f cuda/vulkan : bicubic interpolation (llama/17022) Acly 2025-11-10 10:19:39 +01:00
  • 2e04e7a906 vulkan: fix memory allocations (llama/17122) Ruben Ortlam 2025-11-09 16:14:41 +01:00
  • 27f485a14c vad : Silero VAD v6.2.0 (#3524) KITAITI Makoto 2025-11-17 22:26:17 +09:00
  • d9b7613b34 ruby : VAD separately from ASR (#3518) KITAITI Makoto 2025-11-13 10:15:26 +09:00
  • a1867e0dad sync : llama.cpp Georgi Gerganov 2025-11-09 22:01:21 +02:00
  • e67dfbc51b sync : ggml Georgi Gerganov 2025-11-09 18:49:56 +02:00
  • 1993e397bb vulkan: iGPU memory reporting fix (llama/17110) Ruben Ortlam 2025-11-09 09:54:47 +01:00
  • ee8349cf10 vulkan: fix mmq out of bounds reads (llama/17108) Ruben Ortlam 2025-11-09 09:52:57 +01:00
  • db98e8c5b4 vulkan: fuse mul_mat_id + mul (llama/17095) Jeff Bolz 2025-11-09 02:48:42 -06:00
  • a4339e2ea7 metal : retain src and dst buffers during async ops (llama/17101) Georgi Gerganov 2025-11-09 08:28:51 +02:00
  • 6de3404773 vulkan: Use spec constants for conv2d s/d/p and kernel W/H (llama/16978) Jeff Bolz 2025-11-08 13:24:29 -06:00
  • 8967c9ad9b Revert "CUDA: add expert reduce kernel (ggml/16857)" (llama/17100) Aman Gupta 2025-11-08 21:05:19 +08:00
  • 522b9bce33 CUDA: skip fusion for repeating adds in bias (llama/17080) Aman Gupta 2025-11-08 16:58:05 +08:00
  • 0caa32c772 vulkan: Increase BK to 32; use BK/4 for non-CM mul_mm.comp (llama/16636) SavicStefan 2025-11-08 09:28:22 +01:00
  • 3c975ad523 ggml: disable vxe for cross-compilation by default (llama/16966) Aleksei Nikiforov 2025-11-08 09:00:20 +01:00
  • 257ce2f5c0 vulkan: fuse rms_norm + mul + rope (+ view + set_rows) (llama/16977) Jeff Bolz 2025-11-08 01:52:15 -06:00
  • 4eef518167 vulkan: Fix test-thread-safety crashes (llama/17024) Jeff Bolz 2025-11-08 01:39:45 -06:00
  • 358f77aca7 CUDA: fix MMQ stream-k fixup ne1 indices (llama/17089) Johannes Gäßler 2025-11-08 08:26:18 +01:00
  • 78ea6c5b67 ggml webgpu: faster matrix multiplication/matrix-vector multiplication (llama/17031) Reese Levine 2025-11-07 19:27:20 -08:00
  • 547724b0a5 CUDA: properly handle nb00=nb02 case for cpy (llama/17081) bssrdf 2025-11-07 17:41:58 -05:00
  • 11543bf446 vulkan : refactor buffer handling in vk_op_f32 (llama/16840) Acly 2025-11-07 21:08:50 +01:00
  • af8a88792f CUDA: fix should_use_mmvf for ne11 == 1 (llama/17085) Johannes Gäßler 2025-11-07 20:53:14 +01:00
  • a1746097bc Revert "ggml-cpu: detect correct cpu flags for arm64 (llama/16229) (#16239)" (llama/17084) Adrien Gallouët 2025-11-07 17:34:05 +01:00
  • 512592513c ggml-cpu: detect correct cpu flags for arm64 (ggml/16229) (llama/16239) iron 2025-11-08 00:18:14 +08:00
  • 5bce732795 ggml-cpu : optimize RVV q2_k and q3_k kernels (llama/16887) xctan 2025-11-07 00:12:45 +08:00
  • b5d6fa438f CUDA: fix crash on uneven context without FA (llama/16988) Johannes Gäßler 2025-11-06 14:05:47 +01:00
  • 32ed574370 metal : initial Metal4 tensor API support (llama/16634) Georgi Gerganov 2025-11-06 14:45:10 +02:00
  • 45588b272e sycl: add CONCAT operator support (llama/16047) YehuditE 2025-11-06 12:02:33 +02:00
  • b3324ae7d1 ggml-hexagon: graceful fallback for older socs where rpcmem_alloc2 and FASTRPC_GET_URI is unsupported (llama/16987) l3utterfly 2025-11-06 13:46:38 +08:00
  • 13cd906501 improve CUDA cpy memory bandwidth when copying transposed tensor (llama/16841) bssrdf 2025-11-05 15:55:04 -05:00
  • 558a04c9c7 vulkan: Fix GGML_VULKAN_CHECK_RESULTS to better handle fusion (llama/16919) Jeff Bolz 2025-11-05 12:51:03 -06:00
  • e734b5d6ef ggml webgpu: minor set rows optimization (llama/16810) Reese Levine 2025-11-09 14:44:39 +02:00
  • 44e77ccee6 refactor: replace sprintf with snprintf for safer string handling in dump functions (llama/16913) nullname 2025-11-05 04:25:39 +08:00
  • 1672d41ab0 vulkan: remove the need for the dryrun (llama/16826) Jeff Bolz 2025-11-04 13:28:17 -06:00
  • 997fdde0c4 ggml-cpu : bicubic interpolation (llama/16891) Acly 2025-11-04 13:12:20 +01:00
  • 52e43a2fa5 Fix garbled output with REPACK at high thread counts (llama/16956) Noah 2025-11-04 05:04:59 +00:00
  • e51a2f90fe CUDA: avoid mul + bias fusion when doing fusion (llama/16935) Aman Gupta 2025-11-04 10:53:48 +08:00
  • f856023f46 opencl: support imrope (llama/16914) lhez 2025-11-03 11:47:57 -08:00
  • 82ede64cd0 ggml: CUDA: add head size 72 for flash-attn (llama/16962) theo77186 2025-11-03 14:29:11 +01:00
  • 79801188f7 ggml : LoongArch fixes (llama/16958) Jinyang He 2025-11-03 14:40:02 +08:00