Commit Graph

  • 2f33395197 ggml-hexagon: gelu optimization (llama/18151) Shouyu 2025-12-22 13:56:52 -05:00
  • 5b0c1c1580 llamafile: add rvv support for sgemm kernels (llama/18199) Taimur Ahmad 2025-12-22 23:20:23 +05:00
  • f2fe1e5baf opencl: unpack q4_0 for adreno in get_tensor (llama/18278) lhez 2025-12-22 10:19:01 -08:00
  • dbbe6c11b5 vulkan: Extend rope fusions to allow mrope (llama/18264) Jeff Bolz 2025-12-22 11:03:13 -06:00
  • 98e59a43d1 vulkan: Implement set_tensor_async and the event interfaces (llama/18047) Jeff Bolz 2025-12-21 14:52:09 -06:00
  • b68b12f2d5 llama: fix RPC for -fit on (llama/18233) Johannes Gäßler 2025-12-21 19:33:08 +01:00
  • b893e0813a vulkan: fix im2col overflowing maxworkgroupcount (llama/18180) Jeff Bolz 2025-12-21 03:32:58 -06:00
  • f407c5e562 vulkan/cuda: fix topk_moe with exp_probs_b (llama/18071) Jeff Bolz 2025-12-21 03:27:34 -06:00
  • ad6ee3865d vulkan: support GGML_UNARY_OP_XIELU (llama/18062) Jeff Bolz 2025-12-21 03:17:58 -06:00
  • 3cd141f1a9 vulkan: in graph_optimize, try to group ADD operations (llama/18060) Jeff Bolz 2025-12-21 03:05:08 -06:00
  • 449fc7c024 Vulkan: some improvement on mul_mat_iq2_xs (llama/18031) lovedheart 2025-12-21 09:59:52 +01:00
  • 0983985f06 Added comments explaining thread block size selection logic based on row count and column size, derived from historical commit context (llama/18212) Aadeshveer Singh 2025-12-20 16:58:57 +05:30
  • 17a4cb15b8 ggml-hexagon: Implement true Q8_0 quantization on Hexagon NPU for more accurate mixed-precision matmul operations (llama/17977) Alfred 2025-12-19 12:42:28 -05:00
  • 195d8d0c65 vulkan: Add perf logger mode with concurrency (llama/17944) Jeff Bolz 2025-12-18 23:36:46 -06:00
  • fea481f412 model : add ASR support for LFM2-Audio-1.5B (conformer) (llama/18106) Xuan-Son Nguyen 2025-12-19 00:18:01 +01:00
  • 956fac433b ggml-cpu: extend support for RVV floating-point kernels (llama/17318) Taimur Ahmad 2025-12-18 19:02:09 +05:00
  • 325a9b739c remove i_major_dual (llama/18157) yulo 2025-12-18 19:50:56 +08:00
  • c3a16089e3 ggml-hexagon: swiglu_oai operation (llama/18114) Shouyu 2025-12-17 16:38:21 -05:00
  • c7ccedb5ba ggml-hexagon: gelu operation (llama/17921) Shouyu 2025-12-17 13:39:32 -05:00
  • 1f72f00542 ggml-cpu: ARM64: repack version of q8_0 (dotprod and i8mm) (llama/18096) Alberto Cabrera Pérez 2025-12-17 11:39:13 +00:00
  • 9118c05dc4 HIP: Refactor mma for RDNA and CDNA (llama/17990) yulo 2025-12-17 16:34:54 +08:00
  • 6114e69213 ruby : add Whisper::Token, fix model URI (#3575) KITAITI Makoto 2025-12-24 16:52:16 +09:00
  • 6c22e792cb talk-llama : sync llama.cpp Georgi Gerganov 2025-12-17 15:20:22 +02:00
  • 698348aadc sync : ggml Georgi Gerganov 2025-12-17 15:19:57 +02:00
  • 00108bb713 llama.android : Rewrite Android binding (w/o cpu_features dep) (llama/17413) Naco Siren 2025-12-17 00:14:47 -08:00
  • 41a95b8ba7 ggml : use WARP_SIZE/2 for argmax reduction offset (llama/18092) Aadeshveer Singh 2025-12-17 09:17:01 +05:30
  • 8dd70bdc85 ggml-hexagon: mm for mtmd (llama/17894) Shouyu 2025-12-15 13:53:56 -05:00
  • b90ec07aba metal: use shared buffers on eGPU (llama/17866) Jeremy Demeule 2025-12-15 15:14:49 +01:00
  • aaf3f39b4a llama: automatically set parameters not set by the user in such a way that maximizes GPU utilization (llama/16653) Johannes Gäßler 2025-12-15 09:24:59 +01:00
  • b5e352a52f Support gpt-oss by OPs add-id, mul_mat for mxfp4, swiglu_oai (llama/17826) Neo Zhang Jianyu 2025-12-15 10:35:15 +08:00
  • 3bb4e1e0ac vulkan: fix mul_mat_vec_iq1_s formatting (llama/18026) Ruben Ortlam 2025-12-14 14:52:46 +01:00
  • af2c8cba6f vulkan: Fix data race/hang in scalar/cm1 flash attention (llama/17887) Jeff Bolz 2025-12-14 02:00:00 -06:00
  • 7e5df2975e vulkan: improve mul_mat_vec_iq1_s speed (llama/17874) lovedheart 2025-12-14 08:47:49 +01:00
  • cdadfc3b72 vulkan: faster q6_k matmul (llama/17813) Eve 2025-12-14 07:29:37 +00:00
  • b62ef9af7a ggml : arm repack fix build (llama/0) Georgi Gerganov 2025-12-13 22:54:14 +02:00
  • b901ebe4a3 vulkan: support get_rows for i32 (llama/17941) Jeff Bolz 2025-12-13 03:12:53 -06:00
  • f33446643e vulkan: support GGML_OP_DIAG (llama/17893) Jeff Bolz 2025-12-13 03:07:49 -06:00
  • 939d3085e9 vulkan: Multi-pass softmax for large number of cols (llama/17892) Jeff Bolz 2025-12-13 03:04:29 -06:00
  • 13bb296dbf vulkan: Allow non-pow2 n_experts in topk_moe (llama/17872) Jeff Bolz 2025-12-13 01:40:04 -06:00
  • feb856d4a1 CUDA: fix overflow in MMA kernel without stream-k (llama/17939) Johannes Gäßler 2025-12-12 17:43:58 +01:00
  • db1fcd958f cann : fix ops broken by circular padding guard (llama/17825) Sigbjørn Skjæret 2025-12-12 15:49:27 +01:00
  • 2c782ec325 ggml-cpu : fix RISC-V Q4_0 repack select and RVV feature reporting (llama/17951) ixgbe 2025-12-12 22:26:03 +08:00
  • 25d99e9135 HIP: enable mmf for RDNA3 (llama/17879) yulo 2025-12-12 18:34:33 +08:00
  • e0af519a61 SOLVE_TRI extension to more dimensions (llama/17793) Piotr Wilkin (ilintar) 2025-12-11 17:20:43 +01:00
  • 3e79e73eee build: link whisper target against Threads::Threads for FreeBSD support (#3568) Russ 2025-12-17 09:13:38 +00:00
  • 2551e4ce98 server: allow custom temp directory for ffmpeg (#3564) Marcos Del Sol Vives 2025-12-13 08:37:44 +01:00
  • f0c9017a2f ggml : arm repack fix build (#0) sync-ggml-25-12-12 Georgi Gerganov 2025-12-13 08:04:09 +02:00
  • 179d8b1c9c talk-llama : sync llama.cpp Georgi Gerganov 2025-12-12 17:56:43 +02:00
  • 48cdc06e91 sync : ggml Georgi Gerganov 2025-12-12 17:55:11 +02:00
  • 72714d169c whisper : adjust to ggml changes (#0) Georgi Gerganov 2025-12-12 17:54:58 +02:00
  • 324dd21d3c cmake : set CMAKE_RUNTIME_OUTPUT_DIRECTORY for non standalone build (ggml/1394) Congcong Cai 2025-12-12 22:37:38 +08:00
  • 1da1a6865c ggml-alloc : fix reuse-parent logic for misaligned sizes (llama/17884) Georgi Gerganov 2025-12-11 14:30:10 +02:00
  • 0c88de5c69 ggml-hexagon: fix rope failure at test-backend-ops (llama/17565) nullname 2025-12-11 06:45:43 +08:00
  • a2886fba48 Fix race conditions in threadpool when dealing with dynamic/frequent n_threads changes (llama/17748) Max Krasnyansky 2025-12-10 12:32:23 -08:00
  • cd9b8c6d18 ggml : remove GGML_KQ_MASK_PAD constant (llama/17910) Georgi Gerganov 2025-12-10 20:53:16 +02:00
  • ca8ea18d06 cuda : add missing support check for xielu (llama/17895) Sigbjørn Skjæret 2025-12-10 16:16:20 +01:00
  • ea1829134f CUDA: fix unpadded strides in MMA FA kernel (llama/17891) Johannes Gäßler 2025-12-10 12:39:56 +01:00
  • c10b4f9a01 fix softmax for iGPU (llama/17838) Neo Zhang Jianyu 2025-12-10 16:59:57 +08:00
  • 307dc525bb metal: SSM kernel improvements (llama/17876) Gabe Goodhart 2025-12-09 12:30:02 -07:00
  • 2817582be2 Add DIAG for CUDA (llama/17873) Piotr Wilkin (ilintar) 2025-12-09 20:28:57 +01:00
  • 41bbc034f0 ggml : Provide macos-specific backtrace printing to avoid terminal death (llama/17869) Gabe Goodhart 2025-12-09 09:29:07 -07:00
  • b6ae0b29d1 metal : print node names for debugging (llama/17882) Georgi Gerganov 2025-12-09 15:25:49 +02:00
  • ba463fb577 ggml : allow fill node alloc inplace (llama/17870) Sigbjørn Skjæret 2025-12-09 12:23:47 +01:00
  • 79d86a5c2c CANN: add support for partial RoPE and Vision mode (llama/17543) Chenguang Li 2025-12-09 17:53:23 +08:00
  • bef1f5a57e CUDA: fix FP16 overflow in tile FA kernel (llama/17875) Johannes Gäßler 2025-12-09 09:34:02 +01:00
  • 821c2071ab cuda : add FILL op support (llama/17851) Jay Zenith 2025-12-08 05:10:12 -08:00
  • e1562e85fc cuda: optimize SOLVE_TRI using registers and FMAF (llama/17703) wsbagnsv1 2025-12-08 10:41:08 +01:00
  • c8d0ee2f9f ggml-cpu: add ggml_thread_cpu_relax with Zihintpause support (llama/17784) ixgbe 2025-12-08 16:41:34 +08:00
  • d6d44fac69 Vulkan: improve mul_mat_vec_iq1_m (llama/16907) lovedheart 2025-12-07 18:40:42 +01:00
  • 447ef8633b sycl: add missing BF16 conversion support for Intel oneAPI (llama/17780) Law Po Ying 2025-12-07 09:18:18 +08:00
  • 898f876fe2 vulkan: perf_logger improvements (llama/17672) Jeff Bolz 2025-12-06 11:46:46 -06:00
  • ebff8f9db9 ggml-zendnn : add ZenDNN backend for AMD CPUs (llama/17690) Vishal Singh 2025-12-06 21:43:33 +05:30
  • c5e1807071 ggml : add circular tiling support to pad, for Vulkan, CUDA, and CPU (used for making seamless textures) (llama/16985) Phylliida Dev 2025-12-06 06:07:02 -08:00
  • 94be71911f HIP: fix RDNA3 FP16/BF16 matrix multiplication (llama/17817) Johannes Gäßler 2025-12-06 13:45:36 +01:00
  • b67e3abdb2 ggml : improve error handling for search path existence checks (llama/17653) Sky 2025-12-06 19:28:16 +08:00
  • c66c71e9f4 vulkan: Use one row per workgroup for f32 mmv (llama/17711) Jeff Bolz 2025-12-06 04:12:26 -06:00
  • 875d861473 vulkan: support solve_tri with larger N/K values (llama/17781) Jeff Bolz 2025-12-06 01:56:45 -06:00
  • 41cf229d72 metal : fix build (llama/17799) Georgi Gerganov 2025-12-06 09:33:59 +02:00