Commit Graph

  • 01eb415772 Update demo link in README.md main Yan Xia 2026-03-10 15:49:46 +08:00
  • 0fdaa16ae3 Merge pull request #421 from microsoft/fix/unsafe-deserialization-gpu-pipeline tsong-ms 2026-03-09 20:20:21 +08:00
  • eb60fc39cb fix: add weights_only=True to torch.load in GPU inference pipeline fix/unsafe-deserialization-gpu-pipeline Ubuntu 2026-03-09 12:09:19 +00:00
  • 8fd3412fbc Merge pull request #406 from XsquirrelC/main tsong-ms 2026-02-03 13:30:42 +08:00
  • 3987a503fd [fix] convert pt to gguf XSquirrelC 2026-02-03 05:24:24 +00:00
  • ade47a535c Merge pull request #380 from XsquirrelC/main tsong-ms 2026-01-27 13:48:51 +08:00
  • 77e136fb08 [fix] change README link XSquirrelC 2026-01-27 03:39:11 +00:00
  • cc3c9e4c80 Merge pull request #379 from XsquirrelC/main tsong-ms 2026-01-27 11:24:02 +08:00
  • 1876a3e889 [merge] submodule llama.cpp XSquirrelC 2026-01-27 03:09:32 +00:00
  • e8c8107dcf [modify] some test picture and add power test script XSquirrelC 2026-01-25 06:51:33 +00:00
  • 7b2c52b9d5 [modify] some utils test script XSquirrelC 2026-01-24 08:40:36 +00:00
  • 2fed9af730 [fix] setup_env.py bug COMPILER_EXTRA_ARGS XSquirrelC 2026-01-22 11:11:14 +00:00
  • 7e6f0e14f1 [modify] update README; [feat] some test script in utils deva100 2026-01-22 06:33:03 +00:00
  • 7ea1f2601f [modify] fine_tuning_result.png deva100 2026-01-20 07:40:37 +00:00
  • b68802ff17 [fix] embed-quant q6_k; [modify] README update deva100 2026-01-20 04:56:50 +00:00
  • 35b1c28585 [fix] correct README deva100 2026-01-15 03:44:50 +00:00
  • 53ffe5e92b [chore] update README deva100 2026-01-15 03:37:16 +00:00
  • 43da5e5f76 [fix] make demo_benchmark.sh more fast deva100 2025-12-23 07:23:14 +00:00
  • 41cc304868 [chore] add some automation bash script for BitNet Tech Report deva100 2025-12-23 06:48:33 +00:00
  • 112f853414 [feat] I2S kernels for weight & activation parallel on Intel & ARM machine; [feat] I2S GEMV & GEMM(llama.cpp); [feat] quantize activation & dequantize embedding(llama.cpp); [fix] compile bug: cannot define __ARM_FEATURE_DOTPROD(llama.cpp) deva100 2025-11-19 07:35:05 +00:00
  • 404980eeca Merge pull request #290 from microsoft/gpu-readme-dev Junhui He 2025-06-03 14:14:20 +08:00
  • 088e607b25 Merge pull request #280 from microsoft/fix-convert-dev Junhui He 2025-06-03 13:59:47 +08:00
  • c1e9a9a237 Update readme for gpu kernels gpu-readme-dev ZeonfaiHo 2025-05-31 21:38:39 +08:00
  • 43e9b2d4a0 Enable conversion from .safetensors checkpoints to gguf files fix-convert-dev junhuihe 2025-05-21 20:13:37 +08:00
  • 69a20459f5 Merge pull request #268 from younesbelkada/add-falcon-e-final tsong-ms 2025-05-21 16:28:05 +08:00
  • 5c12850ed9 Merge branch 'add-falcon-e-final' of github.com:younesbelkada/BitNet into add-falcon-e-final younesbelkada 2025-05-21 11:53:40 +04:00
  • 765741d80b update submodule younesbelkada 2025-05-21 11:52:30 +04:00
  • f314d18863 feat: add also base models Younes Belkada 2025-05-21 04:11:07 +04:00
  • 9e9575665e Merge branch 'microsoft:main' into add-falcon-e-final Younes Belkada 2025-05-20 17:05:11 +04:00
  • 70285e0154 Merge pull request #276 from microsoft/readme-dev tsong-ms 2025-05-20 16:14:18 +08:00
  • 6197e9feb0 refine readme for gpu kernel readme-dev tsong-ms 2025-05-20 12:29:56 +08:00
  • 6c2c08f67e Merge pull request #266 from microsoft/gpu-dev Junhui He 2025-05-19 12:46:20 +08:00
  • 154c92b704 Init gpu branch gpu-dev Junhui He 2025-05-15 05:55:42 +00:00
  • 0015ad5201 Update README.md Younes Belkada 2025-05-15 18:49:28 +04:00
  • de371b708d add falcon-e support younesbelkada 2025-05-14 17:07:05 +04:00
  • c9e752c9d7 Fix build error with GCC by forcing Clang compiler in CMake on android/aarch64 (#242) Benjamin Wegener 2025-05-08 10:22:45 +02:00
  • 1792346223 Add run_inference_server.py for Running llama.cpp Built-in Server (#204) Benjamin Wegener 2025-05-08 10:22:12 +02:00
  • c17d1c5d77 Merge pull request #212 from microsoft/arch-name-dev Junhui He 2025-04-23 11:20:15 +08:00
  • 488dc1e876 Fix model architecture name junhuihe 2025-04-22 17:28:59 +08:00
  • fd9f1d6e46 Merge pull request #176 from microsoft/readme-dev tsong-ms 2025-04-16 12:35:53 +08:00
  • 874e6bd5fb refine readme tsong 2025-04-16 04:34:59 +00:00
  • 034b34cb70 Merge pull request #175 from microsoft/readme-dev tsong-ms 2025-04-15 22:42:12 +08:00
  • 71fdd9472f add third-party demo tsong 2025-04-15 14:36:05 +00:00
  • 1c77bd8966 Update README.md Yan Xia 2025-04-15 17:11:23 +08:00
  • 8f75f99c72 Update README.md (#172) Yan Xia 2025-04-15 17:07:20 +08:00
  • 0e7dadba1e Update README.md Yan Xia 2025-04-15 15:24:42 +08:00
  • fd3f355a0b update readme and setup script to support official BitNet b1.58 model (#171) Yan Xia 2025-04-15 14:53:56 +08:00
  • fa854cf8f8 Merge pull request #167 from potassiummmm/bitnet-25 tsong-ms 2025-04-15 14:27:46 +08:00
  • 09f91066d6 add conversion logic for new model potassiummmm 2025-03-12 18:34:05 +08:00
  • 4f2e41a514 add support for bitnet2b_2501 model potassiummmm 2025-03-12 18:16:45 +08:00
  • caf17ec438 update README Eddie-Wang1120 2025-02-18 21:13:27 +08:00
  • 3dcfd14628 update README paper Eddie-Wang1120 2025-02-18 15:42:06 +08:00
  • 0ab05d6f64 update 3rdparty & fix tl2 bug Eddie-Wang1120 2025-02-16 15:39:15 +08:00
  • 61e37b5430 update README Eddie-Wang1120 2025-02-16 15:07:08 +08:00
  • 4c736e3728 commit paper code Eddie-Wang1120 2025-02-16 15:03:25 +08:00
  • 437b321dcf Merge pull request #145 from potassiummmm/readme-new-model potassiummmm 2024-12-20 16:22:10 +08:00
  • d0fc8c9a39 Update README.md potassiummmm 2024-12-20 14:58:53 +08:00
  • 1791e8eb1c Merge branch 'microsoft:main' into readme-new-model potassiummmm 2024-12-20 14:56:08 +08:00
  • 253954811b Merge pull request #130 from lfoppiano/patch-1 potassiummmm 2024-12-20 14:55:38 +08:00
  • 933e8950bd Update README.md potassiummmm 2024-12-19 18:47:53 +08:00
  • b441b76118 Update README.md potassiummmm 2024-12-19 18:35:31 +08:00
  • fa83380d99 Update README.md potassiummmm 2024-12-19 18:32:54 +08:00
  • 3e19f15cd0 Merge pull request #142 from microsoft/f3-fix-2 potassiummmm 2024-12-19 01:21:21 +08:00
  • c96c2499d6 Fix model name in setup_env.py potassiummmm 2024-12-19 01:20:14 +08:00
  • e255fef69b Merge pull request #141 from potassiummmm/f3-fix potassiummmm 2024-12-18 21:27:26 +08:00
  • 0a446952e1 fix readme issue and -cnv option issue potassiummmm 2024-12-18 21:20:26 +08:00
  • aa39c0cdcc fix version requirement of transformers pypi package and model list for codegen potassiummmm 2024-12-18 17:54:23 +08:00
  • 33ceabed0b Merge pull request #137 from younesbelkada/f3-changes tsong-ms 2024-12-18 11:41:20 +08:00
  • 6a5134a6f0 change younesbelkada 2024-12-17 08:45:25 +00:00
  • 85c3247323 add changes on README younesbelkada 2024-12-17 07:05:35 +00:00
  • c1c55417c2 fix issues younesbelkada 2024-12-16 15:26:33 +00:00
  • de19627eef add 10b model younesbelkada 2024-12-16 14:42:15 +00:00
  • a838911a55 more changes to support chat models younesbelkada 2024-12-09 16:45:31 +00:00
  • 22566ab52e Make the coverage table more readable with both dark and light theme Luca Foppiano 2024-12-05 12:02:16 +00:00
  • 7c57a5ae20 fix weird character issue younesbelkada 2024-11-23 14:49:44 +00:00
  • 22575a47cf update submodule younesbelkada 2024-11-14 15:08:47 +00:00
  • c1892d6818 updated submodule younesbelkada 2024-11-14 14:53:43 +00:00
  • 18cfa8af89 add fc3 support younesbelkada 2024-11-14 14:51:09 +00:00
  • bf11a49f11 Add support for ios platform potassiummmm 2024-11-11 15:13:55 +08:00
  • 4645960add Update README.md Shuming Ma 2024-11-08 17:23:21 +08:00
  • 37c247c4dc Merge pull request #79 from MrEcco/main potassiummmm 2024-11-07 00:30:10 +08:00
  • 338973feb8 Merge pull request #83 from JCGoran/jelic/gcc_fixes potassiummmm 2024-11-07 00:29:17 +08:00
  • 80b94aecb2 Fix llama-bench path error on Windows potassiummmm 2024-10-31 16:50:47 +08:00
  • 9b29748910 Update README.md acknowledgement section Yan Xia 2024-10-29 21:25:19 +08:00
  • 489b7c5abf Add -fpermissive if using GCC Goran Jelic-Cizmek 2024-10-26 17:10:54 +02:00
  • 141ddfd4fe Fix compiler errors on GCC Goran Jelic-Cizmek 2024-10-25 16:03:18 +02:00
  • 9d37b8692d Add GCC to compiler check Goran Jelic-Cizmek 2024-10-25 16:03:18 +02:00
  • 4943bfd286 Update README.md Shuming Ma 2024-10-25 13:52:52 +08:00
  • 5fc289c395 Merge pull request #54 from deiteris/fix-memleak Eddie-Wang 2024-10-24 16:52:09 +08:00
  • 70804c68e4 Fixing compilation error for ARM64+TL1 settings: microsoft#74 Andre Buryndin 2024-10-23 21:50:04 +02:00
  • a97dfe458a update the new technical report in readme Shaoguang Mao 2024-10-22 11:11:44 +08:00
  • 60766967a4 Fix memory leak in quantize_i2_s Yury 2024-10-21 12:27:46 +03:00
  • 5e39e75325 Update README.md Shaoguang Mao 2024-10-20 23:26:55 +08:00
  • e9ab8830fa update readme, add performance results on x86 cpu Shaoguang Mao 2024-10-18 14:49:18 +08:00
  • 04b16bd292 Merge pull request #7 from jasondavies/typos Shaoguang Mao 2024-10-18 10:55:12 +08:00
  • c82b5e6674 update 3rdparty/llama.cpp Eddie-Wang1120 2024-10-18 10:08:22 +08:00
  • b384804b95 Fix typos. Jason Davies 2024-10-17 20:57:45 +01:00
  • a82fabc7d7 update readme Shaoguang Mao 2024-10-17 23:27:30 +08:00
  • 6ed5335555 refine readme Ting Song 2024-10-17 21:31:09 +08:00
  • 6cfd8831fd initial commit potassiummmm 2024-10-17 21:21:10 +08:00