
Large Language Model Text Generation Inference

Python

COMMITS

/ Dockerfile_intel
git_3.3.2
May 6, 2025
April 15, 2025
March 24, 2025
Torch 2.6 (#3134)
Nicolas Patry committed
March 18, 2025
Intel docker. (#3121)
Nicolas Patry committed
March 17, 2025
xpu 2.6 update (#3051)
Wang, Yi committed
March 4, 2025
Patch rust release. (#3069)
Nicolas Patry committed
Revert "Patch rust release."
Nicolas Patry committed
Patch rust release.
Nicolas Patry committed
February 20, 2025
February 18, 2025
February 7, 2025
February 6, 2025
Triton fix (#2995)
Wang, Yi committed
Using the "lockfile". (#2992)
Nicolas Patry committed
January 22, 2025
January 17, 2025
January 15, 2025
Upgrading our rustc version. (#2908)
Nicolas Patry committed
January 9, 2025
December 19, 2024
December 6, 2024
November 26, 2024
November 20, 2024
November 18, 2024
November 10, 2024
October 30, 2024
October 16, 2024
feat: prefill chunking (#2600)
OlivierDehaene committed
October 14, 2024
October 8, 2024
September 12, 2024
September 11, 2024
Fix tokenization yi (#2507)
Nicolas Patry committed
September 5, 2024
August 29, 2024
August 13, 2024
August 9, 2024
July 31, 2024
Rebase TRT-llm (#2331)
Nicolas Patry committed
July 3, 2024
July 2, 2024
June 25, 2024
Cpu tgi (#1936)
Wang, Yi committed
June 24, 2024
June 17, 2024
June 6, 2024
Xpu gqa (#2013)
Wang, Yi committed
Internal runner ? (#2023)
Nicolas Patry committed
June 5, 2024
June 3, 2024
May 23, 2024
reenable xpu for tgi (#1939)
Wang, Yi committed
May 6, 2024
April 26, 2024