Commits: Dockerfile_intel - huggingface/text-generation-inference

huggingface / text-generation-inference UNCLAIMED

Large Language Model Text Generation Inference

0 0 0 Python

COMMITS

/ Dockerfile_intel

git_3.3.2

May 6, 2025

IPEX support FP8 kvcache/softcap/slidingwindow (#3144)

Wang, Yi committed 10mo ago

51a0b9d

April 15, 2025

transformers flash llm/vlm enabling in ipex (#3152)

Wang, Yi committed 11mo ago

459fbde

March 24, 2025

Torch 2.6 (#3134)

Nicolas Patry committed 1y ago

54d1546

March 18, 2025

Intel docker. (#3121)

Nicolas Patry committed 1y ago

67ce543

March 17, 2025

xpu 2.6 update (#3051)

Wang, Yi committed 1y ago

0b3e3db

March 4, 2025

Patch rust release. (#3069)

Nicolas Patry committed 1y ago

491ed9e

Revert "Patch rust release."

Nicolas Patry committed 1y ago

a914a21

Patch rust release.

Nicolas Patry committed 1y ago

aad9c2b

February 20, 2025

update ipex and torch to 2.6 for cpu (#3039)

Wang, Yi committed 1y ago

feaa247

February 18, 2025

It's find in some machine. using hf_hub::api::sync::Api to download c… (#3030)

Nicolas Patry committed 1y ago

5543fdc

February 7, 2025

Updating mllama after strftime. (#2993)

Nicolas Patry committed 1y ago

4b8cda6

February 6, 2025

Triton fix (#2995)

Wang, Yi committed 1y ago

36223f8

Using the "lockfile". (#2992)

Nicolas Patry committed 1y ago

0ef8c8a

January 22, 2025

fix moe in quantization path (#2935)

Wang, Yi committed 1y ago

1d3c9be

January 17, 2025

Moving to `uv` instead of `poetry`. (#2919)

Nicolas Patry committed 1y ago

de19e7e

Flash decoding kernel adding and prefill-chunking and prefix caching enabling in intel cpu/xpu (#2815)

Wang, Yi committed 1y ago

8851441

January 15, 2025

Upgrading our rustc version. (#2908)

Nicolas Patry committed 1y ago

203cade

January 9, 2025

update ipex xpu to fix issue in ARC770 (#2884)

Wang, Yi committed 1y ago

afb6c72

December 19, 2024

change xpu lib download link (#2852)

Wang, Yi committed 1y ago

ab5f616

December 6, 2024

use oneapi 2024 docker image directly for xpu (#2793)

Wang, Yi committed 1y ago

6685e8f

November 26, 2024

upgrade ipex cpu to fix coredump in tiiuae/falcon-7b-instruct (pageat… (#2778)

Wang, Yi committed 1y ago

892a26e

November 20, 2024

Install compressed-tensors in Docker CPU builds

Daniël de Kok committed 1y ago

45013b6

November 18, 2024

add ipex moe implementation to support Mixtral and PhiMoe (#2707)

Wang, Yi committed 1y ago

a5ecd6e

November 10, 2024

Add initial support for compressed-tensors checkpoints (#2732)

Daniël de Kok committed 1y ago

a785000

October 30, 2024

add xpu triton in dockerfile, or will show "Could not import Flash At… (#2702)

Wang, Yi committed 1y ago

46aeb08

October 16, 2024

feat: prefill chunking (#2600)

OlivierDehaene committed 1y ago

a6a0c97

October 14, 2024

update ipex to fix incorrect output of mllama in cpu (#2640)

Wang, Yi committed 1y ago

7a82ddc

October 8, 2024

Upgrade minor rust version (Fixes rust build compilation cache) (#2617)

Nicolas Patry committed 1y ago

8b295aa

September 12, 2024

hotfix : enable intel ipex cpu and xpu in python3.11 (#2517)

Wang, Yi committed 1y ago

3ac7df2

September 11, 2024

Fix tokenization yi (#2507)

Nicolas Patry committed 1y ago

dae3bf1

September 5, 2024

hotfix: fix regression of attention api change in intel platform (#2439)

Wang, Yi committed 1y ago

5cd8025

August 29, 2024

Lots of improvements (Still 2 allocators) (#2449)

Nicolas Patry committed 1y ago

e415b69

August 13, 2024

add numa to improve cpu inference perf (#2330)

Wang, Yi committed 1y ago

59922f9

August 9, 2024

Using HF_HOME instead of CACHE to get token read in addition to models. (#2288)

Nicolas Patry committed 1y ago

952b450

July 31, 2024

Rebase TRT-llm (#2331)

Nicolas Patry committed 1y ago

2b19d67

July 3, 2024

Fixing the dockerfile warnings. (#2173)

Nicolas Patry committed 1y ago

2b3bd1e

July 2, 2024

fix FlashDecoding change's regression in intel platform (#2161)

Wang, Yi committed 1y ago

5d97e0c

June 25, 2024

Cpu tgi (#1936)

Wang, Yi committed 1y ago

b64c70c

use xpu-smi to dump used memory (#2047)

Wang, Yi committed 1y ago

83634dc

June 24, 2024

Fix cargo-chef prepare (#2101)

ur4t committed 1y ago

405765b

June 17, 2024

Set maximum grpc message receive size to 2GiB (#2075)

Daniël de Kok committed 1y ago

c8c7ccd

June 6, 2024

Xpu gqa (#2013)

Wang, Yi committed 1y ago

4dabddb

Internal runner ? (#2023)

Nicolas Patry committed 1y ago

ed1cfde

June 5, 2024

feat: move allocation logic to rust (#1835)

OlivierDehaene committed 1y ago

8aece3b

June 3, 2024

reable xpu, broken by gptq and setuptool upgrade (#1988)

Wang, Yi committed 1y ago

d1d724b

May 23, 2024

reenable xpu for tgi (#1939)

Wang, Yi committed 1y ago

f41d644

May 6, 2024

update xpu docker image and use public ipex whel (#1860)

Wang, Yi committed 1y ago

59b3ffe

Upgrading to rust 1.78. (#1851)

Nicolas Patry committed 1y ago

ac7076b

April 26, 2024

add intel xpu support for TGI (#1475)

Wang, Yi committed 1y ago

45ecf9d