Activity - vllm-project/vllm - Morph

SIGN IN SIGN UP

vllm-project / vllm UNCLAIMED

A high-throughput and memory-efficient inference and serving engine for LLMs

0 0 1 Python

ACTIVITY

week month year

COMMITS

20

in the last week

CONTRIBUTORS

19

active

STARS

0

total

FORKS

0

total

TOP CONTRIBUTORS

C

Cyrus Leung

2 commits

W

Wentao Ye

1 commit

M

Mateusz Sokół

1 commit

K

Kunshang Ji

1 commit

V

Vadim Gimpelson

1 commit

F

Fadi Arafeh

1 commit

T

Terry Gao

1 commit

J

Jee Jee Li

1 commit

M

Matej Rojec

1 commit

B

BadrBasowid

1 commit

RECENT COMMITS

W

[Refactor] Remove unused utils (#38153)

Wentao Ye 10h ago

M

DOC: Documentation pages fixes (#38125)

Mateusz Sokół 10h ago

K

[XPU] Disable xpu graph by default (#38193)

Kunshang Ji 10h ago

C

[Doc] Fix outdated reference to CUDAGraphManager (#38209)

Cyrus Leung 10h ago

C

[Model] Use helper function to run MM processors with token inputs (where applicable) (#38018)

Cyrus Leung 10h ago

V

[Bugfix] Fix DeepGemm E8M0 accuracy degradation for Qwen3.5 FP8 on Blackwell (#38083)

Vadim Gimpelson 11h ago

F

[cpu][ci] remove soft-fail for Arm CI and add quant model tests (#37691)

Fadi Arafeh 12h ago

T

[Model] Add torch.compile support for InternVL vision encoder (#38049)

Terry Gao 12h ago

J

[Bugfix] Fix benchmark_fused_collective.py (#38082)

Jee Jee Li 12h ago

M

Add `/v1/chat/completions/batch` endpoint for batched chat completions (#38011)

Matej Rojec 15h ago

B

[Bugfix][CI] Fix Marlin FP8 Linear Kernel for Compressed Tensors Format (#38092)

BadrBasowid 15h ago

W

Relocate Encoder CUDA graph manager (#38116)

Woosuk Kwon 15h ago

F

[Tool Parser][1/3] Pass tools to ToolParser constructor (#38029)

Flora Feng 17h ago

C

[Revert] Remove DeepGEMM availability check in DeepseekV32IndexerMetadataBuilder (#38076)

Chauncey 17h ago

A

[Misc] Optimized check to encapsulate both CUDA and ROCm platforms (#34549)

Andreas Karatzas 17h ago

X

Disable dual stream execution of input projection for Qwen3 (#38152)

Xin Yang 18h ago

W

Fix minimax m2.5 nvfp4 kv scales weight loading (#37214)

Wei Zhao 18h ago

J

[Bugfix] Fix Qwen3.5-FP8 Weight Loading Error on TPU (#37348)

Jacob Platin 18h ago

H

Various Transformers v5 fixes (#38127)

Harry Mellor 19h ago

E

[Cohere] Enable Cohere-Transcribe (#38120)

Ekagra Ranjan 20h ago