SIGN IN SIGN UP
vllm-project / vllm UNCLAIMED

A high-throughput and memory-efficient inference and serving engine for LLMs

74452 0 1 Python

[Bugfix] Fix Qwen3.5-FP8 Weight Loading Error on TPU (#37348)

Signed-off-by: Jacob Platin <jacobplatin@google.com>
J
Jacob Platin committed
d7d51a7ee5ce3b0fe420e773e2dcd38336c338ff
Parent: 3c3c084
Committed by GitHub <noreply@github.com> on 3/26/2026, 12:46:01 AM