A high-throughput and memory-efficient inference and serving engine for LLMs
[Bugfix] Fix Qwen3.5-FP8 Weight Loading Error on TPU (#37348)
Signed-off-by: Jacob Platin <jacobplatin@google.com>
J
Jacob Platin committed
d7d51a7ee5ce3b0fe420e773e2dcd38336c338ff
Parent: 3c3c084
Committed by GitHub <noreply@github.com>
on 3/26/2026, 12:46:01 AM