feat: Update sampling API for llama.cpp (#1742)
* Initial sampling API update
* Fix logger
* Update tests
* Update
* Remove seed
* Add sampling chain
* Remove unused test
* Use Qwen2 0.5B for CI tests
* Fix typo
* Fix typo
* Update cache version
* Use real model for tests
* Add huggingface-hub as a test dependency
* Remove RUST_LOG=trace
* Add actual logit processor test
Andrei committed
f8fcb3ea3424bcfba3a5437626a994771a02324b
Parent: a4e1451
Committed by GitHub <noreply@github.com>
on 9/19/2024, 12:00:19 AM