feat: Update sampling API for llama.cpp (#1742)

* Initial sampling API update

* Fix logger

* Update tests

* Update

* Remove seed

* Add sampling chain

* Remove unused test

* Use Qwen2 0.5B for CI tests

* Fix typo

* Fix typo

* Update cache version

* Use real model for tests

* Add huggingface-hub as a test dependency

* Remove RUST_LOG=trace

* Add actual logit processor test
Andrei committed
f8fcb3ea3424bcfba3a5437626a994771a02324b
Parent: a4e1451
Committed by GitHub <noreply@github.com> on 9/19/2024, 12:00:19 AM