feat: Add Llama-3 chat format (#1371)
* feat: Add Llama-3 chat format * feat: Auto-detect Llama-3 chat format from gguf template * feat: Update llama.cpp to b2715 Includes proper Llama-3 <|eot_id|> token handling. --------- Co-authored-by: Andrei Betlen <abetlen@gmail.com>
A
abk16 committed
8559e8ce88b7c7343004eeccb7333b806034b01c
Parent: 617d536
Committed by GitHub <noreply@github.com>
on 4/23/2024, 6:33:29 AM