feat: Add Llama-3 chat format (#1371)

* feat: Add Llama-3 chat format

* feat: Auto-detect Llama-3 chat format from gguf template

* feat: Update llama.cpp to b2715

Includes proper Llama-3 <|eot_id|> token handling.

---------

Co-authored-by: Andrei Betlen <abetlen@gmail.com>

abk16 committed 1y ago

8559e8ce88b7c7343004eeccb7333b806034b01c

Parent: 617d536

Committed by GitHub <noreply@github.com> on 4/23/2024, 6:33:29 AM