Hacker News new | ask | show | jobs
by Oranguru 764 days ago
You can easily fix this using a grammar constraint with llama.cpp. Add this to the command: --grammar "root ::= [^一-鿿ぁ-ゟァ-ヿ가-힣]*"

This will ban Chinese characters from the sampling process. Works for Yi and Qwen models.