by khimaros 883 days ago
llama.cpp supports custom grammars to constrain what the model is allowed to output during inference. maybe this is a helpful starting point? https://github.com/ggerganov/llama.cpp/tree/master/grammars
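
to make the idea concrete, here is a rough Python sketch of what "constraining inference with a grammar" means. it is a toy, not llama.cpp's implementation: the grammar text uses the GBNF syntax from the linked directory, but `ALLOWED`, `is_valid_prefix`, `constrained_pick`, `decode`, and the token scores are all made up for illustration. the point is only that tokens which cannot lead to a grammar-valid string are dropped before the best one is picked.

```python
# Toy illustration of grammar-constrained decoding.
# Assumption: this is NOT how llama.cpp implements it; all names below are hypothetical.

# GBNF text for a grammar that only accepts the strings "yes" or "no".
YES_NO_GBNF = 'root ::= "yes" | "no"\n'

# The same language, enumerated by hand so the toy needs no GBNF parser.
ALLOWED = ("yes", "no")


def is_valid_prefix(text):
    """Return True if text can still be extended into an allowed string."""
    return any(s.startswith(text) for s in ALLOWED)


def constrained_pick(prefix, scored_tokens):
    """Pick the highest-scoring token that keeps the output grammar-valid."""
    valid = {
        tok: score
        for tok, score in scored_tokens.items()
        if is_valid_prefix(prefix + tok)
    }
    if not valid:
        return None
    return max(valid, key=valid.get)


def decode(steps):
    """Run the toy decoder over a list of per-step {token: score} dicts."""
    out = ""
    for scored_tokens in steps:
        tok = constrained_pick(out, scored_tokens)
        if tok is None:
            break
        out += tok
        if out in ALLOWED:  # a complete grammar-valid string was produced
            break
    return out


if __name__ == "__main__":
    # Made-up scores: the unconstrained model would prefer "may" + "be".
    steps = [
        {"may": 0.9, "ye": 0.5, "n": 0.1},
        {"be": 0.9, "s": 0.2},
    ]
    print(decode(steps))
```

with real llama.cpp you would not write any of this yourself. you would save the grammar text to a `.gbnf` file and hand it to the CLI (as far as I know via a `--grammar-file` option, but check the README in that directory for the current flags).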