by lelag
426 days ago
I was going to point that out. I'll add that constrained generation is supported by the major inference engines, such as llama.cpp and vLLM, so what you are describing is actually trivial on locally hosted models: you just provide a regex that prevents the model from using the letter 'e' in its output.
https://github.com/sam-paech/antislop-sampler
https://arxiv.org/abs/2306.15926
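
To make the regex idea concrete, here is a rough toy sketch of what a constrained decoder does at each step. It is not llama.cpp or vLLM code: `NO_E`, `allowed`, `pick_token` and the scores are all made up for illustration. The constraint is the pattern `[^eE]*`, and any candidate token that would break it is masked out before picking.

```python
import re

# Constraint: the whole output must match this pattern (no 'e' or 'E' anywhere).
NO_E = re.compile(r"[^eE]*")

def allowed(token: str) -> bool:
    """True if this token contains no forbidden letter.

    Because the pattern is a simple character-class star, checking each
    token on its own is enough to keep the full output valid.
    """
    return NO_E.fullmatch(token) is not None

def pick_token(scores: dict[str, float]) -> str:
    """Greedy step: highest-scoring token among those the regex permits."""
    candidates = {tok: s for tok, s in scores.items() if allowed(tok)}
    if not candidates:
        raise ValueError("no token satisfies the constraint")
    return max(candidates, key=candidates.get)

# Hypothetical scores for one decoding step (not from a real model).
step_scores = {" the": 5.0, " this": 4.1, " a": 3.2, " every": 2.0}
print(repr(pick_token(step_scores)))
```

Without the mask, greedy decoding would pick " the"; with it, the best surviving candidate wins. Real engines do the same kind of masking over the full vocabulary, typically by compiling the regex or grammar into a state machine.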