Hacker News new | ask | show | jobs
by poly2it 285 days ago
You could throw the output into a cleansing, "nonthinking" LLM, removing the steering tokens and formatting the response in a more natural way. Diffusion models are otherwise certainly a very interesting field of research.