Hacker News new | ask | show | jobs
by jaan 824 days ago
Does it support constrained generation during training?

This is what we need for the large language models I am training for health care use cases.

For example, constraining LLM output is currently done by masking, and having this rust based library would enable novel ways to train LLMs.

Relevant papers:

https://github.com/epfl-dlab/transformers-CFG

https://neurips.cc/virtual/2023/poster/70782

1 comments

It's definitely very exciting direction, which we have not explored at all!