Hacker News new | ask | show | jobs
by 2bitencryption 1032 days ago
TypeChat: let's try really hard to try to convince the model to make the highest-scoring tokens follow the grammar we want.

Guidance (and this project?): Let's not even bother with trying to convince the model; instead, we'll only sample from the set of tokens that are guaranteed to be correct for the grammar we want to emit.

1 comments

Yeah, and our addition to all that is to almost completely remove the cost of determining the next valid tokens on each step.