|
|
|
|
|
by activatedgeek
1041 days ago
|
|
> I'm not sure of why you would want to use raw llama-2 Sure. My concern was not specific to llama-2, and was only using it as a placeholder example of a decent pre-trained base model. Replace it with your favorite base model, which you want to use for guided generation. My question is more fundamental - how does post-hoc guided generation interfere with the potential benefits of instruction-tuning? > About your second point, the goal is that the model can only generate JSON (for example), which can 100% be done by constraining which output token can and cannot be used. Mechanistically, yes. I am not arguing that. The whole point is to generate JSON that is "useful". |
|