Hacker News new | ask | show | jobs
by verdverm 993 days ago
You can pretty much make up any pseudo grammar like this one, which is a reduced JSON object that is close to CUE: https://github.com/hofstadter-io/hof/blob/_dev/flow/chat/pro...

No need to be formal or use a standard format, just need a pattern the LLM can fill or follow

1 comments

I have no idea what happens when I put data through these models and how they work.

My thought was it may have been trained on so much JSON and JSON schema data, that simply providing a JSON schema and telling it the data it outputs must validate against the schema will produce good results.

gpt-3.5-turbo and gpt-4 have worked superb at this so far and I'm excited to test with the new gpt-3.5-turbo-instruct model!

yes, if you finetune, you can get some preferred formats, see something like codellama. When using more generic model, you can do more generic things.

One of the benefits of using a reduced syntax is reduced tokens, so the LLM can focus on the interesting parts while ignoring uninteresting " and ,

Let the model think more (and about the important parts)