HN Mirror


by leigna 991 days ago

I have no idea what happens when I put data through these models and how they work.

My thought was that it may have been trained on so much JSON and JSON Schema data that simply providing a JSON schema, and telling it that its output must validate against that schema, will produce good results.
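To make the idea concrete, here is a minimal sketch of what "provide a schema and ask for validating output" could look like. The schema, the prompt wording, and the helper names `build_prompt` and `check_reply` are all made up for illustration; nothing here calls a real model API, and the check covers only required keys and two simple types, not full JSON Schema validation.

```python
import json

# Hypothetical schema, invented for this example.
SCHEMA = {
    "type": "object",
    "properties": {
        "name": {"type": "string"},
        "age": {"type": "integer"},
    },
    "required": ["name", "age"],
}


def build_prompt(text: str, schema: dict) -> str:
    """Embed the schema in the prompt and ask for output that validates against it."""
    return (
        "Extract the data from the text below.\n"
        "Respond with JSON only. The output must validate against this JSON Schema:\n"
        f"{json.dumps(schema, indent=2)}\n\n"
        f"Text: {text}"
    )


# Only the two types used by the example schema are handled.
_TYPES = {"string": str, "integer": int}


def check_reply(reply: str, schema: dict) -> dict:
    """Parse a model reply and run a minimal check (required keys, simple types)."""
    data = json.loads(reply)
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    for key in schema.get("required", []):
        if key not in data:
            raise ValueError(f"missing required key: {key}")
    for key, spec in schema.get("properties", {}).items():
        expected = _TYPES.get(spec.get("type"))
        if key not in data or expected is None:
            continue
        value = data[key]
        # bool is a subclass of int in Python, so reject it for "integer".
        if not isinstance(value, expected) or (expected is int and isinstance(value, bool)):
            raise ValueError(f"wrong type for key: {key}")
    return data


prompt = build_prompt("Ada is 36 years old.", SCHEMA)
parsed = check_reply('{"name": "Ada", "age": 36}', SCHEMA)
```

The string returned by `build_prompt` is what you would send to the model; `check_reply` is where a bad reply would be caught so it can be retried. For real use, a full JSON Schema validator would replace the hand-rolled check.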

gpt-3.5-turbo and gpt-4 have worked superbly at this so far, and I'm excited to test with the new gpt-3.5-turbo-instruct model!

1 comment

verdverm 991 days ago

Yes, if you fine-tune, you can get the model to produce some preferred formats; see something like codellama. When using a more generic model, you can do more generic things.

One of the benefits of using a reduced syntax is fewer tokens, so the LLM can focus on the interesting parts while ignoring uninteresting characters such as quotes (") and commas (,).

Let the model think more (and about the important parts)
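A rough sketch of the reduced-syntax point: the same record rendered as JSON and as a quote-free, comma-free format. The `to_reduced` format is hypothetical (loosely YAML-like), and character counts are used only as a crude stand-in for tokens; actual token counts depend on the model's tokenizer.

```python
import json

# Example record, invented for illustration.
record = {"name": "Ada", "age": 36, "langs": ["en", "fr"]}


def to_reduced(obj: dict, indent: int = 0) -> str:
    """Render a dict without quotes, braces, brackets, or commas (hypothetical format)."""
    pad = "  " * indent
    lines = []
    for key, value in obj.items():
        if isinstance(value, dict):
            lines.append(f"{pad}{key}:")
            lines.append(to_reduced(value, indent + 1))
        elif isinstance(value, list):
            lines.append(f"{pad}{key}: " + " ".join(str(v) for v in value))
        else:
            lines.append(f"{pad}{key}: {value}")
    return "\n".join(lines)


def punctuation_count(text: str) -> int:
    """Count the structural characters the reduced format is meant to drop."""
    return sum(text.count(ch) for ch in '",{}[]')


as_json = json.dumps(record)
as_reduced = to_reduced(record)

print(len(as_json), punctuation_count(as_json))
print(len(as_reduced), punctuation_count(as_reduced))
```

The trade-off is that the reduced form is ambiguous for values containing spaces or colons, so it only suits data where that cannot happen or where a custom parser handles it.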
