| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by verdverm 993 days ago
	You can pretty much make up any pseudo grammar like this one, which is a reduced JSON object that is close to CUE: https://github.com/hofstadter-io/hof/blob/_dev/flow/chat/pro... No need to be formal or use a standard format, just need a pattern the LLM can fill or follow

1 comments

leigna 993 days ago

I have no idea what happens when I put data through these models and how they work.

My thought was it may have been trained on so much JSON and JSON schema data, that simply providing a JSON schema and telling it the data it outputs must validate against the schema will produce good results.

gpt-3.5-turbo and gpt-4 have worked superb at this so far and I'm excited to test with the new gpt-3.5-turbo-instruct model!

link

verdverm 993 days ago

yes, if you finetune, you can get some preferred formats, see something like codellama. When using more generic model, you can do more generic things.

One of the benefits of using a reduced syntax is reduced tokens, so the LLM can focus on the interesting parts while ignoring uninteresting " and ,

Let the model think more (and about the important parts)

link