Hacker News
Stop using JSON for LLM structured output (nehmeailabs.com)
2 points by 44za12 141 days ago
1 comment

For simple extraction tasks, a delimiter-separated string takes 11 output tokens vs 35 for the equivalent JSON. Output tokens are generated one at a time, so they are the latency bottleneck.
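A rough sketch of what the delimiter approach could look like on the parsing side, in Python. The field names, the `|` separator, and the sample values are all made up for illustration and are not taken from the linked post. The comparison printed at the end is in characters, which is only a loose proxy for tokens; real token counts depend on the model's tokenizer.

```python
import json

# Hypothetical schema: the prompt would tell the model to emit these
# fields in this exact order, separated by "|".
FIELDS = ("name", "age", "city")


def parse_delimited(text, fields=FIELDS, sep="|"):
    """Turn a delimiter-separated model reply into a dict keyed by field name."""
    parts = [part.strip() for part in text.strip().split(sep)]
    if len(parts) != len(fields):
        # The model dropped or added a field; fail loudly instead of guessing.
        raise ValueError(f"expected {len(fields)} fields, got {len(parts)}")
    return dict(zip(fields, parts))


# Illustrative model reply (assumed, not real model output).
delimited = "Ada Lovelace|36|London"
record = parse_delimited(delimited)

# The same record as JSON, i.e. what the model would otherwise have to emit.
as_json = json.dumps(record)

print(record)
print(len(delimited), "chars delimited vs", len(as_json), "chars JSON")
```

The trade-off is that the schema lives in the prompt and the parser rather than in the output, so all values come back as strings and a value containing the separator will break the field count check.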