Hacker News
by jncfhnb 743 days ago
Don’t do it all in one prompt.
1 comment

Right, but now I’m basically taking a huge performance hit and need to parallelize my queries, etc.

I was parsing a document recently, with 10-ish questions for one document. Splitting that into one prompt per question would make things expensive.

Might be what’s needed but not ideal.
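For what it's worth, the parallelizing part can be fairly small. A rough sketch, assuming an async client: `ask_llm` here is a hypothetical stand-in that just sleeps and echoes, not any real library's API, so swap in whatever call your provider offers.

```python
import asyncio


async def ask_llm(document: str, question: str) -> str:
    # Hypothetical placeholder for a real LLM API call (assumption, not a real client).
    await asyncio.sleep(0.01)  # simulates network/model latency
    return f"answer to: {question}"


async def ask_all(document: str, questions: list[str]) -> list[str]:
    # One prompt per question, all in flight at once.
    # asyncio.gather returns results in the same order as the inputs.
    return await asyncio.gather(*(ask_llm(document, q) for q in questions))


if __name__ == "__main__":
    questions = [f"question {i}" for i in range(10)]
    answers = asyncio.run(ask_all("document text", questions))
    print(len(answers))
```

This only hides latency. Every request still carries the full document, so it does nothing for the cost side.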

LLM performance is a function of the number of tokens, not the number of queries.
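To put rough numbers on the token count, here is a back-of-the-envelope sketch. The figures (a 5,000-token document, ten 20-token questions) are made up for illustration, and it assumes no prompt caching and ignores output tokens.

```python
def input_tokens_single_prompt(doc_tokens: int, question_tokens: list[int]) -> int:
    # The document is sent once, followed by all the questions.
    return doc_tokens + sum(question_tokens)


def input_tokens_split_prompts(doc_tokens: int, question_tokens: list[int]) -> int:
    # The document is re-sent with every question (assumes no prompt caching).
    return sum(doc_tokens + q for q in question_tokens)


doc = 5_000            # illustrative document size, in tokens
questions = [20] * 10  # ten short questions, illustrative sizes

print(input_tokens_single_prompt(doc, questions))  # 5200
print(input_tokens_split_prompts(doc, questions))  # 50200
```

So under these assumptions, splitting into separate prompts sends roughly ten times the input tokens, because the document is repeated with each question.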