by janpieterz 744 days ago
Right, but now I’m basically taking a huge performance hit, and I need to parallelize my queries, etc.

I was parsing a document recently: 10-ish questions for 1 document, which would make things expensive.

Might be what’s needed but not ideal.

1 comment

LLM performance is a function of the number of tokens, not the number of queries.
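
To put rough numbers on that, here is a back-of-the-envelope sketch. The token counts are made up for illustration (a 5000-token document, 30-token questions, 100-token answers), and `total_tokens` is a hypothetical helper, not any provider's API. It also ignores things like prompt caching. It only shows that the extra cost of asking 10 questions separately comes from resending the document's tokens each time, not from the request count itself.

```python
def total_tokens(doc_tokens, question_tokens, answer_tokens, n_questions, batched):
    """Rough token count for asking n_questions about one document.

    All inputs are assumed averages; real counts depend on the tokenizer.
    """
    if batched:
        # One request: the document is sent once, followed by all questions.
        prompt = doc_tokens + n_questions * question_tokens
    else:
        # One request per question: the document is resent every time.
        prompt = n_questions * (doc_tokens + question_tokens)
    # Answers are generated either way.
    completion = n_questions * answer_tokens
    return prompt + completion


# Illustrative numbers only.
separate = total_tokens(5000, 30, 100, 10, batched=False)
batched = total_tokens(5000, 30, 100, 10, batched=True)
print(separate, batched)  # 51300 6300
```

So under these assumptions the 10 separate queries cost roughly 8x the tokens of one combined query, and nearly all of the difference is the document being sent 10 times.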