Hacker News new | ask | show | jobs
by kouteiheika 162 days ago
In this context by "real time" people usually mean "as fast as I can read the reply", so, 0.0002 tokens per minute would not be considered "real time".
1 comments

Real time typically means guaranteed reaction time below 30ms, because slower reactions will make the body through up.