Hacker News new | ask | show | jobs
by UltraSane 578 days ago
async is just parallelized waiting and LLMs make CPUs wait ages for responses to async calls to LLMs allow for much higher CPU utilization.