Hacker News new | ask | show | jobs
by breckenedge 889 days ago
It doesn’t appear to be token-by-token inference. Each new completion uses a different model, but the new completion is entirely created by that model.