Hacker News new | ask | show | jobs
by darknoon 1125 days ago
afaict OpenAI's instance is massively overloaded, you can see with the 32k context model actually being faster in practice rather than slower