Hacker News new | ask | show | jobs
by wongarsu 104 days ago
But on inference they have to compete with other inference provider that just has a homepage, a bunch of GPUs running vllm and none of the training cost. Their only real advantage are the performance optimizations that they might have implemented in their inference clusters and not made public
1 comments

Qwen, at least, IIRC has some proprietary models, specifically the Max series. IIRC these have larger context windows.