HN Mirror



	by wongarsu 104 days ago
	But on inference they have to compete with other inference providers that just have a homepage, a bunch of GPUs running vLLM, and none of the training cost. Their only real advantage is whatever performance optimizations they might have implemented in their inference clusters and not made public.

1 comment

Qwen, at least, has some proprietary models IIRC, specifically the Max series. I think these have larger context windows.