| HN Mirror

	by coder68 91 days ago
	Is there interest in benchmarking the proprietary LLMs for translation? I'm curious because I often use Gemini 3 Flash, but I have no idea how good it is for my language family. I prefer open models (in fact, the smaller the better for offline use), but it'd be useful to know how well the Big Three do.


cointegrated 89 days ago

We did some benchmarking of them internally, but we're not sure if we'll publish the detailed results. Just in case, keep an eye on this space; if we release the evaluation results, they will be there: https://huggingface.co/spaces/facebook/bouquet

coder68 89 days ago

Thanks! Super interested in LLMs for translation :D Glad to see you folks doing this work.
