|
|
|
|
|
by woah
490 days ago
|
|
The three goals featured prominently above the fold are: > truly open
> including data, documentation, training and testing code, and evaluation metrics; including community involvement > compliant
> under EU regulations, OpenEuroLLM will provide a series of transparent and performant LLMs > diverse
> for European languages and other socially and economically interesting ones, preserving linguistic and cultural diversity The first one seems good, but the second two seem to be pretty beside the point of creating models that compete with the cutting edge of China and the USA. |
|
Others have responded to your "diversity" point, but making sure to train on adequate amounts of data in all EU languages is valuable, especially because LLMs are so prone to generating convincing BS when working close to the edges of their training set. If this exists, people in Malta are going to want to use it, so better for it to generate good Maltese than gibberish that sort of looks like Maltese, right?