Hacker News new | ask | show | jobs
by Gijs4g 1046 days ago
At Mirage Studio we have successfully finetuned Llama 2 7B on a Dutch dataset to get it to output Dutch in a coherent way: https://huggingface.co/Mirage-Studio/llama-gaan-2-7b-chat-hf...
1 comments

Dutch is a low-hanging fruit though, ain't it? Closely related Germanic language with heavy English influence post-war?

edit: To make the implied question explicit, I guess it might do well on other similar Germanic languages (say Norwegian) but struggle beyond that? Or?

Isn't it only a matter of languages of the input that the model was trained on? If we want it to spit out Klingon, I'd have to be trained on Klingon input, no?