|
|
|
|
|
by Timpy
188 days ago
|
|
A lot of training data was curated in Kenya[0]. I would imagine if LLM data was curated in Japan our LLMs would sound a lot like the authors of their most popular English text books. Maybe other common Japanese idioms would leak in to the training data, like "ね" or "でしょう", ChatGPT would say "Don't you agree?" at the end of every message. [0] https://www.theverge.com/features/23764584/ai-artificial-int... |
|
[0] https://en.wikipedia.org/wiki/Malkiat_Singh