Hacker News
by akdev1l 494 days ago
If it’s not talked about much, it won’t be in the training dataset much
1 comment

From what I’ve seen, AI can do a solid job of bridging language gaps, though it depends on how it’s set up. The data exists, but it’s often buried in native-language sources. By focusing on local content, like newspapers and forum discussions, you can build a dataset that pulls in more authentic, region-specific data.
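
As a rough sketch of what "focusing on local content" could look like in practice, here is a tiny filter that keeps only documents in a target language that come from local source types. Everything in it is made up for illustration: the `Doc` record, its field names, the `LOCAL_SOURCES` set, and the sample data are assumptions, not any real pipeline.

```python
from dataclasses import dataclass


@dataclass
class Doc:
    """Hypothetical corpus record; the field names are assumptions."""
    text: str
    lang: str         # language code, e.g. "sw" for Swahili
    source_type: str  # e.g. "newspaper", "forum", "encyclopedia"


# Assumed set of source types treated as local content.
LOCAL_SOURCES = {"newspaper", "forum"}


def select_local(docs, lang):
    """Keep documents in the target language that come from local sources."""
    return [d for d in docs if d.lang == lang and d.source_type in LOCAL_SOURCES]


# Invented sample data, for illustration only.
docs = [
    Doc("Habari za leo kutoka mjini", "sw", "newspaper"),
    Doc("Translated encyclopedia entry", "sw", "encyclopedia"),
    Doc("Local forum thread", "sw", "forum"),
    Doc("English news story", "en", "newspaper"),
]

local_sw = select_local(docs, "sw")
print(len(local_sw), "of", len(docs), "documents kept")
```

In a real setup the language and source labels would probably come from a language-ID step and crawl metadata rather than hand-written fields, but the idea of filtering toward native-language, region-specific sources is the same.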