Hacker News new | ask | show | jobs
by bawolff 1054 days ago
> It might work for highly technical, unambiguous, simple content

I mean, the goal is wikipedia lite basically - so they are targeting technical unambigious simple content.

My understanding is the goal to target small languages where it is unlikely anyone is ever going to put in the effort (or have a big enough corpus) to do the statistical translation methods. Sort of a - this will be better than nothing approach.

2 comments

The original paper [0] envisages a much wider scope. Vrandecic literally quotes "a world in which every single human being can freely share in the sum of all knowledge".

It also makes the task of the editor much, much more difficult than it is now.

[0] https://arxiv.org/pdf/2004.04733.pdf

Tbf, that quote gets thrown around wikimedia every 10 seconds. I wouldn't take the quote too literally.
But it seems like a huge amount of work to achieve that goal.

I suspect a large proportion of the realistic target audience are bilingual.