by londons_explore 1565 days ago
It is notable that they gained only 2.5 BLEU when translating into English, but 6.2 BLEU when translating from English into another language.

Since the network will have seen far more English text than text in other languages, this suggests the improvement is larger where training data is limited.

1 comment

That's exciting. Getting better performance with more data is trivial; the hard part is getting better performance with less data.