|
|
|
|
|
by bananaface
2151 days ago
|
|
I don't feel this way at all. Videos have automatic subtitles which you can automatically translate into the language of your choice, speech recognition is so good that the right tool will let you program with it, text to speech is a button-click away for basically any Web page (all I need is an extension). Post-processing for color blindness is amazing, left-to-right languages render readably on a consistent basis. OCR is progressing dramatically and we're starting to see projects focused at individual users, and automatic image tagging gives textual descriptions of a huge amount of picture content. We're at a point where a lot of these tools haven't matured in their consumer implementations, but that's coming. It's just a matter of time. That's all ignoring the soft accessibility of things like iPads that have made computing accessible to Grandma. |
|
I agree that we're progressing fast, but fully automated machine translation is IMHO still lightyears away (if at all feasible). And to automate subtitle generation in a foreign language, you first need to have speech to text, which is also still error-prone, so now you have two sources of errors.
We're seeing the uncanny valley problem: By now, things like machine translation are so good for simple use cases, that they're being aggressively pushed, and at first it may even appear correct / as if it was done by a human, but then suddenly the translation becomes nonsensical and weird. Even for the well-received deepl, it's still surprisingly easy to give it some text that it really struggles with.
Incidentally, I remember attending a lecture about 12 years ago by the then new professor of NLP who was talking about his success with using machine aided human translation of subtitles from Swedish into Norwegian. Granted, a lot may have improved in 12 years, but it still struck me as impressive that even in languages that closely related, the best they could hope for in a research project was machine aided translation.