by TeMPOraL
979 days ago
Surprising. I tried OCR only once so far - I took a photo of a hand-drawn poster at my kid's kindergarten, about mental health, dense with handwritten-style text mixed with various drawings. You know, the kind of hand-made infographic. And the text was 100% in Polish. I figured it was as good a test as any - I fed that photo to ChatGPT and asked it to summarize it. To my astonishment, it reproduced 100% of the content correctly, and even in the right order (i.e. how I'd read it myself, vs. strict left-to-right, top-to-bottom).

I don't know which blows my mind more - the above feat done on the first try, or that the "voice chat mode" has an unprecedented ability to correctly pick up on and transcribe what I'm saying. The error rate on this (tested in both English and Polish) is less than 5% - and that's with me walking outside, near a busy road, and the mistakes it made were on words I know I pronounced somewhat unclearly. Compare that to voice assistants like Google's, which has an error rate near 50%, making it entirely useless for me.

I don't know how OpenAI is doing it, but I'd happily pay the API rates for a GPT-4 voice-powered phone assistant, because that would actually work.