|
|
|
|
|
by manishsharan
980 days ago
|
|
The author mentions that GPT-4 is so good at Optical Character Recognition (OCR) My experience has been the opposite: I was trying to get it to read an image of a data table with header and the usual excel table color palette . It could not read most of the data. Then I tried similar read experiment with Enterprise architecture diagrams saved as png files ... same issue as it missed most of the data. I am not disputing the author .. I am trying to figure out what I am doing wrong. |
|
I don't know which blows my mind more - the above feat done on first try, or that the "voice chat mode" has unprecedented ability to correctly pick up on and transcribe what I'm saying. The error rate on this (tested both in English and Polish) is less than 5% - and that's with me walking outside, near a busy road, and mistakes it made were on words I know I pronounced somewhat unclearly. Compare that to voice assistants like Google one, which has error rate near 50%, making it entirely useless for me. I don't know how OpenAI is doing it, but I'd happily pay the API rates for GPT-4 voice powered phone assistant, because that would actually work.