|
|
|
|
|
by gnat
1136 days ago
|
|
I find that when it writes code, I can't trust its output. I handle that by being Grumpy Old Programmer: I eyeball it closely and ask myself about error handling, assumptions, off-by-one errors, is it confusing the 1.1 API with 2.0, is it efficient or naive, and all the other questions that happen in a code review. So you're pushing 100 Chinese numbers through ChatGPT to get Arabic equivalents. What do you then do to ensure the quality of output is high? Do you eyeball the list and go "hm, seems plausible"? Spot checks? Is there some context around the lists that means erroneous translations will be quite obvious to the trained eye? I'm always curious what QA looks like in other fields. |
|