According to my wife (a native Chinese speaker), GPT-3.5 is bad at writing Chinese, but GPT-4 does a good job.