by johnfn
502 days ago
GPT can't "see" the results of the scatterplot (unless prompted with an image); it only sees the code it wrote. If a human had the same constraints, I doubt they'd notice there was a gorilla there. Take a screenshot of the scatterplot and feed it into multimodal GPT, and it does a fine job of identifying it.

EDIT: Sorry, as a few people pointed out, I missed the part where the author did feed a PNG into GPT. I kind of jumped to conclusions when it worked fine for me. I still maintain that the article's conclusion ("Your AI Can't See Gorillas") is overly broad, given that I had no trouble getting it to see one.

But I wonder why the author had trouble? My suspicion is that the AI got stuck on summary statistics because the previous messages in the chat were all about summary statistics.
> I asked the model to closely look at the plot, and also uploaded a png of the plot it had generated.