Hacker News
by
kmaitreys
56 days ago
I think there's a lot of difference between sounding like someone and being someone. The models are excellent at pretending indeed.
2 comments
falcor84
55 days ago
I don't think that sama was arguing that ChatGPT actually passed a PhD thesis defense. But arguably, it could make for an interesting benchmark.
kmaitreys
55 days ago
Please do not get swayed by, or defend, the words vomited by a snake oil salesman.
Also, what benchmark? How will you design it?
0123456789ABCDE
56 days ago
exactly. this is what the whole RL thing is optimizing for, even if that is not the intent.