by simonw
370 days ago
It wasn't until I put these slides together that I realized quite how well my joke benchmark correlates with actual model performance - the "better" models genuinely do appear to draw better pelicans and I don't really understand why!