| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by gwern 925 days ago
	I'd say, trying to read this, the biggest problems are: - tons of visual clutter, all those gradients and lines like the header or hero image - a floating ToC which insists on jamming in 'recommended links' (?!) the entire time - no outlines. Every single image or screenshot blends into the actual article. - a visual summary which is hard to read because it has tiny text and looks like a correlation heatmap instead of a table - highly inconsistent use of linking. Like, why does 'We have evaluated Gemini across four separate vision tasks:' link only 2 of the 4, and then not to the section in this article? - highly repetitive screenshots, which add nothing, and in conjunction with the lack of outlines for the images and the many outlines inside the images, means that the benchmark sections are a frustrating visual jigsaw puzzle where you have to decode the screenshot again and again to look at the tiny text inside it. It would be better to provide one (1) screenshot of each model's UI, which is all I need to see to get an idea of what it looks like and the implied workflow and what sort of metadata/options it has, and then for each task simply show the image/prompt and each model's responses as a normal blockquote or text.

2 comments

tomp 925 days ago

^^ reformatted

- tons of visual clutter, all those gradients and lines like the header or hero image

- a floating ToC which insists on jamming in 'recommended links' (?!) the entire time

- no outlines. Every single image or screenshot blends into the actual article.

- a visual summary which is hard to read because it has tiny text and looks like a correlation heatmap instead of a table

- highly inconsistent use of linking. Like, why does 'We have evaluated Gemini across four separate vision tasks:' link only 2 of the 4, and then not to the section in this article?

- highly repetitive screenshots, which add nothing, and in conjunction with the lack of outlines for the images and the many outlines inside the images, means that the benchmark sections are a frustrating visual jigsaw puzzle where you have to decode the screenshot again and again to look at the tiny text inside it. It would be better to provide one (1) screenshot of each model's UI, which is all I need to see to get an idea of what it looks like and the implied workflow and what sort of metadata/options it has, and then for each task simply show the image/prompt and each model's responses as a normal blockquote or text.

link

zerojames 924 days ago

All: I sincerely appreciate the time spent sharing feedback. Your notes and comments are helpful and give me tools to be a better writer .

Regarding the screenshots, I am not a fan of this approach. We adopted it because of the early trend to share ChatGPT screenshots, and to ensure people could see the origin of our prompting (the web interface).

I will start a discussion about screenshots with the team. This can be better.

I will also discuss the layout, too. Machine learning and AI is difficult enough. To the extent to which we can focus attention on the most important part of the page — the content — we should.

Thank you again for your notes! I appreciate it.

link

rezonant 925 days ago

While I don't necessarily agree with all of these points,

> link only 2 of the 4, and then not to the section in this article?

This one is particularly prevalent on websites and it's quite annoying. When the site has any topic explainer articles, the terms that refer to those topics are always linked to those other articles, presumably to increase ad impressions and keep users on their site- but when there are legitimate article-specific links (which are almost always what I want), I have no way to locate those links (for instance when finding an original source).

Back in the day websites would use a different link style for this sort of "internal plug" style links, which was helpful. I guess it died out because users didn't want to click them. So the solution is, make it hard to tell which ones are internal plugs!

link