Hacker News new | ask | show | jobs
by yanis_t 882 days ago
Here are some better ideas from the top of my head:

- summarise an article

- find information on a given topic (free-form input text)

- full voice control ("click that link", "read that article", "find this")

- auto-submit a captcha

5 comments

> "auto-submit a captcha"

we have come full circle

well, they're too hard for me to solve on my own
When I am prompted to 'try again' with a new 3x3 of low-res images I often wonder if there was a bit of 'crosswalk' I missed in one, or if this is just how they get me to annotate another set for 'bicycles' for free.
When they decide I'm a bot and just give me the low res stuff I give up accessing the website.
Well, we have been training them to do this for a while...
Take this tab collection, build a model or a RAG or whatever around them:

- Let me chat with a bot that knows the information from the collection

- Use the information to generate a summary

- Let me guide it in generating a well sourced article

Build a knowledge graph from the web

- Trace a source of information back to the originating point to help eliminate derivative blog spam

- Help moderate media bias and challenge echo chambers

Automatically recognize spam, scams, etc.

Let me describe something I need in text, return back links to shopping sites that sell that thing, if nobody has it, generate a 3d model, or more formal description of it and supply me with connections to let me farm it out to an additive manufacture, one-off makerspace place or something.

- Post resulting article on your spam blog

- The circle continues

Seriously, their first example seems useless to most people. Naming a tab group??? That doesn't take any time, little thought, and who does that regularly?

Summarizing an article seems like something everyone else can do OK. It's a huge avenue for bias (maybe that's why it's reasonably elided) but at least it's actually useful.

It's not just naming, but grouping as well. If you have 200 tabs and don't want to spend too much time on it, this can be very helpful.
How long until malicious pages fool this into placing themselves into, say, a banking tab group?
How would that affect the end user except they go into the banking tab group and go "Huh. This isn't a banking website" and close it?
Tab groups are a mental shortcut so you can spend less time figuring out the nature of the tab you're looking at or finding particular tabs.

If something automates their creation then there is absolutely an advantage to sites that can subvert the classification method, because users will start with the "banking site" expectation instead of no expectation at all.

Who has that many bank website tabs open? And that are still active, because of course bank websites log you out if you do nothing within a minute or two. And then create a tab group from those tabs, and then the bad actor correctly guesses which bank you use and has bypassed Safe Browsing (which also uses ML now) and you visit the tab and manually (because the password manager won't work) type in your credentials?

Worrying about this as a potential threat is on the same level as my bank "disabling" right click.

There are already phishing websites which copy the look of the real website, and a user could easily mix the two up.
This is the only feature I'm excited about. I perpetually have 100+ tabs opened and have tried tab groups but eventually things get disorganized again. The ability to automatically group similar tabs, assuming it works, is going to be game changing.
To automatically group similar tabs I am using ToChunkA Smart TabS extension.
> Seriously, their first example seems useless to most people. Naming a tab group??? That doesn't take any time, little thought, and who does that regularly?

Funny, naming things, whether variables or groups of things is the main reason I use LLMs to date. Add in grouping as well and that handles something that puts me under a lot of cognitive load, because I can never shake the feeling I have ot yet manually grouped things optimally.

Maybe they are just testing AI with less used features first
1. This is actually a feature on Pixel through the assistant [0], surprised it's not on Chrome itself

2. That doesn't really seem like a Chrome feature? Belongs more on Bard.

3. That seems like a Google Assistant feature too, some of that actually may work on a pixel phone, though might be nice to have on desktop too.

4. Will never happen. Google themselves have a captcha product so defeats the point.

[0] https://support.google.com/assistant/answer/14163109?hl=en

I was excited to test it on my pixel 7 just to find out it's only available on Pixel 8. Why?....
...and only in the US.
> 1. This is actually a feature on Pixel through the assistant [0], surprised it's not on Chrome itself

It is only on the Pixel 8, not the previous models and their mid-range $ variants so they aren't giving it away for free just yet

I wonder if it's done locally or through the cloud. Some of the language around the release of Gemini Nano implies that it may be done locally?

https://store.google.com/intl/en/ideas/articles/pixel-featur...

I think for #2, they meant like AI-powered control-F / find in page.

Which is actually the first non-novelty AI tool I've heard someone pitch that actually sounded like a good idea. Way more visible failure mode than summarizing.

Watch an ad so i don't have to