Hacker News new | ask | show | jobs
by ilaksh 512 days ago
The best models are not language models but multimodal. The grounding of language in video data, new model architectures, and larger models will improve the robustness.

That HeyGen video does not suck. It's actually kind of hard to even tell it's AI if you are only looking at it for a few seconds.

The interesting thing comparing a human's learning to an AI is that AI skills and knowledge can be copied basically infinitely, whereas a human is a one of a kind.

I imagine some parents are putting in efforts with the goal of raising the most productive member of society they possibly can. AI teams have somewhat similar goals for the models they are training.

We could see AI take control of the planet within the next four years in order to end WWIII. We should just hope that they keep lots of us around in giant people zoos.

1 comments

I agree the multimodal stuff is amazing. I'm seriously impressed with the new Gemini 2.0 family of models and can't wait until the full multimodal capabilities are in general release.

In terms of the HeyGen vid, it's passable, but that was something I literally whipped up in 10 minutes. You can make ones that are much, much better if you invest in creating better training material. The voice and video model in this case only used the one 3-minute source video.

Funny you mention the "people zoo" thing. That's actually part of a sci-fi story I've been trying to write since I was in my teens. Roughed out here: https://youtu.be/2KLdaVs_ugw

Kurt Vonnegut did it first.
You know what's wild? Not ever having heard of him, I can find out who he is and how this relates to the conversation in milliseconds.

For others' benefit:

"Kurt Vonnegut explored themes of humans being observed by extraterrestrial beings in a zoo-like setting in his novel Slaughterhouse-Five. ... While this scenario involves humans being placed in a zoo by aliens, it does not specifically depict artificial intelligence (AI) as the captors. However, Vonnegut did address the impact of automation and machines on human society in his debut novel, Player Piano."

Amazing times we live in. Strange and scary, but also amazing.

Great book. Well worth a read.