by Eisenstein 828 days ago
But you are specifically talking about one type of AI, which is a generative language model. There are tons of other AIs with different applications that do not need to be trained on the entire internet. You have computer vision, which splits into object recognition, classification, OCR, etc.; you have audio, which covers text-to-speech (and the reverse), music generation, and all sorts of other things; machine translation; sentiment analysis (I won't list all the categories on Hugging Face, but you get my point). These are not differentiated merely by 'training data', to my understanding, so that's why your comment didn't make sense to me.

Calling all AI LLMs is like calling all of the internet the web. Of course if I am mistaken, corrections are welcome.


I agree. There are other types of AI with different applications that do not need to be trained on the internet. The examples you have given, however, are ones where the deep nets are extremely data hungry.

Take computer vision for example - a "hello world" version of object recognition would use ImageNet, which is about 14 million hand-annotated images, or CIFAR-10, which is 60,000 labeled images drawn from the 80 Million Tiny Images collection. That, of course, only sets the stage for training data differentiation. Google's image recognition algorithm is far superior to other search engines'. Why? Because of Google's data set.

Any Tom, Dick, and Harry can go create their own image recognition AI and train it on all the public datasets (COCO, CIFAR, ImageNet), but that's considered pretty baseline nowadays. The differentiator is what _other_ datasets you have.

Different datasets yield different results, whatever the network is. More data is better (usually).
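
To make the "more data is better (usually)" point concrete, here's a toy sketch. Everything in it is my own assumption for illustration: synthetic 1-D data from two overlapping Gaussian classes, a nearest-centroid classifier, and made-up function names. It has nothing to do with any real vision dataset; it just shows the same model getting more reliable as the training set grows.

```python
import random


def sample(label, rng):
    # Class 0 is centered at -1, class 1 at +1, both with unit variance.
    return rng.gauss(-1.0 if label == 0 else 1.0, 1.0)


def train_centroids(n_per_class, rng):
    # "Training" is just estimating each class mean from n samples.
    return [
        sum(sample(label, rng) for _ in range(n_per_class)) / n_per_class
        for label in (0, 1)
    ]


def accuracy(centroids, n_test, rng):
    # Classify each fresh test point by its nearest estimated centroid.
    correct = 0
    for i in range(n_test):
        label = i % 2
        x = sample(label, rng)
        predicted = min((0, 1), key=lambda c: abs(x - centroids[c]))
        correct += predicted == label
    return correct / n_test


def run(n_per_class, seed=0, n_test=10_000):
    rng = random.Random(seed)
    return accuracy(train_centroids(n_per_class, rng), n_test, rng)


if __name__ == "__main__":
    # Same model every time; only the amount of training data changes.
    for n in (1, 10, 100, 1000):
        print(n, run(n))
```

With very few samples the result depends heavily on the luck of the draw; with a lot of samples it settles near the ceiling for this setup, which is about 84% because the two classes overlap. That ceiling is the "usually" part: past some point, more of the same data stops helping.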

> But you are specifically talking about one type of AI, which is a generative language model.

...Because that's easily and widely understood to be what people mean in recent times when they're talking about "AI", referring to the stuff that's in the news, without further qualifiers.

If you want to talk about something more specific, you are going to need to be explicit about it, rather than expecting everyone else to understand what you've got in your head without actually saying it.

This is like saying "but 'crypto' means so much more than just cryptocurrency! There's a whole cryptography field out there that does lots of good stuff!" It's true, but it's not helpful, because it ignores the context that is obvious (at least to the other participants in the discussion). In this particular case, the context should be even more obvious, because it's so clear that's what the article is talking about.

I thought we were on a site where people were knowledgeable and precise about the technical subjects being discussed.

It doesn't matter how knowledgeable and precise the people you're talking with are; you still need to communicate clearly about what you're actually talking about.

I disagree with your take here. While LLMs do enable significant functionality, we, of all categories of service providers, should be clear about whether we are referring specifically to the LLM fad or to the adoption of AI to enable services generally, which is the vision that drives the excitement behind the LLM fad.

When people read our comments in 5 years, they will read "AI" and understand it to cover far more than the present excitement about LLMs.