Hacker News new | ask | show | jobs
by rossdavidh 1100 days ago
AHAHAHAHAHA! There is approximately a 0% chance that the big companies paying for data annotation will be far-sighted enough to avoid LLM-automated labeling of their data, for several reasons:

1) it will work well, at first, and only become low-quality after they (and their budgets) have become accustomed to paying 1/20th as much for the service

2) even if they pay for "human" labeling, they will go for the low cost bid, in a far-away country, which will subcontract to an LLM service without telling them

3) "hey, we should pay more for this input, in order to avoid not-yet-seen quality problems in the future", has practically never won an argument in any large corporation ever. I won't say absolutely 0 times, but pretty close.

Long story short, the use of LLM's by Big Tech may be doomed. Much like how "SEO optimization" turns quickly into clickbait and link farms if there is not high-urgency and high-priority efforts to combat it, LLM's (and other trendy forms of AI that require lots of labeled input) will quickly turn sour and produce even less impressive results than they already do.

The current wave of "AI" hype looks set to succeed about as well as IBM Watson.

1 comments

Have you worked with labeling services before? The quality is always terrible. At least with an LLM I can get consistently terrible output, quickly.
A few years ago, I started spoiling answers to captchas. I make a game of getting as many false positives and negatives I can get away with.
"Consistently terrible output, quickly!"(TM)