Hacker News
by treyd 954 days ago
I think it's very plausible, but it would first require a ton of training data cleaning, using existing models to rework existing datasets so they fit that narrower paradigm. These models are so powerful and flexible because all they're doing is modeling the statistical "shape" of existing text. Being able to ask "what's the most likely word here?" and "what's the most likely thing to come next?" is a really useful primitive, but it has downsides like this.
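
To make the "what's the most likely thing to come next?" primitive concrete, here's a toy sketch. It's a plain bigram counter, not how a real language model works (those use neural networks over tokens), and the names `train_bigrams`, `most_likely_next`, and the tiny corpus are all made up for illustration:

```python
from collections import Counter, defaultdict


def train_bigrams(text):
    """Count, for each word, which words follow it in the training text."""
    counts = defaultdict(Counter)
    words = text.lower().split()
    for prev, nxt in zip(words, words[1:]):
        counts[prev][nxt] += 1
    return counts


def most_likely_next(counts, word):
    """Return the most frequent follower of `word`, or None if it was never seen."""
    followers = counts.get(word.lower())
    if not followers:
        return None
    return followers.most_common(1)[0][0]


# Hypothetical toy corpus; a real model would be trained on far more text.
corpus = "the cat sat on the mat . the cat ate the fish . the cat slept ."
model = train_bigrams(corpus)

print(most_likely_next(model, "the"))  # "cat" follows "the" 3 times out of 5
print(most_likely_next(model, "dog"))  # None: "dog" never appears in the corpus
```

The model only knows what its training text looked like, which is the flexibility and the downside in one: change the data and the "most likely" answer changes with it.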