Hacker News new | ask | show | jobs
by RicoElectrico 1805 days ago
Compare this with the famous quote:

> Every time I fire a linguist, the performance of our speech recognition system goes up. - Fred Jelinek

2 comments

> Every time I fire a linguist, the performance of our speech recognition system goes up. - Fred Jelinek

This one is easy to misapply. If you are applying your domain experts to the model, you might have a bad time. If you are applying them to the data, most likely not. And data is usually more important than the model.

> And data is usually more important than the model.

idk. we went from conv nets to transformers just to have the quality of our predictions go up as well as reducing the amount of data prep time by a factor of 20.

no change in data, just a better model.

in my field, improvements are nearly always made in the model. never in the data or data prep. (crowd countinf, people tracking, etc)

I very nearly said this myself!

I think the mistake of this quote is in the application of the expertise. The bitter lesson is that data + compute can outperform inductive biases but that doesn't mean you don't need domain expertise to get the right data.