Hacker News new | ask | show | jobs
by cdgore 3582 days ago
>> "Google's building models with billions of parameters that require much more than 200MB, and that are really, really good at scoring data. I have to believe either that a) Apple is not telling us everything, or b) they haven't figured out a way to bring their customers the most powerful AI yet. (And the answer could very well be c) that I don't understand what's going on...)"

The 200 MB figure quoted appears to refer only to the model stored locally on the phone. In my experience, 200 mb translates to a few million parameters in one or more sparse matrices.

The figure on the whiteboard in the background says "Hey Siri small". I take that to indicate the model that does feature extraction and prediction for some queries, such as "set a timer for 20 minutes", while there is a larger, more general model for other use cases in the cloud.