Hacker News new | ask | show | jobs
Falcon 7B running real time on CPU (youtube.com)
11 points by mezark 1080 days ago
1 comments

Falcon 7B running real time on CPU
The linked video seems to have no context provided? What is a titan ML server? Is 7B actually that useful? How does the model compare to others? Etc…
Hey there - TitanML is these guys: https://www.titanml.co/ . I think the impressive thing isn't actually whether the model is good (although it is a good model especially when fine-tuned) - but how fast this model runs on CPU with the TitanML server compared with before.