by sujayk_33 408 days ago
The faster inference there comes from the hardware (LPUs); the question here is about architectures (AR or diffusion).
1 comment

I realize that, but it can already be used with many models in real-world situations. I just wanted to mention it in case someone doesn't know about it.