by specproc 1180 days ago
Yeah, #1 just makes this seem pointless for the time being. The whole point of needing something like this is horizontal scaling.

Also not clear from my phone down the pub if inference is needed at each step. That would be slow, no? Even (especially?) if you owned the model.

1 comment

No inference is needed. IME it can do a single page in ~10s at ~$0.01/page. Not practical for most use cases, but great for a limited few right now.
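To put the ~10s and $0.01 per page figures in perspective, here is a rough back-of-the-envelope sketch. The function name `estimate` and the `workers` parameter are hypothetical, added only for illustration, and it assumes throughput scales linearly with workers, which the thread does not confirm.

```python
import math

SECONDS_PER_PAGE = 10.0   # "~10s" per page, taken from the reply above
DOLLARS_PER_PAGE = 0.01   # "$0.01/page", taken from the reply above


def estimate(pages: int, workers: int = 1) -> tuple[float, float]:
    """Return (wall-clock hours, total dollars) for processing `pages` pages.

    `workers` is a hypothetical number of parallel workers. workers=1 models
    the case with no horizontal scaling. Linear scaling is an assumption.
    """
    batches = math.ceil(pages / workers)          # rounds of work per worker
    hours = batches * SECONDS_PER_PAGE / 3600     # wall-clock time
    dollars = pages * DOLLARS_PER_PAGE            # cost does not shrink with workers
    return hours, dollars


hours, dollars = estimate(100_000)
print(f"{hours:.1f} h, ${dollars:,.2f}")
```

With a single worker, 100,000 pages comes to roughly 278 hours (about 11.5 days) and around $1,000, which is why the lack of horizontal scaling matters: more workers would cut the wall-clock time but not the bill.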