Hacker News new | ask | show | jobs
by dealuromanet 1045 days ago
Whoa, 50 tokens/second locally sounds amazing. Any recommendations on guides or documentation for setting up the stack to run on hardware like that?