Petals runs Llama 2 (70B) from Colab at 5 tokens/sec (github.com)
5 points by borzunov 1070 days ago
2 comments

We've moved to a new domain; the chat is now at https://chat.petals.dev
Great project and I'm happy to see it expand to more models!