by borzunov 1269 days ago
There's a lightweight HTTP API for inference: https://github.com/borzunov/chat.petals.ml#http-api-methods