Hacker News new | ask | show | jobs
by magdyks 950 days ago
A great framework for serving many fine-tuned llms in production by quickly swapping adapters for the same base model (eg. Llama-2-70b)