Hacker News
by satvikpendem 134 days ago
How's the inference speed? What was the price? I'm guessing you can fit the entire model without quantization?