Hacker News
by bryukh 489 days ago
How does this compare to just training separate quantized models? Is it actually easier in practice?