by WithinReason 191 days ago
> On the 224× compression language, the claim is specifically about task-specific inference paths, NOT about compressing the entire model or eliminating the teacher.

I understand that after reading the paper, but that qualification isn't in the title, and the title is what people read first. Leaving the 224× figure out of the title might have gotten you a much more favorable reception.

It's not easy to get noticed when you're not from a big lab, so don't get discouraged. It's nice work.