by WithinReason
191 days ago
> On the 224× compression language, the claim is specifically about task-specific inference paths, NOT about compressing the entire model or eliminating the teacher.

I understand that after reading the paper, but that qualification is not in the title, and the title is what people read first. Omitting the 224× claim from the title might have given you a much more favorable reception. It's not easy to get noticed when you're not from a big lab, so don't get discouraged. It's nice work.