Hacker News
by timfsu 61 days ago
The question is: if the SOTA models disappear, can these follow-on models improve themselves without distillation?