by hedgehog 23 days ago
The data I've seen is stuff like the KL divergence comparisons that Unsloth does, which show that something changes but don't make clear whether there's an observable or significant difference in task performance.
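
For anyone unfamiliar with the metric, here is a rough sketch of what a KL divergence comparison measures for a single next-token distribution. The logits are made up for illustration and the helper names are hypothetical; this is not Unsloth's actual methodology, just the textbook formula.

```python
import math

def softmax(logits):
    """Turn raw logits into a probability distribution."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) in nats; p is the reference, q is the approximation."""
    return sum(pi * math.log(pi / max(qi, eps)) for pi, qi in zip(p, q) if pi > 0)

# Hypothetical logits for one token position (illustrative values only).
reference_logits = [2.0, 1.0, 0.1]   # e.g. the full-precision model
quantized_logits = [1.9, 1.1, 0.1]   # e.g. a quantized variant

p = softmax(reference_logits)
q = softmax(quantized_logits)

kl = kl_divergence(p, q)
same_top_token = p.index(max(p)) == q.index(max(q))

print(f"KL divergence: {kl:.5f} nats")
print(f"Same top token: {same_top_token}")
```

In this toy case the KL divergence is nonzero while the most likely token is unchanged, which is the gap I mean: the distributions measurably differ, but that alone doesn't tell you whether task results would.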