Hacker News new | ask | show | jobs
by scv119 1047 days ago
The tweet is referring to a paper that fine tunes Chinese dataset on english base model. I'm not surprised with LoRA's poor result in this setup.