Hacker News
by Szpadel 1158 days ago
Are you referring to the Dolly model? I think its training set could achieve similar performance if we fine-tuned a similarly sized model on it.