by mluo 494 days ago
Nice, very glad to see it works! Small models are very sensitive to the dtype :(