Hacker News new | ask | show | jobs
by kromem 897 days ago
Others provided good advice about technically doing it.

A note in terms of the project goals:

Make sure to remember when interpreting your results that your findings will only apply to similarly sized models as what you trained.

So you'll have found the differences between using Reddit vs Wikipedia for a 7B model (or whatever size you go with) and those results shouldn't be assumed to extrapolate to larger models.