Hacker News new | ask | show | jobs
by Gracana 200 days ago
That is literally the thing the parent poster wants to avoid by running open models.

[edit] I was a little unfair -- lack of access to training data is a bit of an issue (perhaps moreso for analysis than for for actual use, considering what it takes to train these models). I'm thankful that some of them are also distributed as base models, which should be relatively unbiased compared to what happens later during finetuning.

1 comments

Run them on what though?
Three power supplies, an old server, a grocery cart and a box fan, and every 3090 you and your friends can get your hands on.