Hacker News
by _ea1k 501 days ago
I feel like a lot of people in this thread have never done continued training on an LLM and it shows.

Seriously, a set of weights that already works really well is basically the ideal basis for a _lot_ of ML tasks.

1 comment

The question is not whether it is ideal for some ML tasks. The question is whether you can do the things you could typically do with open source software, including reading the source and building it, or modifying the source and building it. If you don't have the original training data, or a mechanism for obtaining it, the compiled result is not reproducible the way normal code would be, and you cannot make a version saying, for example: "I want just the same, but without it ever learning from CCP prop."