by samj 607 days ago
Nobody's asking for exact reproducibility — if the source code produces the software and it's appropriately licensed then it's Open Source.

Similarly, if you run the scripts and they produce the model, then it's Open Source that happens to be AI.

To quote Bruce Perens (definition author): the training data IS the source code. Not a perfect analogy but better than a recipe calling for unicorn horns (e.g., FB/IG social graphs) and other toxic candy (e.g., NYT articles that will get users sued).