| > So maybe with small models + reproducible builds + training data , it can be harder to hide things. Eh, not quite. Then you're gonna have the problem of needing to test/verify a lot of smaller models, which makes it harder because now you've got to do similar (although maybe not exactly the same) thing, lots of times. > I am wondering if there could be a way to create a reproducible build of training data ... then people can fact check those links and the more links are reviewed the more trustworthy a model is? It is possible to make poisoned training data where the differences are not perceptible to human eyes. Human review isn't a solution in all cases (maybe some, but not all). > If we are using ai in defense systems. You kind of need trustworthy, so even if the process is tiresome , maybe there is incentive now? DARPA has funded a lot of research on this over the last 10 years. There's been incentive for a long while. > Or maybe we shouldn't use ai in defense systems Do not use an unsecured, untrusted, unverified dependency in any system in which you need trust. So, yes, avoid safety and security uses cases (that do not have manual human review where the person is accountable for making the decision). |