I personally would love to be able to store datasets next to code for regression testing, easier deployment, easier dev workstation spin up, etc.
Once you get to that amount of images it would be much easy to manage it with some files storage solution.
Or I'm missing something important?
Which is a huge hassle, and a lot of work I’d rather not do.
My current photogrammetry dataset is well over 1TB, and it isn’t a lot for the industry by any stretch of the imagination.
In fact, the only thing that considers it ‘a lot’ and is hard to work with is git.
Once you get to that amount of images it would be much easy to manage it with some files storage solution.
Or I'm missing something important?