|
|
|
|
|
by gschoeni
921 days ago
|
|
We've been working on a data version control system called "oxen" optimized for large unstructured datasets that we are seeing more and more with the advent of many of the generative AI techniques. Many of these datasets have many many images, videos, audio files, text as well as structured tabular datasets that git or git-lfs just falls flat on. Would love anyone to kick the tires on it and let us know what you think: https://github.com/Oxen-AI/oxen-release The commands are mirrored after git so it is easy to learn, but optimized under the hood for larger datasets. |
|