Hacker News new | ask | show | jobs
A general representation modal across vision, audio, language modalities (github.com)
1 points by logikblok 1119 days ago