Hacker News new | ask | show | jobs
by slfreference 151 days ago
I think this indicates the features from vision and audio got aligned properly and hence we know what is what intuitively.