by kren1 3785 days ago
An r of 0.7 is a bit optimistic, I would say. We only got 0.7 for people who were also in the training data. For completely unseen people, the model learned nothing from audio spectrograms. I speculate that convolutions on raw audio would work better.

Estimation from video seemed much better. There we actually got r = 0.5 on unseen people, which I find very promising.
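
To make the seen/unseen distinction concrete, here is a rough sketch of how one might score a model with Pearson's r on people held out of training entirely. This is not our actual pipeline: the `(person_id, prediction, target)` sample layout and the names `pearson_r` and `split_by_person` are made up for illustration.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

def split_by_person(samples, held_out_ids):
    """Split samples so that held-out people never appear in the training set.

    Each sample is a hypothetical (person_id, prediction, target) tuple.
    """
    train = [s for s in samples if s[0] not in held_out_ids]
    test = [s for s in samples if s[0] in held_out_ids]
    return train, test

# Toy data, purely illustrative: three people, two samples each.
samples = [
    ("a", 0.9, 1.0), ("a", 0.2, 0.1),
    ("b", 0.8, 0.7), ("b", 0.3, 0.4),
    ("c", 0.5, 0.9), ("c", 0.6, 0.2),
]
train, test = split_by_person(samples, held_out_ids={"c"})

# r on unseen people is computed only over the held-out samples.
r_unseen = pearson_r([s[1] for s in test], [s[2] for s in test])
```

A random per-sample split would let the same person land on both sides, which is the setup where a number like 0.7 can look better than it really is.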