by dangom
1275 days ago
Jeremy Howard presents this idea in literally the first Deep Learning for Coders class (most recent edition). A student wanted to classify sounds but only knew how to do vision, so they converted the sounds to spectrograms, fine-tuned a vision model on the labelled spectrograms, and the classification worked pretty well on test data.
That of course does not take any merit away from the Riffusion authors, though.
edit: added a bit more to the thought
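
For anyone curious what the sound-to-image step might look like, here is a rough sketch using only NumPy. This is my own illustration, not the code from the class or from Riffusion: the function names (`spectrogram`, `to_image`), the frame length, and the hop size are all assumptions, and the fine-tuning step itself is not shown.

```python
import numpy as np

def spectrogram(signal, frame_len=256, hop=128):
    """Log-magnitude spectrogram, shape (freq_bins, frames).

    frame_len and hop are illustrative defaults, not values from the class.
    """
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    # Slice the signal into overlapping, windowed frames.
    frames = np.stack([
        signal[i * hop : i * hop + frame_len] * window
        for i in range(n_frames)
    ])
    # Magnitude of the real FFT of each frame, compressed with log1p.
    mag = np.abs(np.fft.rfft(frames, axis=1))
    return np.log1p(mag).T

def to_image(spec):
    """Rescale a spectrogram to 8-bit grayscale pixel values."""
    lo, hi = spec.min(), spec.max()
    scaled = (spec - lo) / (hi - lo + 1e-12)
    return (scaled * 255).astype(np.uint8)

# Example: one second of a 1 kHz tone sampled at 8 kHz.
sr = 8000
t = np.arange(sr) / sr
tone = np.sin(2 * np.pi * 1000 * t)
img = to_image(spectrogram(tone))
```

The resulting array can be saved as a grayscale image and fed to whatever image classifier you already know how to fine-tune, which is the whole trick.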