|
|
|
|
|
by nicklo
2309 days ago
|
|
On the contrary, with a more generous reading of the previous comment, it holds some merit. 1. CNN's are used fairly commonly for sequence tasks nowadays. Convolutions can be 1D after all.
2. It's also possible the previous comment was referring to using 2D convolutions on the spectrogram of the audio, which is a common approach.
3. Neural networks are capable of more than classification. Scoring is a regression task which is common application of neural networks. |
|
2. Since data is MIDI-encoded, would a convolution hold any merit here? I suppose you could render to an mp3 and analyze the audio itself but that seems very computationally expensive and prone to overfitting. 3) If we're training a scoring classifier, we would need labeled data, but getting those labels seems very challenging, not least because of how subjective our impressions of melodies can be (for instance, the opinions of a fan of atonality would be drastically different from a fan of pop). Do you have any ideas on how to mitigate this?