by jacobr1 1740 days ago
You might want to consider comparing generated sound files rather than the abstract notation. If you have the ground-truth notation, render it using the same mechanism as your transcription. Then you can use various spectral comparison techniques on the audio, including things like Fourier analysis, to compare structure.
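A rough sketch of what I mean, assuming both renderings are already loaded as mono NumPy arrays at the same sample rate. The function names, the frame and hop sizes, and the log-magnitude distance are my own choices for illustration, not a standard metric, and the sine tones just stand in for the two rendered files:

```python
import numpy as np

def magnitude_spectrogram(signal, frame_size=1024, hop=512):
    """Short-time Fourier transform magnitudes, one row per frame."""
    window = np.hanning(frame_size)
    n_frames = 1 + (len(signal) - frame_size) // hop
    frames = np.stack([
        signal[i * hop : i * hop + frame_size] * window
        for i in range(n_frames)
    ])
    return np.abs(np.fft.rfft(frames, axis=1))

def spectral_distance(a, b, frame_size=1024, hop=512):
    """Mean absolute difference of log-magnitude spectrograms (0 = identical)."""
    n = min(len(a), len(b))  # compare only the overlapping part
    spec_a = magnitude_spectrogram(a[:n], frame_size, hop)
    spec_b = magnitude_spectrogram(b[:n], frame_size, hop)
    return float(np.mean(np.abs(np.log1p(spec_a) - np.log1p(spec_b))))

# Stand-ins for the two rendered audio files (hypothetical pitches).
sample_rate = 22050
t = np.arange(sample_rate) / sample_rate
reference = np.sin(2 * np.pi * 440.0 * t)     # ground truth rendered as A4
same_note = np.sin(2 * np.pi * 440.0 * t)     # transcription matches
wrong_note = np.sin(2 * np.pi * 466.16 * t)   # transcription a semitone off

print(spectral_distance(reference, same_note))
print(spectral_distance(reference, wrong_note))
```

Lower is closer. Because both files come out of the same renderer, timbre differences mostly cancel out and what is left should reflect actual pitch and timing errors in the transcription.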