Hacker News new | ask | show | jobs
by ChrisGammell 2201 days ago
I have about 500 hours of high quality, channel isolated (separate from the person I was speaking to) audio. It comes from my podcast that I have done for many years. It's probably closer to 75-100 hours audio of me actually speaking, since I am more the interviewer.

Is that something that would be useful to a researcher in any context? I am intrigued by the idea of having my voice preserved (you know, ego), but also am happy to donate the sound files if they would help researchers in any way for datasets.

If so: chris@theamphour.com

1 comments

Do you have transcripts, even just for some of the episodes? Unsupervised learning is possible but more difficult.

In general, yes, this is probably useful data in some way for speech recognition or TTS.