Hacker News new | ask | show | jobs
by jackschultz 3491 days ago
I've been working on validating my thought that country music lyrics are all about very similar (and very cliche) topics, and I realized I needed to label all the lyrics I scraped from Genius. Since there are 5000 or so, and I didn't want to have to do all of it myself or rigged with some google doc, I built an app to more easily collect training data. It's generalized to allow for different question types and different documents other than just text as well.

It's also a front in case people want to get in contact for general data scraping or ML needs that I can help with, but the main app is the platform for training data.

I don't have a name for it, which is why it's running with a Heroku url, so suggestions are welcome!

https://fierce-mountain-21498.herokuapp.com/