Hacker News
by mohi13 2872 days ago
Makes a lot of sense. It's actually really difficult to get a large enough dataset for moderation tasks to build a decent in-house model for a fair comparison.

Sure, we could try scraping that from Pornhub etc., but then the negative classes would be very domain-specific, and using stock images may not provide a good measure.

Also, it's really weird to assign such a task to any of your employees, feels kinda strange :)

1 comments

Yahoo's model could be fine-tuned: http://caffe.berkeleyvision.org/gathered/examples/finetune_f...

Yeah, it's definitely not a nice task, but what's stopping someone (well, besides potential legal issues) from using these commercial APIs to create datasets programmatically and training a cloned model on them?
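
To make the "create datasets programmatically" part concrete, here's a rough Python sketch of just the labeling step. Everything in it is an assumption for illustration: `score_fn` is a hypothetical stand-in for whatever API client you'd plug in (no real vendor's API is shown), and the 0.5 threshold and CSV layout are arbitrary choices.

```python
import csv
from typing import Callable, Iterable, List, Tuple


def build_pseudo_labeled_dataset(
    image_paths: Iterable[str],
    score_fn: Callable[[str], float],  # hypothetical: returns a score in [0, 1] for one image
    out_csv: str,
    threshold: float = 0.5,  # assumed cutoff, not taken from any real API
) -> List[Tuple[str, float, int]]:
    """Score each image with score_fn and write (path, score, label) rows to a CSV."""
    rows = []
    for path in image_paths:
        score = float(score_fn(path))
        label = int(score >= threshold)  # 1 = flagged, 0 = not flagged
        rows.append((str(path), score, label))

    with open(out_csv, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(["path", "score", "label"])
        writer.writerows(rows)
    return rows
```

The resulting CSV could then feed a fine-tuning run. Keeping the raw score next to the hard label leaves room to train on soft targets instead, and whether any of this is allowed depends on the provider's terms.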

I'm curious what the profit margins are on these APIs because I think they are way overpriced.