| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by lifeisstillgood 3207 days ago
	the most interesting thing i found was "We use movies as the source of AVA". while the datasets will only grow, movies are not realistic - they are by design faked, acted, well lit etc. While that is probably the best thing to do with a starting set i am waiting for the CNN/RNN to start saying (much like the early black female standford researcher who was not identified as human face) that person is not walking - i know walking, it's just like John Cleese.

1 comments

chimtim 3207 days ago

this is what makes this dataset poor. other datasets mentioned in the blog are based off youtube which is more realistic. movie based datasets have perfect lighting, center the subject are almost never useful (e.g. HMDB)

link

yodon 3207 days ago

YouTube/Flickr/etc are far from ideal data sources. Do dogs drive cars? Flickr has tons of photos of dogs driving cars, eating ice cream, and doing tons of other rare-for-dogs things. Ultimately whatever the raw data source is what matters is how well is it curated, and that’s always going to be a highly labor intensive job that can be done well or poorly regardless of the source of the images being curated.

link

chimtim 3206 days ago

these are not random, raw youtube video datasets. they are hand curated dataset in specific classes (like using mechanical turk). youtube has really diverse videos with different lighting, and real world scenarios which makes it an excellent dataset. Movie clips look great but models trained on them are useless in real world.

link