Why: http://webcache.googleusercontent.com/search?q=cache:H52osT6...
Getting it: http://webcache.googleusercontent.com/search?q=cache:0I8CMo0...
From the second link, I found that it's hosted here:
http://www.infochimps.com/collections/million-songs
where you can download either a subset or the whole thing. I'm downloading the subset now, so I can't comment on the data's cleanliness or schema yet.