Put several instances of each song into a playlist and hit "shuffle". Try to identify which is which. That's the best way to make sure it's a real difference, not just your head playing tricks.
Agreed. Blind tests are the only reliable way. If nothing else tests in this area have managed to show is that people are not good at judging without bias if they know which is which.