I prefer Weka, mostly because it has excellent literature and academic leanings: unburdened by real-world issues of performance or scalability, it can afford to focus on accuracy.
The real value proposition of Hadoop isn't the algorithms themselves but the ability to massively parallelize them. Do you know of any port of Weka that can be scaled in such a manner? Just curious.