Table 1 reminds me why even if these algorithms are available it's a big step to being able to understand and apply them. It's clear the author doesn't have a lot of familiarity with them.
He is co-founder of the Mahout project with a pretty extensive background in text analysis. I suspect he's familiar with the algorithms. In fact, he may be showing the reader that they _aren't_ as magical as one may believe, by showing that they don't work perfectly oob.
Unless you are being sarcastic, in which case, forgive me for missing it.
Unless you are being sarcastic, in which case, forgive me for missing it.