Hacker News new | ask | show | jobs
by IDisposableHero 5546 days ago
It got "how much wood would a woodchuck chuck if a wouldchuck would chuck wood" right for me. I am impressed now.

But it couldn't handle "Ph'nglui mglw'nafh Cthulhu R'lyeh wgah'nagl fhtagn" Maybe I'm not pronouncing it quite right.

2 comments

"how much wood would a woodchuck chuck if a wouldchuck would chuck wood"

That isn't really the best example, considering how these systems tend to work. If their system has a giant bank of text it's using to predict the statistical likelihood of your next word, once you've said "how much wood ..", 'would' is a very high probability candidate for the next word, and once you get to 'woodchuck' the rest is statistically almost inevitable.

A better 'difficult' test would be something along the lines of 'colorless green ideas sleep furiously', although we can't actually use that one since that example is so famous it would likely turn up in a web-derived corpus many times.

How is that even a test for it? If you input garbage into a system like that your going to get garbage out. And for fun, walk up to a stranger and say something like that and see if the stranger can provide useful information.
Yes, it kind of failed on "movie times for 'source code'". That's a sentence that's hard to parse without context ;)
Are you sure you pronounced all the schwas?