Hacker News new | ask | show | jobs
by scott-smith_us 1039 days ago
Is there significantly more to this than just tokenizing a document's words and then ZIP-ing the stream of tokens?