Hacker News new | ask | show | jobs
by dsamarin 1889 days ago
Can we calculate how many bits of entropy we are saving in this compressed form? There seems to be (n-2)! ways of writing a word in a different way that is still readable.
2 comments

I guess that could be a form of lossy text compression - where the end result is not completely right (the letters not being completely in the right order) but it's good enough to be able to read the text.
Information density of English text is 1ish bit per letter. There was some research about it, and also it's the compression rate you get from state-of-the-art algorithms.
Some research by none other than Claude Shannon :)