Hacker News new | ask | show | jobs
by drv 4701 days ago
The encoding looks like valid UTF-8, at least for the first few pages that I glanced at.

I did notice the section references look a little strange in vim, e.g. "act Aug. 10, 1956, ch. 1041, § 1" near the top; it consists of c2 a7 (section sign), which looks fine, followed by e2 80 af (narrow no-break space), which shows up as a box in vim.

1 comments

OK, maybe it's just a font thing here on Windows (wouldn't surprise me one bit). I'll try again when I get home tonight.
For posterity's sake, it does work fine here on a system with actual Unicode fonts. So I was wrong to blame the file itself.