Hacker News new | ask | show | jobs
by phowon 2620 days ago
And here is a tweet thread on why using NLP models to "fill in" the redacted portions is a horrendously terrible idea.

https://twitter.com/emilymbender/status/1119081131234611201

2 comments

Of course that's not the main aim of the person in the thread. The aim is a timeline cross referencing different data sources.
Bingo!
I certainly agree that this wouldn’t tell us anything real about the Mueller report, although it might be a useful exercise to learn about the language models. What really rubs me the wrong way is academics or anyone else with power trying to tell me what is or isn’t “funny” or “interesting” or “fun.” Like, that’s for me to decide, not you!