

by samg_ 5375 days ago

I am just learning some of these machine learning tools and am rapt, so forgive me for asking, but would you be able to explain a little about what you are doing?

How are you generating features? Stanford parser? Are you using logistic regression or something more advanced?

I love the idea. I am interested in applying some of these concepts myself. Do you have any ideas that you are not able to pursue yourself, that I might take a crack at?


michaelaiello 5375 days ago

It works just like spam filtering: we're using a naive Bayes classifier with training data. We built and tested a custom extractor that makes the most sense for the legalese of privacy policies.
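For readers new to the approach, here is a minimal sketch of a spam-filter-style naive Bayes text classifier. It is not Policizer's code: the training sentences, the label names (`sells`, `does_not_sell`), and the simple word tokenizer are all made up for illustration, and the custom extractor mentioned above is presumably more elaborate.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    # Hypothetical extractor: lowercase words only. A real extractor for
    # legalese would likely do more than this.
    return re.findall(r"[a-z']+", text.lower())


class NaiveBayes:
    def __init__(self):
        self.word_counts = defaultdict(Counter)  # label -> word -> count
        self.doc_counts = Counter()              # label -> number of documents
        self.vocab = set()

    def train(self, text, label):
        self.doc_counts[label] += 1
        for word in tokenize(text):
            self.word_counts[label][word] += 1
            self.vocab.add(word)

    def log_scores(self, text):
        total_docs = sum(self.doc_counts.values())
        words = [w for w in tokenize(text) if w in self.vocab]  # skip unseen words
        scores = {}
        for label in self.doc_counts:
            # log prior: share of training documents with this label
            score = math.log(self.doc_counts[label] / total_docs)
            n_words = sum(self.word_counts[label].values())
            for word in words:
                # log likelihood with add-one (Laplace) smoothing
                count = self.word_counts[label][word]
                score += math.log((count + 1) / (n_words + len(self.vocab)))
            scores[label] = score
        return scores

    def classify(self, text):
        scores = self.log_scores(text)
        return max(scores, key=scores.get)


# Toy training data, invented for this sketch.
model = NaiveBayes()
model.train("We may sell your personal information to third parties", "sells")
model.train("We share your data with advertisers and partners", "sells")
model.train("We do not sell your personal information", "does_not_sell")
model.train("We will never share or sell your data", "does_not_sell")

print(model.classify("We do not sell your data"))
print(model.classify("We may sell your information to advertisers"))
```

The classifier only sees per-word counts for each label, so its output depends heavily on which sentences are in the training set.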

Ideas: email me at michaelaiello (at) michaelaiello.com


raphman 5375 days ago

Might I suggest adding at least this sentence to the training set: "We sell your private information."

Your Policizer happily tells me that the sentence means "They DO NOT sell your private information."
