by Xichekolas, 6687 days ago

Maybe have the HN software fetch each submitted page and assign it a weight based on the number of tokens on the page, after stripping HTML tags and stop words. That would give lower weights to short fluff, and also to articles that are split up over a lot of pages (which in my experience tend to be fluff too, with a 4:1 ad-to-content ratio). It'd be kind of like Bayesian filtering, but for post importance. This was actually one of my ideas for submitting to YC, but I rather like HN, so maybe you could experiment with it here.
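
To make that concrete, here is a rough Python sketch of the scoring step. It is only an illustration under my own assumptions, not anything HN actually runs: the stop-word list is a tiny placeholder, the names `content_tokens` and `page_weight` are made up, and the linear weight capped at 1.0 (with a hypothetical `full_weight_tokens` threshold of 500) is just one simple way to make short pages score lower. Fetching the page is left out; the functions take the HTML as a string.

```python
import re
from html.parser import HTMLParser

# Placeholder stop-word list (assumption); a real one would be much longer.
STOP_WORDS = {"a", "an", "and", "the", "of", "to", "in", "is", "it",
              "that", "this", "for", "on", "with"}


class _TextExtractor(HTMLParser):
    """Collects text between tags, skipping <script> and <style> contents."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.chunks.append(data)


def content_tokens(html):
    """Return lowercase word tokens with tags and stop words removed."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    text = " ".join(parser.chunks).lower()
    words = re.findall(r"[a-z0-9']+", text)
    return [w for w in words if w not in STOP_WORDS]


def page_weight(html, full_weight_tokens=500):
    """Weight in [0, 1]: linear in token count, capped at 1.0 (assumed formula)."""
    return min(1.0, len(content_tokens(html)) / full_weight_tokens)
```

In practice you would get the HTML with something like `urllib.request.urlopen(url).read()` and pass the decoded text to `page_weight`. Articles split across many pages score lower without any special handling, since each individual page carries fewer tokens.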