Hacker News
by karmacondon 3514 days ago
In practice, this almost never works out. The 10% cluster is trained on a very small dataset and will produce inferior results. If you train a model on only 10 people, you're prone to overfitting that small sample.
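To make the overfitting point concrete, here's a rough sketch (a toy setup, not anyone's real pipeline). It assumes a hypothetical "model" that just picks whichever of 50 pure-noise features best matches the labels. The function name, the feature count, and the sample sizes are all made up for illustration. Since every feature is a coin flip, nothing here can truly beat 50%, so any training accuracy above that is fit to noise.

```python
import random


def best_noise_feature_accuracy(n_people, n_features=50, seed=0):
    """Select the noise feature that best matches the labels on n_people.

    Hypothetical toy model: labels and features are independent coin flips,
    so the true accuracy of every feature is 50%.
    Returns (train_accuracy, test_accuracy).
    """
    rng = random.Random(seed)

    def accuracy(column, y):
        return sum(a == b for a, b in zip(column, y)) / len(y)

    # Training data: random labels and random binary features.
    labels = [rng.randint(0, 1) for _ in range(n_people)]
    features = [
        [rng.randint(0, 1) for _ in range(n_people)]
        for _ in range(n_features)
    ]

    # "Training" = keep the feature with the highest accuracy on the sample.
    train_acc = max(accuracy(col, labels) for col in features)

    # On fresh people the selected feature is still just a coin flip,
    # so we draw new values for it and new labels.
    n_test = 10_000
    test_labels = [rng.randint(0, 1) for _ in range(n_test)]
    test_feature = [rng.randint(0, 1) for _ in range(n_test)]
    test_acc = accuracy(test_feature, test_labels)

    return train_acc, test_acc


if __name__ == "__main__":
    for n in (10, 1000):
        train_acc, test_acc = best_noise_feature_accuracy(n)
        print(f"n={n}: train accuracy {train_acc:.2f}, test accuracy {test_acc:.2f}")
```

With only 10 people, some noise feature will usually line up with most of the labels by chance, so the training score looks good while the score on new data stays near a coin flip. With 1000 people, the training score itself falls back toward 50%, which at least makes it obvious there is nothing to learn.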