Hacker News new | ask | show | jobs
by MallikaSinha 2656 days ago
If you have a large dataset and you train your data accuracy till 100%, what about the noisy data? Will this classification system will reduce the noisy values in the system? Machines during training or runtime prediction do have some amount of noises. Can we reduce those through this classification?
1 comments

That depends on what you mean by noise. In the case of text classification, a lot of the noise in training is disagreement between human labelings. Unfortunately, classification systems will only be as good as the labels they are trained on.

However, if you are referring to noise as in typos and misspellings - then yes, depending on the training and/or preprocessing steps, classification systems could potentially reduce the noise in the input data to still achieve good results.

By noise I mean to say that overfitting your data which means it starts to take your wrong input data and plot the graph involving those input data as well. Anyway, I got your point. Thankyou