|
|
|
|
|
by ma2rten
2832 days ago
|
|
This approach only works if you use OP's assumption that a text's sentiment is the average of it's word's sentiment. That assumption is obviously flawed (e.g. "The movie was not boring at all" would have negative sentiment). Making this assumption is fine in some cases (for example if you don't have training data for your domain), but if you build a classifier based on this assumption why don't you just use an off-the-shelf sentiment lexicon? Do you really need to assign a sentiment to every noun known to mankind? I doubt that this improves the classification results regardless of the bias problem. |
|
> Do you really need to assign a sentiment to very noun known to mankind?
No, but it seems like a simple (and seemingly innocuous) mistake that many programmers can and will make.