Hacker News new | ask | show | jobs
by lightsidelabs 4289 days ago
How does this compare to other benchmarks? It looks like you're using the sentence-level dataset out of Cornell, based on your Github. Even a naive unigram baseline easily beats the 70% threshold you mentioned in your post. A few years ago, I co-authored [1] a publication with very similar graph-based features on this dataset that achieved 77% accuracy, and the state of the art has moved beyond that since then. Without a comparison to baseline it's hard to tell whether this (much more sophisticated) technique is adding value.

[1] Shilpa Arora, et al. "Sentiment classification using automatically extracted subgraph features." Proceedings of the NAACL HLT 2010 Workshop on Computational Approaches to Analysis and Generation of Emotion in Text.

1 comments

The benchmarks really depend on the classifier used. The explanation in this blog post is more about the natural language parsing model. I thought that part up based on books I read from thought leaders (Gleick, Kurzweil, J. Hawkins). The value I'm hoping to provide is less in my own research and more on the use of a graph database. I'll leave the research part to fine people like you.

As for benchmarks, I've seen differences at different sample sizes during training. The model seems to do better with more training examples. Though that increases the number of features and the dimension of the vectors when calculating cosine similarity. I'm really hoping to attract more input like this as Graphify grows as an open source project. Please feel free to get in touch with me. Skype is kenny.bastani.

I'll post benchmarks in the next blog post.