- HMMs and Perceptrons for Part-of-Speech Tagging and Chunking - http://www.aclweb.org/anthology/W02-1001
- MaxEnt for Part-of-Speech Tagging - http://www.aclweb.org/anthology/W96-0213
- RNNs for Slot Filling - http://www.iro.umontreal.ca/~lisa/pointeurs/RNNSpokenLanguag...
Not related to NLP, but I really like the Facebook paper that covered delta-of-delta compression for time series data:
- http://www.vldb.org/pvldb/vol8/p1816-teller.pdf
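
For anyone who hasn't run into delta-of-delta encoding, here is a minimal sketch of the core idea in Python. Instead of storing each timestamp, you store the first value and then the change in the gap between consecutive timestamps, which is mostly zeros when points arrive at a regular interval. The function names are my own, and this only covers the arithmetic on integer timestamps: as I understand it, the paper goes further and packs these small values into variable-width bit fields, which this sketch skips.

```python
def delta_of_delta_encode(timestamps):
    """Encode integer timestamps as [first, dod_1, dod_2, ...].

    The first delta-of-delta is measured against an assumed previous
    delta of 0, so it is simply the first delta itself.
    """
    if not timestamps:
        return []
    encoded = [timestamps[0]]
    prev_delta = 0
    for prev, cur in zip(timestamps, timestamps[1:]):
        delta = cur - prev
        encoded.append(delta - prev_delta)  # change in the gap, not the gap
        prev_delta = delta
    return encoded


def delta_of_delta_decode(encoded):
    """Invert delta_of_delta_encode by re-accumulating the deltas."""
    if not encoded:
        return []
    timestamps = [encoded[0]]
    delta = 0
    for dod in encoded[1:]:
        delta += dod
        timestamps.append(timestamps[-1] + delta)
    return timestamps


# Points arriving roughly every 60 seconds, with one arriving a second late.
samples = [1000, 1060, 1120, 1180, 1241, 1301]
encoded = delta_of_delta_encode(samples)
print(encoded)  # [1000, 60, 0, 0, 1, -1]
print(delta_of_delta_decode(encoded) == samples)  # True
```

The payoff is that a regular series collapses to a run of zeros and near-zeros, which is what makes the later bit-level packing so cheap.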