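A basic technique is to weight each term within a document by its term frequency-inverse document frequency (tf-idf) statistic, which scales how often the term occurs in that document by how rare it is across the whole collection.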
https://en.wikipedia.org/wiki/Tf%E2%80%93idf
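
As a rough illustration, here is a minimal sketch of one common tf-idf variant, assuming documents are already tokenized into lists of terms. The function name `tf_idf` and the sample documents are hypothetical, and the specific formulas (term count divided by document length, and the unsmoothed natural log of N over document frequency) are only one of several variants in use.

```python
import math
from collections import Counter

def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document.

    Assumed variant: tf = count / document length,
    idf = ln(number of documents / documents containing the term).
    """
    n_docs = len(docs)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights

# Hypothetical sample corpus, for illustration only.
docs = [
    "the cat sat on the mat".split(),
    "the dog sat on the log".split(),
    "cats and dogs".split(),
]
weights = tf_idf(docs)
```

With this variant, a term that appears in every document gets a weight of zero, and in the first sample document "cat" ends up weighted higher than "the" even though "the" occurs more often there.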