| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by vladpowerman 243 days ago
	The compression framing is super interesting. It makes me wonder if there’s an equivalent notion for source code - like how much “information” or entropy a commit contains vs. boilerplate churn. I’ve been exploring Git activity analysis recently and ran into similar trade-offs: how do you tokenize real-world code and avoid counting noise?