| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by hansvm 480 days ago
	Why not though? Both kinds of models jumble around the data and spit out a probability distribution. Why is the tesseract distribution inherently more explainable (aside from the UI/UX problem of the uncertainty being per-token instead of per-character)?