| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by jmalicki 55 days ago
	Are there any good papers on this you would suggest/specific search terms? I am vaguely aware of some stuff, but would love to study more, I don't quite understand what this is all about (but I do see how LLMs can do attention to all prior tokens so you don't have the single-point-of-failure HMMs do which more necessitates Viterbi decodes)