| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by rrr_oh_man 815 days ago
	Could you explain for a dum-dum?

1 comments

karalala 815 days ago

Results of xlstm are promising but will need larger scale experiments.

However they completely messed up benchmarking experiments for various RNN models which in their papers claim comparable and even better performance than base transformer.

link

AIsore 815 days ago

These experiments seem pretty large already though, no? How are you so sure they messed up benchmarking? Is the code out already?

link