Hacker News new | ask | show | jobs
by throwawaymaths 377 days ago
the llm is also a lookup table! but your point is correct. they should have looked at subsequent layers that aggregate information over distance.