by smallerize 472 days ago
Well, I can't be positive, but it looks like some of the factors that support a long context length might be set wrong. https://blog.eleuther.ai/yarn/
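
To make that a bit more concrete, here is a rough sketch of the kind of sanity check I have in mind. It is only a guess at what could be off: the key names (`rope_scaling`, `factor`, `original_max_position_embeddings`, `max_position_embeddings`) are assumptions modeled on how YaRN-style settings are commonly laid out in a model config, and `check_rope_scaling` is a hypothetical helper, not part of any library.

```python
def check_rope_scaling(cfg):
    """Return a list of warnings for a config-like dict.

    Assumption: the config follows a YaRN-style layout where the extended
    context is roughly original_max_position_embeddings * factor.
    """
    problems = []
    scaling = cfg.get("rope_scaling")
    max_len = cfg.get("max_position_embeddings")

    if scaling is None:
        problems.append("no rope_scaling entry, so no context extension is configured")
        return problems

    factor = scaling.get("factor")
    original = scaling.get("original_max_position_embeddings")

    # A factor of 1 (or less) means the positions are not being stretched at all.
    if factor is None or factor <= 1:
        problems.append(f"scaling factor {factor!r} does not extend the context")

    # The advertised length should not exceed what the scaling covers.
    if factor is not None and original is not None and max_len is not None:
        if original * factor < max_len:
            problems.append(
                f"max_position_embeddings={max_len} exceeds "
                f"original ({original}) * factor ({factor})"
            )

    return problems


# Hypothetical example values, not taken from any real model.
example = {
    "max_position_embeddings": 131072,
    "rope_scaling": {"factor": 1.0, "original_max_position_embeddings": 32768},
}
for warning in check_rope_scaling(example):
    print(warning)
```

If a check like this comes back empty, the factors are at least self-consistent, which still doesn't prove they match what the model was trained with.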