Hacker News new | ask | show | jobs
by drincanngao 76 days ago
I was going to suggest implementing RoPE to fix the context limit, but realized that would make it anatomically incorrect.
1 comments

I intentionally removed all optimizations to keep it vanilla.