Hacker News new | ask | show | jobs
by wwizo 267 days ago
You guys rock! I'm very curious how will this perform against real word data, where small nuance matters. Also have you tested it beyond 128K context window?