Hacker News new | ask | show | jobs
by causal 724 days ago
Thanks for your work on this; excited to try it out!

The Google API models support 1M+ tokens, but these are just 8K. Is there a fundamental architecture difference, training set, something else?