Hacker News
by fgfarben 36 days ago
On both the llama.cpp-based version and the custom Metal version, the model forgets how to use tools somewhere around the 50,000-token mark.