Hacker News new | ask | show | jobs
by Aerroon 96 days ago
Either some q3 or since it's a MoE, maybe a REAP version of q4 might work (or could be terrible, I'm not sure about REAP'd models).