Hacker News new | ask | show | jobs
by segmondy 66 days ago
This is obviously a continuation training of 3.5, it's not a new model architecture but an incremental improvement.