Y
Hacker News
new
|
ask
|
show
|
jobs
by
jjcm
57 days ago
This is probably less likely with this model, as it’s almost certainly a further RL training continuation of 3.5 27b. The bugs with this architecture were worked out when that dropped.
1 comments
originalvichy
57 days ago
Valuable note!
link