|
|
|
|
|
by meander_water
39 days ago
|
|
> We find that
models are not failing due to “death by a thousand
cuts” (i.e., many small errors). Instead, they main-
tain near-perfect reconstruction in some rounds, and
experience critical failures in a few rounds, typically
losing 10-30+ points in a single round trip > We find that
weaker models’ degradation originates primarily from
content deletion, while frontier models’ degradation is
attributable to corruption of content. I think we largely already knew this. This is why we fudge around with harnesses and temperature etc. |
|