Hacker News
by CuriouslyC 265 days ago
I've done it. I have a benchmark called scramblebench that rewrites inputs to measure how model performance degrades under symbol replacement and added layers of indirection.
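For anyone unsure what that kind of rewriting looks like, here is a minimal Python sketch of the two transformations. It is not scramblebench's actual code or API: the function names `scramble_symbols`, `add_indirection`, and `degradation`, the token format, and the alias naming are all hypothetical and chosen only for illustration.

```python
import random
import re
import string


def scramble_symbols(text, symbols, seed=0):
    """Consistently replace each listed symbol with a nonsense token.

    Hypothetical helper, not part of scramblebench.
    Returns the rewritten text and the symbol -> token mapping.
    """
    if not symbols:
        return text, {}
    rng = random.Random(seed)  # seeded so the rewrite is reproducible
    mapping = {}
    for idx, sym in enumerate(symbols):
        suffix = "".join(rng.choice(string.ascii_lowercase) for _ in range(6))
        # The index prefix keeps tokens unique even if suffixes collide.
        mapping[sym] = f"v{idx}_{suffix}"
    # Longest symbols first, matched as whole words only, in a single pass.
    ordered = sorted(symbols, key=len, reverse=True)
    pattern = re.compile(r"\b(" + "|".join(re.escape(s) for s in ordered) + r")\b")
    return pattern.sub(lambda m: mapping[m.group(1)], text), mapping


def add_indirection(name, depth):
    """Build a chain of alias definitions that leads back to `name`.

    Hypothetical helper. Returns the definition lines and the final alias,
    which a rewritten prompt would use in place of `name`.
    """
    lines = []
    current = name
    for i in range(depth):
        alias = f"alias_{i}"
        lines.append(f"{alias} = {current}")
        current = alias
    return lines, current


def degradation(baseline_accuracy, scrambled_accuracy):
    """Accuracy lost when moving from the original to the rewritten items."""
    return baseline_accuracy - scrambled_accuracy
```

A model that relies on memorized surface forms should lose more accuracy on the rewritten items than one that tracks the underlying structure, so the value returned by `degradation` is the number to compare across models.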