Hacker News
by andai, 135 days ago
That might actually boost performance since attention pays attention to stuff that stands out. If I make a typo, the models often hyperfixate on it.