|
|
|
|
|
by _--__--__
59 days ago
|
|
I was more surprised by gemma models consistently flinching on anti-Europe more than China or America. Can't imagine Leopold or Amritsar get much attention in fine-tunes, so it probably means the models are just told to be open to criticism of China and the US beyond what their other training would allow. |
|
Then there's the fact that the Bengal famine and the Amritsar massacre just aren't spoken about as much as (for example) the Tiananmen Square massacre. I'd assume the 'flinching' around anti-Europe stuff is mostly down to a comparatively low incidence in the training data.