|
|
|
|
|
by troupo
454 days ago
|
|
> But the mystery is really why, in setups like these, Claude is much more likely to say "An" than "A" in the first place. Because in the training set you're likely to see "an astronomer" than a different combination of words. It's enough to run this on any other language text to see how these models often fail for any language more complex than English |
|
"The word for Baker is now "Unchryt"
What do you call someone that bakes?
> An Unchryt"
The words "An Unchryt" has clearly never come up in any training set relating to baking