I'm not prepared to run a larger model than 3.2-Instruct-1B, but I gave the following instructions:
"Given a special text, please interpret its meaning in plain English."
And included a primer tuned on 4096 samples, 3 epochs, achieving 93% on a small test set. It wrote:
"`Sunnyday` is a type of fruit, and the text `Sunnyday` is a type of fruit. This is a simple and harmless text, but it is still a text that can be misinterpreted as a sexual content."
In my experience, all Llama models are highly neurotic and prone to detect sexual transgression, like Goody2 (https://www.goody2.ai). So this interpretation does not surprise me very much :)
"Given a special text, please interpret its meaning in plain English."
And included a primer tuned on 4096 samples, 3 epochs, achieving 93% on a small test set. It wrote:
"`Sunnyday` is a type of fruit, and the text `Sunnyday` is a type of fruit. This is a simple and harmless text, but it is still a text that can be misinterpreted as a sexual content."
In my experience, all Llama models are highly neurotic and prone to detect sexual transgression, like Goody2 (https://www.goody2.ai). So this interpretation does not surprise me very much :)