by koochi10
1066 days ago
I agree to disagree with the given "evidence". From my perspective, the unicorns don't appear to have changed in any notable way. It's interesting, though, that we're assessing a language bot based on its ability to generate a drawing. After reviewing the linked blog post, I agree with the author's observation that there don't seem to be any significant alterations in the unicorn. Indeed, there are numerous instances of developers experimenting with prompt engineering to discover which methods work best. However, I find it difficult to regard this as anything more than speculation for now.
Regardless of whether we should benchmark imagery with something that was claimed to be multimodal, can you genuinely not see the difference here?
https://imgur.com/a/Eburq3B
Maybe your internal prompt is primed to disagree regardless of what is presented?
Edit: https://www.youtube.com/watch?v=qbIk7-JPB2c&t=1585s even mentions what you claim not to see. Safety degrades the model.