by dfbrown
919 days ago
How real is it though? This blog post says "In this post, we’ll explore some of the prompting approaches we used in our Hands on with Gemini demo video," which makes it sound like they used text + image prompts and then acted them out in the video, as opposed to Gemini interpreting the video directly. https://developers.googleblog.com/2023/12/how-its-made-gemin...
> Narrator: "Based on their design, which of these would go faster?"
Without even specifying that those are cars! That was impressive to me, that it recognized the cars are going downhill _and_ could infer that in such a situation, aerodynamics matters. But the blog post says the real prompt was this:
> Real Prompt: "Which of these cars is more aerodynamic? The one on the left or the right? Explain why, using specific visual details."
They narrated inaccurate prompts for the Sun/Saturn/Earth example too:
> Narrator: "Is this the right order?"
> Real Prompt: "Is this the right order? Consider the distance from the sun and explain your reasoning."
If the narrator had actually read the _real_ prompts they fed Gemini in these videos, this would not be nearly as impressive!