Hacker News
by spacebanana7 1190 days ago
I reckon we’ll move out of the prompt difficulty phase you mention simply when the context window gets big enough.

If you were able to give Midjourney a short textual instruction, a hand-drawn sketch and a reference image from a human artist all together as a prompt, then I’m pretty sure it could produce the image of a boy doing a high jump as you intend.

We already see extended-length multimedia prompts in GPT-4, so it doesn’t seem like an impossible leap for Midjourney/DALL-E etc.

2 comments

Midjourney already allows this - sort of - with image remixing.

From everything I tried, the results were worse.

Again, I think this is going to remain a problem for a long time - but it will probably improve slightly with each iteration. Either way there are so many use cases where the cost-benefit will massively favor AI-generated art, and I think the % of cases will continue to increase - albeit slowly.

Similar to self-driving cars - they've been in limited availability in Phoenix for a long time, and now SF. The list of cities will grow, and the limitations will decrease - but I still can't see the vast majority of trips being self-driven within the next 20 years.

In the same way, I don't see AI generating the vast majority of Pixar films in 20 years. Nor AI generating Marvel comic strips or kids cartoons. Etc.

Sure - some people will be using it for these use cases. They already are, and were before GPT.

I don't see this killing jobs, but limiting job growth instead.

You can already give MJ a reference image, just by putting the URL of the image as the first thing after the /imagine command and before the text description.
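
To make the ordering concrete, here's a rough sketch that just assembles the text you'd paste after /imagine: image URL first, then the description. The function name `build_imagine_prompt` and the example.com URL are made up for illustration; this doesn't call Midjourney or any API, it only builds the string.

```python
def build_imagine_prompt(image_url: str, description: str) -> str:
    """Return prompt text with the reference image URL placed before the description.

    Hypothetical helper: it only formats a string and does not talk to Midjourney.
    """
    url = image_url.strip()
    text = description.strip()
    # The reference image has to be a link, so reject anything that isn't one.
    if not url.startswith(("http://", "https://")):
        raise ValueError("image_url should be a full http(s) URL")
    if not text:
        raise ValueError("description should not be empty")
    # URL goes first, then the text description.
    return f"{url} {text}"


# Placeholder URL, not a real image.
prompt = build_imagine_prompt(
    "https://example.com/sketch.png",
    "a boy doing a high jump",
)
print(prompt)
```

The result is what goes after /imagine in the chat box, so the reference image comes before any of the descriptive text.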