Hacker News new | ask | show | jobs
by CPUTranslator 1343 days ago
To me, this article misses the point, but only slightly. If we compare the proposed Photoshop for text and the actual Photoshop for images, those are very different tools in how they work with their respective mediums.

Photoshop doesn’t use AI models to generate new things for the artist, photographer, whoever (it does have generation of course, but not in the same way). A tool like Photoshop for text, in the way described, seems more like a writing aid than a writing tool; something I can give vague commands like “make this paragraph more academic” (whatever that means), and it’ll spit out something approximating the academic style it analyzed. Whereas “auto balance the levels of this image” is much more concrete in what it means, there’s no approximation of “style.”

I feel like Photoshop for text should be an editor that cares more about the structure of stories and who’s in them, with ways to organize big chunks in easy ways, rather than something that generates content for you.

3 comments

> Photoshop doesn’t use AI models to generate new things for the artist, photographer, whoever

It certainly does have this, such as tools to remove objects and paint over them as if they were not there, automatic sky replacement (replace a cloudy sky with a sunset), super resolution (AI up scaling) and a range of what they call 'Neural Filters'.

You want your low quality boring midday image of some famous bridge to be a high resolution, taken during sunrise without that person riding a bicycle? Photoshop will do it with very little user skill or input.

Some beta features include style transfer, makeup transfer and automatic smile enhancment.

Yeah, but nobody is buying Photoshop for the inpainting features. Photoshop's bread-and-butter is raster graphics tools, and almost all of them function deterministically.
Content aware fill's been in Photoshop since the CS5 days, so at least back to 2019.
Yes, and it sucked for anything more complex than filling sky with more clouds or foliage with more leaves.
Not sure what you mean by "deterministically", but most models are deterministic during inference: given the same input or prompt, they'll generate the same output.
Are they? I thought there was always a kind of “random parameter” or random step midway?

Hope this question is not too dumb!

Usually not, perhaps noise filters, but mostly not. There was amusing case of reversing twirl filter a while ago: https://boingboing.net/2007/10/08/untwirling-photo-of.html
Surely that twirl effect isn't made using AI but just a geometric function, right?
Thanks a lot. I was mistaken.
It depends on the algorithm, but many models can be re-generated if you start with the same seed.
Is it common and/or expected to have control over the seed though?
Thank you! I was mistaken then.
Tense and plurals is one that would that seems like it would be helpful for a computer to keep consistent while you edit, so when you edit the subject to be plural from singular, it tacks on the "s" onto the verb for you. If you had

"Families enjoy this restaurant"

And started typing and changed "Families" to "our family" this "photoshop for text" would change "enjoy" to "enjoys" for you.

Enough little facets like that, I could see being useful.

It's like content-aware text refactoring
Clippy on steroids.
The article is more about user experience than implementation. When you apply an effect in Photoshop you can change all relevant pixels at once. Whereas we don’t have good tools to change all words in a document at once (while retaining the intended meaning)
I get your point, but your comment made me search for a Photoshop Stable Diffusion plug-in. I had no idea.

I’m decent with Photoshop but have not had a need for it lately. This makes me want to fire it back up.

Each video is 1-2 minutes long. Mind blown again.

https://youtu.be/XdctF1uTLBo

https://youtu.be/mn1PV6HqXGU

https://youtu.be/Wo6ZDYFCWTY

Oh that would be cool. Write a paragraph about a red ball, then halfway through, decide it's a blue ball. The editor then highlights all occurrences of the variable and rewrites them for you. Like the "Refactor local variable name" bit in VSCode.