Hacker News new | ask | show | jobs
by goffi 1244 days ago
Wow that's really impressive (I've seen similar things in research papers for a while now, but having it usable so easily and generic is great).

A few questions:

- would it be possible to use this tool to make automatic mask for editing in something like GIMP (for instance, if I want to automatically mask the hair)?

- would it be possible to have a REPL or something else to make several prompt on the same image? Loading the model takes time, and it would be great to be able to just do it once.

- how about a small GUI or webui to have the preview immediately? Maybe it's not the goal of this project and using `instruct-pix2pix` directly with its webui is more appropriate?

Thanks for the work (including upstream people for the research paper and pix2pix), and for sharing.

1 comments

> would it be possible to use this tool to make automatic mask for editing in something like GIMP

probably but GIMP plugins are not something I've looked into

> REPL

already done. just type `aimg` and you're good to go

> GUI

GUIs add a lot of complexity. Can your file manager do thumbnails and quick previews?

> GUIs add a lot of complexity. Can your file manager do thumbnails and quick previews?

Somewhat OT, but I find this really funny. It says a lot about the difficulty of using various ecosystems and where communities spend time polishing things.

"Yeah, I made something that takes natural language and can do things like change seasons in an image. But a GUI? That's complicated!"

It's not a criticism of you, but the different ecosystems and what programmers like to focus on nowadays.

Fair but I'd point out I also didn't make the algorithm that changes photos. I'm wrapping a bunch of algorithms that other people made in a way that makes them easy to use.

It's not just that GUI's are hard, it's that the "customer" base will inevitably be much less technical and I'd receive a lot more difficult to resolve bug reports. So no-gui is also a way of staying focused on more interesting parts of the project.

thanks for the quick answer and cool for REPL. Yeah sure I can just launch Gwenview on the output directory.

> probably but GIMP plugins are not something I've looked into

I was just thinking about a black and white or grey level output image with the desired area , no need to integrate it in GIMP of whatever. I've tried a prompt like "keep only the face", but no luck so far.

There is a smart mask feature. Add `--mask-prompt face --mask-mode keep`. I believe it outputs the masks as well