Hacker News new | ask | show | jobs
by tell_me_whai 218 days ago
Does this allow for mixing LLMs and Image Gen? I find LLMs really useful to generate image prompts that diffusion models understand (which can be tedious to do manually). Although you need very detailed system prompts to teach what Image Gen models expect.
1 comments

That's how the pros do it. Yes, you can load both a text and image gen models at the same time. Needless to say you'll need a very beefy GPU(s) to do this so I wouldn't recommend it unless you know exactly what you're doing as generally you'll want to max out your VRAM for one model at a time for the highest quality results. Open webui and sillytavern allow both text and image gen from the same UI although I wouldn't recommend it for advanced users. Otherwise Gerbil will give you multiple pages to toggle through via the titlebar dropdown.