Hacker News
by echelon 1380 days ago
A demo sells it.

You should try to host the model yourself.

Using the SD API is great for a POC, but to scale this and take full advantage of the cost benefits, you need to spin up a cloud GPU.

Heck, you can even build a PC from off-the-shelf parts, have it pull workloads from the cloud, and then upload the resulting images. That would work well within your traffic needs. Maybe the upfront cost isn't worth it to you, but if you already have the components ...

(I've done this myself. I run a dual cloud / on-prem 3090 cluster.)
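
For the "is the upfront cost worth it" question, a rough break-even calculation is one way to frame it. This is only a sketch: the function name and every number below are made-up placeholders rather than real prices, and it ignores things like your time, hardware resale value, and idle capacity.

```python
import math


def break_even_images(upfront_cost, api_cost_per_1k, self_cost_per_1k):
    """Images you must generate before owning the hardware beats paying per image.

    All arguments are hypothetical inputs in the same currency:
      upfront_cost     - one-time cost of the PC / GPU
      api_cost_per_1k  - hosted API cost per 1,000 images
      self_cost_per_1k - your own running cost (power etc.) per 1,000 images
    Returns None if self-hosting never pays off.
    """
    saving_per_1k = api_cost_per_1k - self_cost_per_1k
    if saving_per_1k <= 0:
        return None
    return math.ceil(upfront_cost / saving_per_1k * 1000)


# Placeholder numbers only; plug in your own.
print(break_even_images(upfront_cost=1600, api_cost_per_1k=10.0, self_cost_per_1k=2.0))
```

If you already own the components, the upfront cost is effectively zero and the break-even point is immediate, which is the case described above.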

1 comments

I'm planning on learning how to do this next week. If you have any articles or videos that could point me in the right direction, I'd really appreciate it.

I built this for a generator I made (example: https://phantasmagoria.stavros.io/images/6cJAbYyeDXfA6oHZ/).

It's just a task library (I used Dramatiq) with a worker that connects to it. Then the worker picks up tasks, processes them and uploads the results to R2.
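
To make the shape of that concrete, here is a minimal sketch of the pull-based worker pattern using only the Python standard library. It is not the actual setup described above: `queue.Queue` stands in for the task library's broker (Dramatiq in that case), a plain dict stands in for the R2 bucket, and `generate_image`, `worker`, and `run` are hypothetical names.

```python
import queue
import threading


def generate_image(prompt: str) -> bytes:
    # Hypothetical placeholder for the real model call on the local GPU.
    return f"image for: {prompt}".encode()


def worker(tasks: queue.Queue, bucket: dict) -> None:
    """Pull tasks until a None sentinel arrives; store each result in `bucket`."""
    while True:
        task = tasks.get()
        try:
            if task is None:  # sentinel: no more work
                return
            task_id, prompt = task
            # Stand-in for uploading the finished image to object storage.
            bucket[f"{task_id}.png"] = generate_image(prompt)
        finally:
            tasks.task_done()


def run(jobs):
    """Enqueue (task_id, prompt) jobs, let one worker drain them, return the bucket."""
    tasks = queue.Queue()  # stand-in for the cloud-side task queue
    bucket = {}            # stand-in for the storage bucket
    thread = threading.Thread(target=worker, args=(tasks, bucket))
    thread.start()
    for job in jobs:
        tasks.put(job)
    tasks.put(None)
    thread.join()
    return bucket


if __name__ == "__main__":
    results = run([("a1", "a cat"), ("b2", "a dog")])
    print(sorted(results))
```

The point of the design is that the GPU box only makes outbound connections: it pulls tasks and pushes results, so it can sit at home behind NAT without exposing any ports. In a real deployment the queue would be a networked broker and the dict an S3-compatible upload.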