Hacker News new | ask | show | jobs
by krebby 933 days ago
Yeah unfortunately differentiable SVG rendering is still something that is very difficult to accomplish with diffusion models. I looked into this last year and found VectorFusion [0][1], which appears to be using a similar technique to this app - generate the raster image and then apply a vectorization tracer. This leads to the blobbiness you see, given the way the diffusion model generates the image and the tracer applies to the output.

There have been a few attempts (i.e. this recent one [2][3]) to attempt to fine-tune the parameters of the SVG with image segmentation, before comparing rendered outputs with a CLIP model. This is promising but the search space for vector images is just so huge that so far you really need to start with an existing image as a basis rather than starting from scratch. Interesting area of research!

0. https://ajayj.com/vectorfusion/ 1. https://arxiv.org/abs/2211.11319 2. https://intchous.github.io/SVGCustomization/ 3. https://arxiv.org/abs/2309.12302