Hacker News new | ask | show | jobs
by nairoz 594 days ago
> trained from scratch

Not exactly. They mention starting from the VAE from Stable Diffusion XL and the Transformer from Phi3.

Looks like these LLMs can really be used for anything

1 comments

Pretty cool, comfy ui and community is too cumbersome for me and still results in too much throwaway content