Hacker News new | ask | show | jobs
by isaacfung 1033 days ago
I did some tests and compared with the clip demo on https://huggingface.co/spaces/vivien/clip

It seems clip performs better for prompts like "three birds", "man and woman"