Hacker News new | ask | show | jobs
by dunefox 2191 days ago
> 5x higher than pytorch, and I couldn’t fit my models on the GPU for flux. For large models, I had to compromise on batch size in Julia

I had the exact same experience. While I like Julia and Flux I can't use it in this state for my models.

1 comments

Would you mind opening corresponding issues on the repo? That would help guide the ongoing compiler work.