qwen3-coder-next runs fine on my consumer grade nvidia 4070. Performance is not spectacular, but it's only a little bit slower than a properly-fit model.
GPU: NVIDIA GeForce RTX 4060 Ti 16gb (I typo'd the GPU above)
(This is via Ollama on Ubuntu.)
But 1-3 tokens per second is much faster than a lot of other high end models I've tried, so I was pretty pleased with it. Obviously other models run much faster on this hardware though.
I didn't try a sched_spread with a 3090 and the MI100 which would provide 56GB ram