| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by mynameismon 637 days ago
	How easy is it to run on older GPUs (think 1080Tis)? The reason I ask this is because torch.compile refuses to support that, and that alone makes things much slower.

3 comments

danielhanchen 637 days ago

The other issue is Pascal cards don't have tensor cores, so there much slower than those with them. You could try Unsloth for 2x faster llama fine-tuning - someone made P40s and P100s work. Although I would suggest upgrading to at least RTX 20x series.

link

formalsystem 637 days ago

The project is very much focused on maxing out tensor cores and since older GPUs don’t have them it’s not where the project shines best

link

almostgotcaught 637 days ago

> torch.compile

torch.compile is a pt2.0 feature and has nothing to do with handwritten cuda kernels

> How easy is it to run on older GPUs

this is a torch cpp extension

https://github.com/HazyResearch/ThunderKittens/blob/8daffc9c...

so you're going to have the same exact issue (whatever issue you're having)

link