	by treksis 150 days ago
	How fast is this compared to the Python-based implementations?

3 comments

antirez 150 days ago

Very slow currently; I added the benchmarks to the README. To go faster, it needs inference kernels that are faster than the current float32-only ones.


rcarmo 150 days ago

The Python libraries are themselves written in C/C++, so performance-wise the most this can do is cut out some glue. Don't think of this as a performance-driven implementation.


throwaway314155 150 days ago

PyTorch MPS is about 10x faster per the README.md.


antirez 150 days ago

I cut the speed difference in half by keeping the activations on the GPU. Time to sleep, but I will continue tomorrow.


Numerlor 150 days ago

Have you tried something like Mojo, which can vectorize (use SIMD) without needing intrinsics everywhere?
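
For context on the "without intrinsics" point: plain C compilers can often auto-vectorize simple float32 loops on their own. The following is a minimal sketch, not code from the project being discussed; the function name `saxpy_f32` is hypothetical, and whether the loop is actually turned into SIMD instructions depends on the compiler and flags (GCC and Clang typically do so at `-O3`).

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical float32 kernel (illustration only): y[i] += alpha * x[i].
 * No intrinsics are used. The `restrict` qualifiers promise the compiler
 * that x and y do not overlap, which removes the aliasing concern that
 * would otherwise make auto-vectorization harder. */
void saxpy_f32(size_t n, float alpha,
               const float *restrict x, float *restrict y)
{
    assert(x != NULL && y != NULL);
    /* Simple counted loop with independent iterations: a shape that
     * optimizing compilers can usually map onto SIMD lanes. */
    for (size_t i = 0; i < n; i++) {
        y[i] += alpha * x[i];
    }
}
```

Elementwise loops like this one vectorize without changing results. Reductions such as dot products generally need relaxed floating-point flags before the compiler will reorder the additions, which is one reason hand-written intrinsics remain common in inference kernels.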
