Y
Hacker News
new
|
ask
|
show
|
jobs
by
treksis
150 days ago
how fast is this compare to python based?
3 comments
antirez
150 days ago
Very slow currently, I added the benchmarks in the README. To go faster it needs to implement inference faster than the current float32-only kernels.
link
rcarmo
150 days ago
The Python libraries are themselves written in C/C++, so what this does performance-wise is, at best, cutting through some glue. Don't think about this as a performance-driven implementation.
link
throwaway314155
150 days ago
PyTorch MPS is about 10x faster per the README.md.
link
antirez
150 days ago
I cut the difference in speed by half by taking the activations on the GPU. Time to sleep but will continue tomorrow.
link
Numerlor
150 days ago
Have you tried e.g. Mojo that can vectorize/do SIMD without having to do intrinsics everywhere?
link