|
|
|
|
|
by fluffy87
2074 days ago
|
|
How fast is Julia wrt to CUDA C++, or C++/Rust? In those languages, I can actually take the theoretical practical hardware limits (memory BW, throughput, latency), and relatively easily achieve 99% of their utilization. When people say “very fast” and “my cores are at 100%” they often mean “my program achieves 1% of the perf the hardware can deliver”. |
|