|
|
|
|
|
by yyding
957 days ago
|
|
Good job!
I observed that you implemented many cuda kernels by yourselves. Just wondering your consideration or trade-off between implementating the kernels via pure CUDA code vs. implementing based on compiler like TVM/Triton. |
|