by lostmsu
1327 days ago
It was OK as an educational tool, but the 1000-line count no longer includes the GPU implementation, so it is not small. Given the code style, it is closer to 20k+ lines once formatted and with the GPU code included. It also doesn't support bfloat16, so it is doomed to be 2x slower.