|
|
|
|
|
by LoganDark
1082 days ago
|
|
> This is the kind of PR bamboozlement I was alluding to - literally none of the above is correct I'm basing my knowledge on discussions with other developers on rwkv.cpp because we were talking about how performance scales with the number of tokens per iteration. Memory speed/bandwidth came up and some things about M1 were said. Sorry about that. |
|