|
|
|
|
|
by LoganDark
1093 days ago
|
|
> The only reason open-source LLMs have a heartbeat is they’re standing on Meta’s weights. Not necessarily. RWKV, for example, is a different architecture that wasn't based on Facebook's weights whatsoever. I don't know where BlinkDL (the author) got the training data, but they seem to have done everything mostly independently otherwise. https://github.com/BlinkDL/RWKV-LM disclaimer: I've been doing a lot of work lately on an implementation of CPU inference for this model, so I'm obviously somewhat biased since this is the model I have the most experience in. |
|