|
|
|
|
|
by pico_creator
534 days ago
|
|
Currently the strongest RWKV model is 32B in size: https://substack.recursal.ai/p/q-rwkv-6-32b-instruct-preview This is a full drop in replacement for any transformer model use cases on model sizes 32B and under, as it has equal performance to existing open 32B models in most benchmarks We are in works on a 70B, which will be a full drop in replacement for most text use cases |
|