by unlikelymordant 1211 days ago
Transformers are currently state of the art, i.e. the problems that transformers are solving right now can't be solved better by any other known technique/algorithm. That's the main reason they are so popular at the moment. LLMs are popular right now because a recent transformer-based neural network has proven to be fun/useful, i.e. GPT-3/ChatGPT. A lot more useful than previous language models, at least.
1 comments

RWKV is showing that maybe RNNs can perform on par with transformers:

https://github.com/BlinkDL/RWKV-LM