|
|
|
|
|
by szvsw
739 days ago
|
|
I really disagree with pigeonholing it as an LLM architecture! It is much more general than that as I mentioned in another comment in this post [1] (and of course as mentioned in the original paper which you linked). [1] https://news.ycombinator.com/item?id=40616181 |
|