|
|
|
|
|
by thesz
151 days ago
|
|
> like graph relationships
Once upon a time during me being language modeling researcher I built and finetuned a big (at the time - about 5 billions parameters) Sparse Non-Negative Matrix Language Model [1].[1] https://aclanthology.org/Q16-1024/ As this model allows for mix-and-match of various contexts, one thing that I did is to have a word-sorted context. This effectively transforms position-based context into a word-set based context, where "you and me", "me and you" and "and me you" are the same. This allowed for longer contexts and better prediction. |
|
https://aclanthology.org/2025.acl-long.1564/
Thanks to you both for two, interesting papers tonight. :)