Y
Hacker News
new
|
ask
|
show
|
jobs
by
tullie
446 days ago
The other direction that isn’t explicitly mentioned in this post is the variants of SASRec and Bert4Rec that are still trained on ID-Tokens but showing scaling laws much like LLMs. E.g. Meta’s approach
https://arxiv.org/abs/2402.17152
(paper write up here:
https://www.shaped.ai/blog/is-this-the-chatgpt-moment-for-re...
)