by pkoird 380 days ago
Back to TF-IDF we go.
1 comment

One could argue TF-IDF is a case of an attention layer... but not quadratic in inference/training, and kinda just a quotient. Yeah, maybe we should go back.
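
For anyone who wants the "just a quotient" part spelled out, here is a rough sketch of plain TF-IDF in Python. It assumes the common textbook variant (term frequency times log of N over document frequency); the function name `tf_idf` and the toy documents are made up for illustration. Each term's weight comes from per-document counts and a corpus-level ratio, so there is no token-by-token score matrix like the one attention builds.

```python
import math
from collections import Counter

def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document.

    Assumed variant: tf = count / doc length, idf = log(N / df).
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)  # a single pass over the document's tokens
        weights.append({
            term: (count / len(doc)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights

# Hypothetical toy corpus, already tokenized.
docs = [
    "the cat sat on the mat".split(),
    "the dog sat".split(),
    "attention is all you need".split(),
]
weights = tf_idf(docs)
```

In this toy corpus "cat" ends up weighted above "the" in the first document even though "the" occurs twice, because "the" also shows up in another document and the log ratio discounts it.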