Hacker News new | ask | show | jobs
by solarkraft 165 days ago
My bad, I took this as something Multi-head Latent Attention (MLA) related.