Hacker News
by justicezyx 2184 days ago
Note that this is a systems paper, not an ML/DL/NLP paper. It's kind of OK to expand the parameter count to such a large number.