Hacker News new | ask | show | jobs
by phreeza 2417 days ago
XLnet is Bert with a bunch of additional training tricks.
1 comments

BERT is a Transformer with a bunch of additional training tricks. Transformer is self-attention with a bunch of additional training tricks...