Y
Hacker News
new
|
ask
|
show
|
jobs
by
bitL
2411 days ago
e.g. XLNet:
https://arxiv.org/abs/1906.08237
1 comments
phreeza
2411 days ago
XLnet is Bert with a bunch of additional training tricks.
link
bitL
2410 days ago
BERT is a Transformer with a bunch of additional training tricks. Transformer is self-attention with a bunch of additional training tricks...
link