XLNet: Generalized Autoregressive Pretraining
for Language Understanding
Zhilin Yang¹, Zihang Dai¹˒², Yiming Yang¹, Jaime Carbonell¹,
Ruslan Salakhutdinov¹, Quoc V. Le²
¹Carnegie Mellon University, ²Google AI Brain Team
{zhiliny,dzihang,yiming,jgc,rsalakhu}@cs.cmu.edu, qvl@google.com
Abstract
With the capability of modeling bidirectional contexts, denoising autoencoding
based pret ...