Dual-Stage Attention RNN (DA-RNN)
Transformer Models