changes.mady.by.user Isaac Godfried
Saved on Oct 20, 2020
Dual Stage Attention RNN (DA-RNN)
Transformer ModelsBased models
Multi-Head Attention Variations
Vanilla LSTM and GRU