Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Dual Stage Attention RNN (DA-RNN)

Transformer ModelsBased models

Multi-Head Attention Variations

Vanilla LSTM and GRU