Skip to content

Neural Machine Translation by Jointly Learning to Align and Translate

Notifications You must be signed in to change notification settings

ahsanabbas123/Neural-Machine-Translation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 

Repository files navigation

NMT by Jointly Learning to Align and Translate Open In Colab

Allievates the information compression problem by allowing the decoder to "look back" at the input sentence by creating context vectors that are weighted sums of the encoder hidden states. The weights for this weighted sum are calculated via an attention mechanism, where the decoder learns to pay attention to the most relevant words in the input sentence.
Based on the paper Neural Machine Translation by Jointly Learning to Align and Translate

Dataset

The data for this project is a set of many thousands of English to French translation pairs.

Architecture

A Sequence to Sequence network, or seq2seq network, or Encoder Decoder network, is a model consisting of two RNNs called the encoder and decoder. The encoder reads an input sequence and outputs a single vector, and the decoder reads that vector to produce an output sequence.

Attention allows the decoder network to “focus” on a different part of the encoder’s outputs for every step of the decoder’s own outputs. First we calculate a set of attention weights. These will be multiplied by the encoder output vectors to create a weighted combination. The result (called attn_applied in the code) should contain information about that specific part of the input sequence, and thus help the decoder choose the right output words.

About

Neural Machine Translation by Jointly Learning to Align and Translate

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published