What is Neural Machine Translation?

Overview

Neural Machine Translation (NMT) uses neural networks to train a framework to translate from one language to another. The semantics of languages differ so starkly that it becomes challenging to prescribe a phrase-to-phrase or word-to-word translation system.

Note: Using neural networks to train a translation system decreases reliance on individual words and instead focuses on the larger context of the sentence

How does NMT work?

Largely, NMT systems tend to contain a series of deep neural networks that can be trained to translate an entire sentence, paragraph or even a document.

Generally, the input sequence is encoded into a sequence of numbers. The resultant sequence of numbers is eventually fed into the decoding network that converts the sequence into the output sequence of words.

Advantages over traditional techniques

More realistic translation: Translation using NMT takes into account the entire context and structure rather than a limited scope of neighboring words and hence is more likely to capture the true essence of the sentence.
Diminishing need for domain knowledge: For cross-language translation involving slang words and idiomatic expressions, the need for manual rules-based mapping via a human resource is minimized as the model can be trained over such cases using the dataset.
Cost-effective and Reusable: For languages that have different dialects and regional variants, the rules and mapping need not be defined again. The existing model can be trained using additional data and be tuned to match its respective application.

On the whole, neural machine translation provides a robust framework that forms a one-stop solution for much of the language-translation problems that especially fail to capture the contextual cues. It is widely accepted and used in popular services such as the Google Neural Machine Translation System.

What is Neural Machine Translation?

Overview

How does NMT work?

Embedding Layer

Encoding using Recurrent Neural Networks

Decoding using Recurrent Neural Networks

Classification Layer

Advantages over traditional techniques