[PDF] Top 20 Understanding and Improving Hidden Representations for Neural Machine Translation

Understanding and Improving Hidden Representations for Neural Machine Translation

... the hidden representations of NMT for better predictive performances regarding those relative tasks, in hope of achieving improved performance in terms of the target translation ...every ... See full document

12

Visualizing and Understanding Neural Machine Translation

... the translation process can be denoted as a deriva- tion that comprises a sequence of translation rules ...these translation rules are interpretable from a linguistic ... See full document

10

Understanding Neural Machine Translation by Simplification: The Case of Encoder-free Models

... The attention mechanism has been introduced as a way to learn an alignment between the source and target text, and improves encoder-decoder models significantly, while also providing a way to inter- pret the inner ... See full document

8

Towards Understanding Neural Machine Translation with Word Importance

... Important words are more influential on translation performance than the others. Under three different perturbations, perturbing words of top-most importance leads to lower BLEU scores than Random selected words. It ... See full document

10

Understanding and Improving Morphological Learning in the Neural Machine Translation Decoder

... Both Joint-data learning and Multi-task learning improved overall translation performance. In the case of En→De, the performance of both ap- proaches is very similar. However, each has its own pros and cons. While ... See full document

10

Towards Bidirectional Hierarchical Representations for Attention based Neural Machine Translation

... and hidden layer are respectively set as 620 and ...other hidden states, are set to ...the translation relative to a ref- erence is assessed using the BLEU metric (Pap- ineni et ... See full document

10

Improving Neural Machine Translation Models with Monolingual Data

... pure neural machine translation architec- tures was first investigated by (Gülçehre et ...recurrent hidden state of the language model to the decoder state of the encoder-decoder network, with ... See full document

11

Improving Robustness of Neural Machine Translation with Multi task Learning

... for machine translation, Tu et ...the hidden layers of the translation de- ...speech translation if the speech translation is based on both the input speech and its transcrip- ... See full document

7

Learning Joint Multilingual Sentence Representations with Neural Machine Translation

... We can make the following observations. First, using an BLSTM with max-pooling (Table 1 right) performs much better than an LSTM and using the last hidden state as sentence representa- tion (Table 1 left). This ... See full document

11

Improving Lexical Choice in Neural Machine Translation

... We use the global attentional model with gen- eral scoring function and input feeding by Lu- ong et al. (2015a). We provide only a very brief overview of this model here. It has an encoder, an attention, and a decoder. ... See full document

10

Pre trained language model representations for language generation

... model representations have been successful in a wide range of language understanding ...trained representations into sequence to sequence models and apply it to neural machine ... See full document

8

Syntax Enhanced Neural Machine Translation with Syntax Aware Word Representations

... We take the simple yet effective Seq2Seq model with attention mechanism proposed by Luong et al. (2015) as our baseline. Under the stan- dard encoder-decoder architecture, an encoder first maps the source-language input ... See full document

11

Improving Robustness in Real World Neural Machine Translation Engines

... We use a shared vocabulary BPE Model (Sen- nrich et al., 2016) for subword segmentation, with a code of 32000 merge operations. We use con- volutional (Gehring et al., 2017) encoder-decoder (15x15) architecture with the ... See full document

7

Improving Sequence to Sequence Neural Machine Translation by Utilizing Syntactic Dependency Information

... and hidden state proposal matrices were initial- ized as random orthogonal matrices. Weights were optimized using the Adadelta algorithm and were updated with a mini-batch size of 32 sentences. The vocabulary ... See full document

9

Word Representations in Factored Neural Machine Translation

... single hidden-to-output (h2o) layer which is used by the two separate ...decoder’s hidden state and the concatenation of the embeddings of the previous generated ... See full document

12

Learning Hidden Unit Contribution for Adapting Neural Machine Translation Models

... for neural machine translation (NMT) is starting to get more attention from the scientific ...and machine translation practitioners want to improve the performance of their systems on a ... See full document

6

Improving Japanese to English Neural Machine Translation by Voice Prediction

... recurrent neural networks such as encoder-decoder models have gained increasing attention in machine translation owing their abil- ity to generate fluent ...German neural machine ... See full document

6

Sequence to Dependency Neural Machine Translation

... generated translation and its corresponding dependency ...the translation of SMT is disfluent and ungram- matical, whereas RNNsearch is better than ...the translation of RNNsearch is locally fluent ... See full document

10

Improving Machine Translation Quality Estimation with Neural Network Features

... and machine translation outputs as features, and combined them with baseline features to improve the system performance of ...and neural machine translation log-likelihood features ... See full document

5

Improving Neural Machine Translation Robustness via Data Augmentation: Beyond Back Translation

... back translation (BT) (Sennrich et al., 2016a) with forward translation (FT) on noisy texts and find that pseudo-parallel data from forward translation can help improve more ... See full document

9