[PDF] Top 20 Trainable Greedy Decoding for Neural Machine Translation
Has 10000 "Trainable Greedy Decoding for Neural Machine Translation" found on our website. Below are the top 20 most common "Trainable Greedy Decoding for Neural Machine Translation".
Trainable Greedy Decoding for Neural Machine Translation
... of decoding, only a few research groups have tackled this problem by incorporating a target decoding algorithm into ...a neural machine translation model as a policy ...on ... See full document
11
Improving Neural Machine Translation through Phrase based Forced Decoding
... chine translation (SMT), neural machine translation (NMT) often sacrifices ade- quacy for the sake of ...phrase-based decoding cost for an NMT output and then using this cost to rerank ... See full document
11
Speeding Up Neural Machine Translation Decoding by Shrinking Run time Vocabulary
... 3. Several variants of Softmax have been pro- posed to solve its poor scaling properties on large vocabularies. Morin and Bengio (2005) propose hierarchical softmax, where at each step log 2 V binary classifications are ... See full document
6
Character based Decoding in Tree to Sequence Attention based Neural Machine Translation
... character-based decoding to a tree-based NMT model (Eriguchi et ...character-based decoding in the tree-based NMT ...English-to-Japanese translation task on the WAT’16 ...its translation ... See full document
9
Fast Decoding and Optimal Decoding for Machine Translation
... the greedy decoder iterates exhaustively over all alignments that are one operation away from the alignment under ...the greedy decoder alters the initial align- ment incrementally as shown in Figure 2, ... See full document
8
Speeding Up Neural Machine Translation Decoding by Cube Pruning
... Neural machine translation (NMT) has shown promising results and drawn more attention re- cently (Kalchbrenner and Blunsom, 2013; Cho et ...recurrent neural network (RNN) (Hochreiter and ... See full document
11
Fast Lexically Constrained Decoding with Dynamic Beam Allocation for Neural Machine Translation
... of neural machine translation (NMT) removes many ways of manually guiding the translation process that were available in older ...constrained decoding with a complex- ity of O (1) in ... See full document
11
Sharp Models on Dull Hardware: Fast and Accurate Neural Machine Translation Decoding on the CPU
... these neural machine translation (NMT) system, which is often at least an order of magnitude higher than a phrase-based system trained on the same ...single-threaded decoding speeds of 8-10 ... See full document
6
Improving Character Based Decoding Using Target Side Morphological Information for Neural Machine Translation
... poses extra overhead. Since MCWs can appear in various forms we require a very large vocabu- lary to i) cover as many morphological forms and words as we can, and ii) reduce the number of OOVs. Neural models by ... See full document
11
Multi agent Learning for Neural Machine Translation
... bidirectional decoding (Liu et ...agent decoding from left to right (L2R) while the other decoding in opposite direction ...the translation models by introducing a regularization term into the ... See full document
10
Greedy Search with Probabilistic N gram Matching for Neural Machine Translation
... model’s translation capability is nearly zero and the generated sentences are often meaningless and do not contain useful information for the training, so it is unreasonable to directly ap- ply the greedy ... See full document
7
Dynamic Oracle for Neural Machine Translation in Decoding Phase
... To the best of our knowledge, Goldberg et. al. (2012) first define the concept of dynamic oracle and propose an online algorithm for parsing problems, , which provides a set of optimal transitions for every valid parser ... See full document
5
Neural Machine Translation Decoding with Terminology Constraints
... We have presented our approach to NMT decod- ing with terminology constraints using decoder at- tentions which enables reduced output duplication and better constraint placement compared to ex- isting methods. Our ... See full document
7
Context Aware Neural Machine Translation Decoding
... general translation quality, allowing to rank them as ...the translation from the fused sys- tem is better than the baseline, and they consider the quality of both translations to tie 19% of the ... See full document
11
Exploring Recombination for Efficient Decoding of Neural Machine Translation
... checked decoding with a beam size of 10 on Zh-En ...output translation graph can hold much more output space than the original k-best list, and we found that on average the possible output se- quences were ... See full document
6
Greedy Decoding for Statistical Machine Translation in Almost Linear Time
... 10 translation candidates (instead of the top candidate) for each input word, and performing a (small) random num- ber of SWAP and INSERT operations before the actual search ... See full document
8
Incremental Decoding and Training Methods for Simultaneous Translation in Neural Machine Translation
... stream decoding scenario, the en- tire source sequence is not readily ...simultaneous translation is to chop the incoming input after every ...the translation and segmentation process operate ... See full document
7
Guiding Neural Machine Translation Decoding with External Knowledge
... 4.1 Forcing the presence of a given term In PBSMT, XML markups can be easily han- dled: when looking for translation options for each source phrase, the decoder checks both the exter- nal suggestions and the ... See full document
12
Towards Decoding as Continuous Optimisation in Neural Machine Translation
... novel decoding approach for neural machine translation (NMT) based on continuous ...mulate decoding, a discrete optimization problem, into a continuous problem, such that optimization ... See full document
11
Consensus Training for Consensus Decoding in Machine Translation
... by machine translation sys- ...over translation forests gives similar sta- bility benefits to recent work on lattice-based min- imum error rate training (Macherey et ... See full document
10
Related subjects