[PDF] Top 20 Unsupervised Tokenization for Machine Translation
Has 10000 "Unsupervised Tokenization for Machine Translation" found on our website. Below are the top 20 most common "Unsupervised Tokenization for Machine Translation".
Unsupervised Tokenization for Machine Translation
... makes tokenization in English usu- ally ...chine translation, one token should not necessarily correspond to one morphological unit, but rather should reflect the morphological units and writing system of ... See full document
9
Unsupervised Bilingual Segmentation using MDL for Machine Translation
... based as criterion with a more efficient greedy algo- rithm. Chen (2013) proposes a compression-based method using MDL and improve the performance of monolingual segmentation. Argamon et al. (2004) use an efficient ... See full document
8
Unsupervised Extraction of Partial Translations for Neural Machine Translation
... There are various methods for extracting sentence pairs from monolingual corpora. However, most of them rely on the availability of document-level information, in comparable corpora for instance, and usually for one ... See full document
11
Unsupervised Neural Machine Translation with SMT as Posterior Regularization
... because unsupervised NMT methods suffer from the noise problem while PBSMT is inherently defi- cient in fluency just as the case study in ...by unsupervised NMT, so that even PBSMT could not distinguish ... See full document
8
Translating Translationese: A Two Step Approach to Unsupervised Machine Translation
... Previous zero-shot NMT work compensates for a lack of source/target parallel data by either using source/pivot parallel data, extremely large monolingual data, or artificially generated data. These requirements and ... See full document
6
Integrating an Unsupervised Transliteration Model into Statistical Machine Translation
... Method 2: provides n-best transliterations to a monotonic decoder that uses a monolingual language model and a transliteration phrase- translation table to rescore transliterations. We carry forward the 4 ... See full document
6
Unsupervised Search for the Optimal Segmentation for Statistical Machine Translation
... of unsupervised determination of the optimal morphological segmentation for statistical machine translation (SMT) and propose a segmentation metric that takes into account both sides of the SMT ... See full document
6
Bilingual Lexicon Induction through Unsupervised Machine Translation
... an unsupervised phrase-based statistical machine translation system (Lample et ...extract translation candidates by taking the 100 nearest-neighbors of each source phrase, and score them with ... See full document
6
The LMU Munich Unsupervised Machine Translation System for WMT19
... and unsupervised NMT system were trained with German monolingual data which was not compound ...original unsupervised NMT system are trained with com- pound split German monolingual ... See full document
7
Multi Domain Neural Machine Translation through Unsupervised Adaptation
... ral Machine Translation (NMT) under the following three conditions posed by real- world application ...this unsupervised multi-domain setting, we explore an ef- ficient instance-based adaptation ... See full document
11
Phrase based Unsupervised Machine Translation with Compositional Phrase Embeddings
... to machine translation (Wu et ...neural machine translation (NMT) employs the encoder- decoder architecture, where the encoder reads the source sentence and produces its representation which ... See full document
7
Minimum Imputed Risk: Unsupervised Discriminative Training for Machine Translation
... for machine transla- tion has been well studied in the recent ...an unsupervised discriminative train- ing framework to incorporate the usually plen- tiful target-language monolingual data by us- ing a ... See full document
10
NICT’s Unsupervised Neural and Statistical Machine Translation Systems for the WMT19 News Translation Task
... using unsupervised Vecmap (Artetxe et ...backward translation systems, instead of using only either forward (Marie and Fujita, 2018b) or backward translations (Artetxe et ... See full document
8
Explicit Cross lingual Pre training for Unsupervised Machine Translation
... get unsupervised cross-lingual n-gram embeddings of them, from which we infer n-gram translation tables (source-to-target and target-to- ...n-gram translation pairs inferred in this way have proven ... See full document
10
Unsupervised Source Hierarchies for Low Resource Neural Machine Translation
... neural machine translation (NMT) has recently proven success- ful (Eriguchi et ...an unsupervised tree-to-sequence (tree2seq) model for neural machine translation; this model is able to ... See full document
7
Supervised and Unsupervised Machine Translation for Myanmar English and Khmer English
... sentence-level translation prob- abilities using the lexical translation probabil- ities learned by mgiza during the training of our SMT ...each translation direc- tion were also ...the ... See full document
8
Unsupervised Discriminative Language Model Training for Machine Translation using Simulated Confusion Sets
... the unsupervised CDLM training suffer from this unrelated limitation of the tuning procedure, we give it too the benefit of being able to compute risk on Set3 using y plus its 15 ... See full document
9
An Effective Approach to Unsupervised Machine Translation
... standard machine transla- tion benchmarks using monolingual corpora ...on unsupervised cross-lingual embedding mappings, which independently train word embeddings in two languages and learn a linear ... See full document
10
Unsupervised Neural Machine Translation with Future Rewarding
... Recently, motivated by the success of cross- lingual embeddings (Artetxe et al., 2016; Zhang et al., 2017; Conneau et al., 2017), several works have tried to train NMT or SMT models using un- supervised setting, in which ... See full document
10
Unsupervised Neural Machine Translation with Weight Sharing
... For model selection, we stop training when the model achieves no improvement for the tenth e- valuation on the development set, which is com- prised of 3000 source and target sentences extract- ed randomly from the ... See full document
10
Related subjects