• No results found

[PDF] Top 20 Unsupervised Bilingual Segmentation using MDL for Machine Translation

Has 10000 "Unsupervised Bilingual Segmentation using MDL for Machine Translation" found on our website. Below are the top 20 most common "Unsupervised Bilingual Segmentation using MDL for Machine Translation".

Unsupervised Bilingual Segmentation using MDL for Machine Translation

Unsupervised Bilingual Segmentation using MDL for Machine Translation

... method using MDL and improve the performance of monolingual ...cal segmentation using ...exploiting MDL to achieve monolingual seg- mentation, and indicate that MDL-based method ... See full document

8

Unsupervised Search for the Optimal Segmentation for Statistical Machine Translation

Unsupervised Search for the Optimal Segmentation for Statistical Machine Translation

... with bilingual cost (Morfessor-bi) against the word-based ...post- segmentation token alignments and the Moses toolkit (Koehn et ...phrase-based translation model genera- tion and ... See full document

6

An improved MDL based compression algorithm for unsupervised word segmentation

An improved MDL based compression algorithm for unsupervised word segmentation

... and MDL methods, though it still lags 7 percentage points behind the best result achieved by adap- tors grammar with ...test machine, it took roughly 15 hours for one instance of adaptors grammar with ... See full document

5

Character Cluster Based Segmentation using Monolingual and Bilingual Information for Statistical Machine Translation

Character Cluster Based Segmentation using Monolingual and Bilingual Information for Statistical Machine Translation

... novel segmentation approach for Phrase-Based Statistical Machine Translation (PB-SMT) to languages where word boundaries are not obviously marked by using both monolingual and bilingual ... See full document

8

Unsupervised Word Segmentation Improves Dialectal Arabic to English Machine Translation

Unsupervised Word Segmentation Improves Dialectal Arabic to English Machine Translation

... Collecting resources for dialectal Arabic: Several researchers have directed efforts to de- velop DA computational resources (Maamouri et al., 2006; Al-Sabbagh and Girju, 2010; Zaidan and Callison-Burch, 2011; Salama et ... See full document

10

Fully Unsupervised Word Segmentation with BVE and MDL

Fully Unsupervised Word Segmentation with BVE and MDL

... The algorithms described in Section 3 are all rela- tively recent algorithms based on entropy. Many al- gorithms for computational morphology make use of concepts similar to branching entropy, such as successor count. ... See full document

6

Unsupervised Paraphrasing without Translation

Unsupervised Paraphrasing without Translation

... on Machine Transla- tion (MT) have proven popular due to the scarcity of labeled paraphrase pairs (Callison-Burch, 2007; Mallinson et ...Conceptually, translation is ap- pealing since it abstracts semantic ... See full document

7

Unsupervised Discriminative Language Model Training for Machine Translation using Simulated Confusion Sets

Unsupervised Discriminative Language Model Training for Machine Translation using Simulated Confusion Sets

... target translation of a source-language ...the bilingual Hiero grammar are both SCFGs, the confusion sets are isomorphic with translation hypergraphs that are used by supervised discriminative train- ... See full document

9

Unsupervised Adaptation for Statistical Machine Translation

Unsupervised Adaptation for Statistical Machine Translation

... by using the hypotheses also for TM ...the bilingual data, and then use the target side of the filtered bilingual data to perform LM ...by using both the in-domain source- language corpus and ... See full document

9

Linguistically Motivated Unsupervised Segmentation for Machine Translation

Linguistically Motivated Unsupervised Segmentation for Machine Translation

... Philipp Koehn, Hieu Hoang, Alexandra Birch, Chris Callison-Burch, Marcello Federico, Nicola Bertoldi, Brooke Cowan, Wade Shen, Christine Moran, Richard Zens, Chris Dyer, Ondrej Bojar, Alexandra Constantin, and Evan ... See full document

5

Bilingual Lexicon Induction through Unsupervised Machine Translation

Bilingual Lexicon Induction through Unsupervised Machine Translation

... We believe that, beyond the substantial gains in this particular task, our work has important implications for future research in cross-lingual word embedding mappings. While most work in this topic uses BLI as the only ... See full document

6

Empirical Study of Unsupervised Chinese Word Segmentation Methods for SMT on Large scale Corpora

Empirical Study of Unsupervised Chinese Word Segmentation Methods for SMT on Large scale Corpora

... word segmentation (UWS) can provide domain-adaptive segmenta- tion for statistical machine translation (SMT) without annotated data, and bilin- gual UWS can even optimize segmenta- tion for ...their ... See full document

7

Unsupervised Bilingual Word Embedding Agreement for Unsupervised Neural Machine Translation

Unsupervised Bilingual Word Embedding Agreement for Unsupervised Neural Machine Translation

... Recently, several UBWE methods (Conneau et al., 2018; Artetxe et al., 2018a) have been applied to UNMT (Artetxe et al., 2018c; Lample et al., 2018a). These rely solely on monolingual corpora in each language via UBWE ... See full document

11

Unsupervised Tokenization for Machine Translation

Unsupervised Tokenization for Machine Translation

... Tokenizing a parallel corpus is usually the first step of training a statistical machine translation system. With languages such as Chinese, which has no spaces in its writing system, the main chal- lenge ... See full document

9

Bilingual Sense Similarity for Statistical Machine Translation

Bilingual Sense Similarity for Statistical Machine Translation

... evaluated bilingual sense similarity algorithms applied to a hierarchical phrase-based system, this method is also suitable for syntax-based MT systems and phrase-based MT ...the bilingual word alignment or ... See full document

10

Passive and Pervasive Use of Bilingual Dictionary in Statistical Machine Translation

Passive and Pervasive Use of Bilingual Dictionary in Statistical Machine Translation

... a bilingual dictio- nary is to hijack the decoding process and force word/phrase translations as per the dictionary en- ...Assisted Translation (CAT) environment to translate documents in the technical ... See full document

5

Building a Better Bitext for Structurally Different Languages through Self training

Building a Better Bitext for Structurally Different Languages through Self training

... Korean-English machine translation do not im- prove when using the newly produced aligned cor- ...loose translation of them or even involve substantial ...of translation by a ratio ... See full document

10

Wider Context by Using Bilingual Language Models in Machine Translation

Wider Context by Using Bilingual Language Models in Machine Translation

... for Machine Translation of European Languages, it could be shown that the translation performance of SMT systems can be increased by integrating a bilingual lan- guage model into a ... See full document

9

Fluency enhancement : applications to machine translation : thesis for Master of Engineering in Information & Telecommunications Engineering, Massey University, Palmerston North, New Zealand

Fluency enhancement : applications to machine translation : thesis for Master of Engineering in Information & Telecommunications Engineering, Massey University, Palmerston North, New Zealand

... in using the target language, then there is not a lot I can do with the translation result since I am incapable of doing any adequate post ...choose translation direction, they should also be able to ... See full document

181

An Effective Approach to Unsupervised Machine Translation

An Effective Approach to Unsupervised Machine Translation

... standard machine transla- tion benchmarks using monolingual corpora ...on unsupervised cross-lingual embedding mappings, which independently train word embeddings in two languages and learn a linear ... See full document

10

Show all 10000 documents...