• No results found

[PDF] Top 20 Refining Word Segmentation Using a Manually Aligned Corpus for Statistical Machine Translation

Has 10000 "Refining Word Segmentation Using a Manually Aligned Corpus for Statistical Machine Translation" found on our website. Below are the top 20 most common "Refining Word Segmentation Using a Manually Aligned Corpus for Statistical Machine Translation".

Refining Word Segmentation Using a Manually Aligned Corpus for Statistical Machine Translation

Refining Word Segmentation Using a Manually Aligned Corpus for Statistical Machine Translation

... the segmentation of an au- tomatic segmenter with reference to a WA cor- pus revealed a number of ...the word “bao fa” in Figure 1. Empirically we observed that this word is seg- mented as a single ... See full document

11

Can Word Segmentation be Considered Harmful for Statistical Machine Translation Tasks between Japanese and Chinese?

Can Word Segmentation be Considered Harmful for Statistical Machine Translation Tasks between Japanese and Chinese?

... Chinese. Word seg- mentation is thus normally adopted as an ini- tial step in most natural language processing tasks for these Asian ...Although word segmentation techniques have improved greatly ... See full document

10

Bilingually Motivated Domain Adapted Word Segmentation for Statistical Machine Translation

Bilingually Motivated Domain Adapted Word Segmentation for Statistical Machine Translation

... bilingual corpus with the relevant language segmented into basic writ- ing units ...of using the output from an existing statistical word aligner to obtain a set of candidate ...candidates ... See full document

9

Post Editing System For Statistical Machine Translation

Post Editing System For Statistical Machine Translation

... of translation is higher as compared to other ...their translation corresponding to high frequency ...on manually corrected corpus, as is evident from the manual evaluation and improved BLEU ... See full document

6

Using Word Embeddings for Improving Statistical Machine Translation of Phrasal Verbs

Using Word Embeddings for Improving Statistical Machine Translation of Phrasal Verbs

... vectors using neural net- works, the network is presented with a phrase cor- ...phrase corpus is similar to a word cor- pus except that some words are joined to make up ...phrases using a ... See full document

5

Clustered Word Classes for Preordering in Statistical Machine Translation

Clustered Word Classes for Preordering in Statistical Machine Translation

... Our work is based on the POS-based reorder- ing model described by Niehues and Kolss (2009), in which POS-based rules are extracted from a word aligned corpus, where the source side is part-of-speech ... See full document

7

Nonparametric Word Segmentation for Machine Translation

Nonparametric Word Segmentation for Machine Translation

... In statistical machine translation, the smallest unit is usually the word, defined as a token delimited by ...parallel corpus of source and target text, the training procedure first ... See full document

9

Empirical Study of Unsupervised Chinese Word Segmentation Methods for SMT on Large scale Corpora

Empirical Study of Unsupervised Chinese Word Segmentation Methods for SMT on Large scale Corpora

... Unsupervised word segmentation (UWS) can provide domain-adaptive segmenta- tion for statistical machine translation (SMT) without annotated data, and bilin- gual UWS can even optimize ... See full document

7

Topic Models + Word Alignment = A Flexible Framework for Extracting Bilingual Dictionary from Comparable Corpus

Topic Models + Word Alignment = A Flexible Framework for Extracting Bilingual Dictionary from Comparable Corpus

... and word alignment ...document-aligned corpus into a parallel topic-aligned cor- pus, then learning word alignments us- ing co-occurrence ...topic- aligned corpus is ... See full document

10

Character Cluster Based Segmentation using Monolingual and Bilingual Information for Statistical Machine Translation

Character Cluster Based Segmentation using Monolingual and Bilingual Information for Statistical Machine Translation

... novel segmentation approach for Phrase-Based Statistical Machine Translation (PB-SMT) to languages where word boundaries are not obviously marked by using both monolingual and ... See full document

8

Bayesian Semi Supervised Chinese Word Segmentation for Statistical Machine Translation

Bayesian Semi Supervised Chinese Word Segmentation for Statistical Machine Translation

... 6.2 Translation Task: Small Track IWSLT We evaluate our full model, using both monolin- gual and bilingual information, on the IWSLT ...training corpus was segmented using the unigram seg- ... See full document

8

Statistical Machine Translation with Word  and Sentence Aligned Parallel Corpora

Statistical Machine Translation with Word and Sentence Aligned Parallel Corpora

... of statistical machine translation for new language pairs and domains: a reduction in the cost of cre- ating new training data, and the development of more efficient methods for exploiting existing ... See full document

8

Improving Statistical Machine Translation with Word Class Models

Improving Statistical Machine Translation with Word Class Models

... Data sparsity is one of the major problems for statis- tical learning methods in natural language process- ing (NLP) today. Even with the huge training data sets available in some tasks, for many phenomena that need to ... See full document

5

Word reordering for Statistical Machine Translation Using Trigram Language Model

Word reordering for Statistical Machine Translation Using Trigram Language Model

... for word- reordering: Given a sentence outputted by the al- gorithm, we regard it as a permutation of the cor- rect sentence, and count the the number of in- version pairs in it, which can be seen as the dis- ... See full document

6

Improving Statistical MT through Morphological Analysis

Improving Statistical MT through Morphological Analysis

... for statistical machine translation is described by Lee ...Arabic-English translation takes as input POS-tagged English and Arabic text, where the Arabic words have been pre-segmented into ... See full document

8

HMM Word and Phrase Alignment for Statistical Machine Translation

HMM Word and Phrase Alignment for Statistical Machine Translation

... described word-to-phrase alignment mod- els capable of good quality bitext word ...Chinese-English translation and alignment they compare well to Model-4, even with large ...improves ... See full document

8

Integrating a Large, Monolingual Corpus as Translation Memory into Statistical Machine Translation

Integrating a Large, Monolingual Corpus as Translation Memory into Statistical Machine Translation

... trained with SRILM (Stolcke, 2002) on the target side of the training data. The weights of the log- linear model were optimized with MIRA (Watan- abe et al., 2007) on a held-out development set re- served for this ... See full document

8

Word Sense Disambiguation Improves Statistical Machine Translation

Word Sense Disambiguation Improves Statistical Machine Translation

... phrase-based statistical MT system and this im- provement is statistically ...into statistical MT via the intro- duction of two new features, we could explore other alternative ways of ... See full document

8

Word Sense Disambiguation vs  Statistical Machine Translation

Word Sense Disambiguation vs Statistical Machine Translation

... do word sense disambiga- tion models help statistical machine trans- lation quality? We present empirical re- sults casting doubt on this common, but unproved, ...assumption. Using a state-of- ... See full document

8

Learning to translate with products of novices: a suite of open ended challenge problems for teaching MT

Learning to translate with products of novices: a suite of open ended challenge problems for teaching MT

... apply machine learning or feature engi- neering to the task of reranking the systems, so we provided several ...weights using pairwise ranking optimization (PRO; Hopkins and May, 2011), with a perceptron as ... See full document

14

Show all 10000 documents...

Related subjects