[PDF] Top 20 Collecting and Using Comparable Corpora for Statistical Machine Translation
Has 10000 "Collecting and Using Comparable Corpora for Statistical Machine Translation" found on our website. Below are the top 20 most common "Collecting and Using Comparable Corpora for Statistical Machine Translation".
Collecting and Using Comparable Corpora for Statistical Machine Translation
... XML using a cesDOC format that can be validated against XCES standard ...bilingual comparable corpora two separate crawls are required (one per ... See full document
8
Ranking Translation Candidates Acquired from Comparable Corpora
... from comparable corpora has received the attention of a number of researchers (Fung and Cheung, 2004; Munteanu and Marcu, 2005; Munteanu and Marcu, 2006; Smith et ...by statistical machine ... See full document
9
Example Based Paraphrasing for Improved Phrase Based Statistical Machine Translation
... a translation is accumulated uniformely ev- ery time it is found associated with a source phrase in the training ...larger corpora an is- ...avoid collecting the full set of ex- amples has been shown ... See full document
11
Language and Translation Model Adaptation using Comparable Corpora
... Traditionally, statistical machine translation systems have relied on parallel bi-lingual data to train a translation ...a statistical machine translation system on news ... See full document
10
Mining a Comparable Text Corpus for a Vietnamese French Statistical Machine Translation System
... For collecting bilingual text data for the two sets S1, S2, the Web is an ideal source as it is large, free and available (Kilgarriff and Grefen- stette, ... See full document
8
Combining Bilingual and Comparable Corpora for Low Resource Machine Translation
... Statistical machine translation (SMT) per- formance suffers when models are trained on only small amounts of parallel ...rable corpora, to improve ...by using bilingual lexicon induc- ... See full document
9
Using Comparable Corpora to Adapt a Translation Model to Domains
... Statistical machine translation (SMT) requires a large parallel corpus, which is available only for restricted language pairs and ...estimating translation pseudo-probabilities from bilingual ... See full document
7
Neural Machine Translation for Low Resource Languages using Bilingual Lexicon Induced from Comparable Corpora
... and statistical machine translation ap- proaches are highly reliant on the availability of large amounts of data and are known to perform poorly in low resource ...on machine trans- lation ... See full document
8
Scaling Phrase Based Statistical Machine Translation to Larger Corpora and Longer Phrases
... for the MLE estimation of the translation probabil- ities for a single phrase. The complexity is domi- nated by the k terms in the equation, when the num- ber of occurrences of the phrase in the corpus is high. ... See full document
8
Domain Adaptation for Statistical Machine Translation with Domain Dictionary and Monolingual Corpora
... Statistical machine translation systems are usually trained on large amounts of bilingual text and monolingual ...for statistical machine translation, where in-domain bi- lingual ... See full document
8
Statistical Machine Translation with Word and Sentence Aligned Parallel Corpora
... that using word-aligned data in estimat- ing the parameters for machine translation leads to better alignments is ...better translation quality, we used a state-of-the-art phrase-based decoder ... See full document
8
Chinese Portuguese Machine Translation: A Study on Building Parallel Corpora from Comparable Texts
... To alleviate data scarcity problem, we extracted bilingual data from Macao government websites. 4 Macao govern- ment documents, as requested by law, are written and archived in both languages. Domains contained in these ... See full document
8
Adapting Translation Models to Translationese Improves SMT
... for statistical ma- chine translation are compiled from par- allel corpora; such corpora are manually translated, but the direction of translation is usually unknown, and is ... See full document
11
Corpus based Study and Identification of Mandarin Chinese Light Verb Variations
... from comparable corpora with statistical and machine learning approaches, the authors find the five light verbs 從事 congshi, 搞 gao, 加以 jiayi, 進行 jinxing, and 做 zuo can be reliably ... See full document
10
Enriching Parallel Corpora for Statistical Machine Translation with Semantic Negation Rephrasing
... parallel corpora is vi- tal for the quality of statistical machine translation (SMT) ...these corpora are expen- sive to ...parallel corpora, such as negated sentences, ... See full document
10
Using collocations from comparable corpora to find translation equivalents
... suggested translation equivalents (usually 50 to 100 suggestions) ordered alpha- betically or by their frequency in target language ...a Machine Translation (MT) ...find translation ... See full document
6
Communications between Deaf and Hearing Children Using Statistical Machine Translation
... learns statistical translation models from bilingual ...aligned corpora, which is useful for both sentence alignment and phrase table ...Some translation models introduced by IBM scientists in ... See full document
7
Tutorial: Corpora Quality Management for MT Practices and Roles
... and Translation program study localization, translation, NLP, NLU, CAT tools, and machine ...build statistical machine translation engines for world languages, including Arabic, ... See full document
97
Improved Machine Translation Performance via Parallel Sentence Extraction from Comparable Corpora
... [r] ... See full document
8
Parallel Corpora for bi Directional Statistical Machine Translation for Seven Ethiopian Language Pairs
... The translation of natural language by machine becomes a reality, for technologically favored languages, in the late 20th century although it is dreamt since the seventieth century (Hutchins, ...based ... See full document
8
Related subjects