Top 20 documents for "Parallel and comparable corpora: What are they up to?"
Parallel and comparable corpora: What are they up to?
... English-Chinese parallel corpus of healthcare, found that the ratio of overt/covert marking of aspectual meanings was exceptionally low in Chinese ...a comparable L1 Chinese corpus using the same sampling ...
Improving MT System Using Extracted Parallel Fragments of Text from Comparable Corpora
... There has been a growing interest in approaches focused on extracting word translations from comparable corpora (Fung and McKeown, 1997; Fung and Yee, 1998; Rapp, 1999; Chiao and Zweigenbaum, 2002; Dejean ...
Phrase based Parallel Fragments Extraction from Comparable Corpora
... of parallel corpora confirms that, the quality is more important than quantity in collecting training ...the parallel fragments extracted by PESA, our method get better translation results in both ...
Extracting Parallel Fragments from Comparable Corpora for Data to text Generation
... of parallel training data, has led to a sizeable research effort to develop methods for automatically constructing parallel ...in comparable corpora, ...
A Multilingual Dataset for Evaluating Parallel Sentence Extraction from Comparable Corpora
... in comparable corpora, if performed by humans, can be extremely time consuming: exhaustively spotting such pairs in, say, two corpora of 400,000 sentences each may require the examination of 160 ...
Multimodal Comparable Corpora as Resources for Extracting Parallel Data: Parallel Phrases Extraction
... parallel corpora called bitexts for training the translation model and monolingual data to build the target language ...Unfortunately, parallel texts are a limited resource and they are often not available ...
Inversion Transduction Grammar Constraints for Mining Parallel Sentences from Quasi Comparable Corpora
... We found that sentence pairs with high alignment scores are not necessarily more similar than others. This might be due to the fact that EM estimation at each intermediate step is not reliable, since we only have a small ...
PEXACC: A Parallel Sentence Mining Algorithm from Comparable Corpora
... existing parallel data (to be described in subsection ...of parallel and, especially, quasi-parallel sentences than the binary classification approach of Munteanu & Marcu (2005) because of the ...
Parallel Sentence Retrieval From Comparable Corpora for Biomedical Text Simplification
... searching parallel sentences in monolingual comparable corpora indicates that the main difficulty is that such sentences may show low lexical overlap but be nevertheless ...language ...
Chinese–Japanese Parallel Sentence Extraction from Quasi–Comparable Corpora
... as parallel sentences rarely exist in quasi–comparable corpora, we plan to extend our system to parallel subsentential fragment ...Japanese parallel sentence ...
Extracting an English Persian Parallel Corpus from Comparable Corpora
... system, parallel corpora are of high importance. These parallel resources which have been aligned on the sentence level in two languages (source and target), are used in the training phase of the SMT ...
Parallel Sentence Extraction from Comparable Corpora with Neural Network Features
... Comparable corpora are a set of monolingual corpora that describe roughly the same topic in different ...of parallel sentences contained in the comparable ...of parallel sen- ...
Automatic Building and Using Parallel Resources for SMT from Comparable Corpora
... Building parallel resources for corpus based machine translation, especially Statistical Machine Translation (SMT), from comparable corpora has recently received wide attention in the field Machine ...
Multext East: Parallel and Comparable Corpora and Lexicons for Six Central and Eastern European Languages
... Multext East Parallel and Comparable Corpora and Lexicons for Six Central and Eastern European ...
Bilingual Lexicon Extraction from Comparable Corpora Enhanced with Parallel Corpora
... bilingual corpora. These corpora, known as comparable corpora, are comprised of texts sharing common features such as domain, genre, register, sampling period, ...of comparable ...
zNLP: Identifying Parallel Sentences in Chinese English Comparable Corpora
... domains. Comparable corpora are sets of texts in two or more languages that are selected according to similar specifications, but are not translations of each other (Sharoff et ...Nevertheless, ...
Overview of the Second BUCC Shared Task: Spotting Parallel Sentences in Comparable Corpora
... on parallel sentence extraction from comparable ...of parallel sentence pairs not yet in the provided gold ...the parallel sentence spotting ...
Extracting Parallel Sentences from Comparable Corpora using Document Level Alignment
... a comparable corpus, and describe novel methods for parallel sentence ex- ...for parallel sentence extraction on comparable corpora, and describes our approach, which finds a global ...
Improved Machine Translation Performance via Parallel Sentence Extraction from Comparable Corpora
Multext East: Parallel and Comparable Corpora and Lexicons for Six Central and Eastern European Languages
... The Multext-East Copernicus project (Erjavec et al., 1997) was a spin-off of the LRE project Multext (Ide and Véronis, 1994), intended to fill these gaps by developing significant resources ...