Top 20 Extracting Parallel Phrases from Comparable Data
Extracting Parallel Phrases from Comparable Data
... Mining parallel data from comparable corpora is a promising approach for overcoming the data sparseness in statistical machine translation and other NLP ...two comparable ...
8
Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies
... Cross-linguistic studies, such as contrastive analysis or translation studies, rely more and more on corpus data. Most of the time, the favored corpora are bilingual, either comparable or parallel. ...
6
Two Ways to Use a Noisy Parallel News Corpus for Improving Statistical Machine Translation
... (i) extracting a parallel corpus from a comparable corpus (the so-called “Noisy Corpus”) and (ii) using in-domain data to adapt a baseline SMT ...Arabic/French comparable corpus, ...
8
Agreement based Learning of Parallel Lexicons and Phrases from Non Parallel Corpora
... into extracting parallel lexicons and phrases from non-parallel ...noisy data and leads to substantial improvements in phrase alignment and machine translation ...
10
Multi level Bootstrapping For Extracting Parallel Sentences From a Quasi Comparable Corpus
... mining parallel sentences from quasi-comparable bilingual texts which have very different sizes, and which include both in-topic and off-topic ...better parallel sentence extraction, better ...
7
Extracting Parallel Sentences from Comparable Corpora using Document Level Alignment
... of parallel sentences used in train- ...obtaining parallel sentences from non-parallel, or comparable data, such as news articles published within the same time period (Munteanu ...
9
Phrase based Parallel Fragments Extraction from Comparable Corpora
... the parallel corpora only exist in particular domains for a few number of language pairs, such as international conference recordings and legal ...Since comparable corpora exist in large quantities with ...
5
Mining Very Non Parallel Corpora: Parallel Sentence and Lexicon Extraction via Bootstrapping and E
... of extracting parallel sentences from far more disparate “very-non-parallel corpora” than previous “comparable corpora” methods, by exploiting bootstrapping on top of IBM Model 4 ...
7
Extracting Parallel Sub Sentential Fragments from Non Parallel Corpora
... good-quality parallel sentence pairs can be automatically extracted from comparable corpora, and used to improve the performance of machine translation (MT) ...large parallel corpora for ...
8
Identifying Parallel Documents from a Large Bilingual Collection of Texts: Application to Parallel Article Extraction in Wikipedia
... of parallel or closely parallel document ...of extracting parallel ...that extracting parallel sentences from a parallel corpus is something we do well, while ...
9
Traduction automatique à partir de corpus comparables: extraction de phrases parallèles à partir de données comparables multimodales (Automatic Translation from Comparable corpora : extracting parallel sentences from multimodal comparable corpora) [in French]
... bilingual parallel text, also called bitext. However parallel corpora are a limited resource and are often not available for some domains or language ...for extracting parallel sentences ...
8
The Use of Parallel and Comparable Data for Analysis of Abstract Anaphora in German and English
... all from the same text type, namely parliament ...noun phrases, in particular demonstrative label-noun chunks, which are likely to be abstract ...the parallel and comparable corpora described ...
8
A Generative Model for Extracting Parallel Fragments from Comparable Documents
... Although parallel corpora are essential language resources for many NLP tasks, they are rare or even not available for many language ...tain parallel fragments of information that can be used applications ...
9
Multimodal Comparable Corpora as Resources for Extracting Parallel Data: Parallel Phrases Extraction
... for parallel data at the sentence level (Zhao and Vogel, 2002; Utiyama and Isahara, 2003; Munteanu and Marcu, 2005; Abdul-Rauf and Schwenk, ...considerably, from noisy parallel texts, to ...
7
Extracting Parallel Fragments from Comparable Corpora for Data to text Generation
... While data and texts in the three example domains cited above do occur naturally, two factors mean they cannot be used directly as example corpora or training data for building D2T systems: one, most ...
5
Extracting an English Persian Parallel Corpus from Comparable Corpora
... of data on the Internet, statistical machine translation (SMT) has gained more ...system, parallel corpora are of high importance. These parallel resources which have been aligned on the sentence ...
6
Extracting Paraphrases from a Parallel Corpus
... During the preprocessing stage, we perform sentence alignment. Sentences which are translations of the same source sentence contain a number of identical words, which serve as a strong clue to the matching process. ...
8
Unsupervised Approach to Extracting Problem Phrases from User Reviews of Products
... problem phrases from texts are less ...extraction from user reviews of ...problem phrases to reduce classification ...services from English Twitter ...
6
Extracting Recurrent Phrases and Terms from Texts Using a Purely Statistical Method
6
Learning Translations of Named Entity Phrases from Parallel Corpora
... Thus if a target language word outside the candidate translation has a high probability of associating with a source language word in the selected phrase, that candidate translation ...
8