[PDF] Top 20 Sentence Alignment for Monolingual Comparable Corpora
Has 10000 "Sentence Alignment for Monolingual Comparable Corpora" found on our website. Below are the top 20 most common "Sentence Alignment for Monolingual Comparable Corpora".
Sentence Alignment for Monolingual Comparable Corpora
... Impact of Cluster Quality. Our method uses clustering to identify the different topics of each collection. It is important to know how sensitive our overall algorithm is to the quality of the iden- tified clusters. ... See full document
8
Identification of Parallel Sentences in Comparable Monolingual Corpora from Different Registers
... In relation with monolingual comparable cor- pora, the main difficulty is that sentences may show low lexical overlap but be nevertheless paral- lel. Recently, this task gained in popularity thanks to the ... See full document
11
Similarity Based Alignment of Monolingual Corpora for Text Simplification Purposes
... of monolingual text alignment is to find similar text fragments, it forms an important subtask of applications such as text reuse ...measuring sentence similarity by a modified version of a TF-IDF ... See full document
10
Bootstrapping Translation Detection and Sentence Extraction from Comparable Corpora
... mine comparable corpora without any prior trans- lation information or parallel ...use monolingual corpora to es- timate phrase-based SMT ... See full document
6
Unsupervised Extraction of Partial Translations for Neural Machine Translation
... extracting sentence pairs from monolingual ...in comparable corpora for instance, and usually for one specific domain, to efficiently extract accurate sentence pairs (Abdul Rauf and ... See full document
11
Paraphrase Fragment Extraction from Monolingual Comparable Corpora
... parallel corpora for training and then extract sub-sentential translation ...parallel corpora, comparable corpora are non-parallel bilingual cor- pora whose documents convey the similar ... See full document
9
Parallel Sentence Retrieval From Comparable Corpora for Biomedical Text Simplification
... their alignment rate. The alignment rate for a given corpus is the number of sentences that are part of an aligned pair relative to the total number of ...highest alignment rate of sentences, while ... See full document
10
PEXACC: A Parallel Sentence Mining Algorithm from Comparable Corpora
... words, sentence lengths (with lengths difference and lengths ratio), ...source sentence into the target language and then apply MT assessment measures such as WER (the Levenshtein distance), TER (Snover et ... See full document
8
Feature Based Method for Document Alignment in Comparable News Corpora
... extract comparable bilingual text without us- ing any linguistic ...two monolingual features 25 , a term frequency normalization (Stephan et ...across corpora which are linguistically very ...gual ... See full document
9
A Holistic Approach to Bilingual Sentence Fragment Extraction from Comparable Corpora
... ering comparable corpora rather than parallel ...document alignment and then sen- tence ...take sentence po- sitions into account and mainly uses the number of trans- lated words in ... See full document
7
Parallel Sentence Extraction from Comparable Corpora with Neural Network Features
... Comparable corpora are a set of monolingual corpora that describe roughly the same topic in different ...a sentence pair, they could be helpful for this ... See full document
5
Improving Statistical Machine Translation with Monolingual Collocation
... Collocation is generally defined as a group of words that occur together more often than by chance (McKeown and Radev, 2000). A colloca- tion is composed of two words occurring as ei- ther a consecutive word sequence or ... See full document
9
A Multilingual Dataset for Evaluating Parallel Sentence Extraction from Comparable Corpora
... these monolingual corpora. Splitting the result- ing corpora after insertion was therefore liable to separate a large proportion of sentence ...the corpora before parallel ... See full document
6
Chinese–Japanese Parallel Sentence Extraction from Quasi–Comparable Corpora
... not sentence–level ...of sentence pairs extracted by “+Rank (Proposed)”, where parallel subsenten- tial fragments are in ...the alignment results of the extracted ... See full document
9
ACCURAT Toolkit for Multi Level Alignment and Information Extraction from Comparable Corpora
... parallel corpora and linguistic resources for many languages and domains is one of the major obstacles for the further advancement of automated ...exploit comparable corpora (non-parallel bi- or ... See full document
6
Automatic Building and Using Parallel Resources for SMT from Comparable Corpora
... before the actual task. Some of the systems use semantic features (e.g., logical inference, Semantic Role Labelling) for solving the text and hypothesis entailment problem. MacCartney et al. (2006) proposed a new ... See full document
10
zNLP: Identifying Parallel Sentences in Chinese English Comparable Corpora
... Previous work (Smith et al., 2010; Munteanu and Marcu, 2005) on parallel sentence extrac- tion from comparable corpora has used external clues for this purpose. (Smith et al., 2010) boot- strapped ... See full document
5
Overview of the Second BUCC Shared Task: Spotting Parallel Sentences in Comparable Corpora
... within comparable corpora (Utiyama and Isahara, 2003; Munteanu et ...promising sentence pairs be- fore examining them more ...two monolingual corpora in which pairs of translated ... See full document
8
Extracting Parallel Sentences from Comparable Corpora using Document Level Alignment
... document alignment in news and web corpora has been explored by a number of researchers, includ- ing Resnik and Smith (2003), Munteanu and Marcu (2005), Tillmann and Xu (2009), and Tillmann ... See full document
9
Bilingual Word Embeddings with Bucketed CNN for Parallel Sentence Extraction
... the sentence pairs to be classified into dif- ferent groups and train CNN’s ...solves sentence alignment problem but our model can be regarded as a generic bag-of-words sim- ilarity measure for ... See full document
6
Related subjects