Top 20 documents related to "A Portable Method for Parallel and Comparable Document Alignment"
A Portable Method for Parallel and Comparable Document Alignment
... strictly parallel document alignment, simple approaches based on file name matching can be the most efficient methods, as they do not rely on any analysis of the content of ...filename-based ...
STACC, OOV Density and N-gram Saturation: Vicomtech’s Participation in the WMT 2018 Shared Task on Parallel Corpus Filtering
... on parallel corpus ...and portable method for parallel sentence identification in comparable ...core method was expanded with a penalty based on the amount of unknown words in ...
Accurate Parallel Fragment Extraction from Quasi-Comparable Corpora using Alignment Model and Translation Lexicon
... their method extracts sub-sentential fragments which are quite ...proposed method have an accuracy over 80%, while the remainder are partial ...the comparable sentences for alignment are bet- ...
DOCAL: Vicomtech’s Participation in the WMT16 Shared Task on Bilingual Document Alignment
... The system was developed to seek an optimal balance between precision and recall, and has shown promising results along these lines in different scenarios involving both parallel and comparable corpora ...
Unsupervised Alignment of Comparable Data and Text Resources
... text alignment, ...text alignment at the document level and at the sentence level, reporting results for several methodological variants as well as base- ...
Weighted Set Theoretic Alignment of Comparable Sentences
... as comparable corpora, has been extensively explored in the last two decades (Munteanu and Marcu, 2005; Sharoff et ...from comparable data is the alignment of sentences in monolingual ...
Sentence Alignment for Monolingual Comparable Corpora
... sentence alignment. Our method emphasizes the search for an overall alignment, while relying on a simple local similarity ...cal alignment within mapping fragments to find sentence ...the ...
Identifying Comparable Corpora Using LDA
... Manual alignment or creation of parallel corpora is exceedingly expensive, requiring highly skilled annotators or professional ...aligning parallel corpora, and extracted parallel segments ...
Bootstrapping Entity Translation on Weakly Comparable Corpora
... not parallel but comparable corpora, with asymmetry of entities and relationship as the asymmetry in the number of documents also ...leverages comparable parts from the corpora without ...
Extracting Parallel Phrases from Comparable Data
... two comparable documents have few or no parallel sentence pairs, there could still be parallel sub-sentential fragments, including word translation pairs, named entities, and long phrase ...from ...
Topic Models + Word Alignment = A Flexible Framework for Extracting Bilingual Dictionary from Comparable Corpus
... a comparable document-aligned corpus into a parallel topic-aligned corpus, then learning word alignments using co-occurrence ...word alignment re- ...of comparable data condi- ...
Identification of Comparable Argument-Head Relations in Parallel Corpora
... word alignment) might be used to identify configurations which give rise to inconsistent an- ...bootstrapping method which combines co-training with ideas from active ...
Multimodal Comparable Corpora as Resources for Extracting Parallel Data: Parallel Phrases Extraction
... for parallel data at the sentence level (Zhao and Vogel, 2002; Utiyama and Isahara, 2003; Munteanu and Marcu, 2005; Abdul-Rauf and Schwenk, ...noisy parallel texts, to quasi parallel texts (Fung ...
Bilingual Lexicon Extraction from Comparable Corpora Enhanced with Parallel Corpora
... through parallel sentences included in the comparable ...our method on a French/English comparable corpus within the sub-domain of breast cancer in the medical ...the comparable corpus ...
Extracting Parallel Sentences from Comparable Corpora using Document Level Alignment
... a comparable corpus, and describe novel methods for parallel sentence ex- ...for parallel sentence extraction on comparable corpora, and describes our approach, which finds a global sentence ...
Feature Based Method for Document Alignment in Comparable News Corpora
... any document would imply fewer possible document alignment pairs for the ...each document, we use the term extraction model from Vu et ...per document are 556/37, 410/28 and 384/28 for ...
A Generative Model for Extracting Parallel Fragments from Comparable Documents
... Although parallel corpora are essential language resources for many NLP tasks, they are rare or even not available for many language ...tain parallel fragments of information that can be used applications ...
Model Invertibility Regularization: Sequence Alignment With or Without Parallel Data
... word alignment (Section 5), restricted to the HMM model, MIR attains F- and Bleu score improvements that are comparable to those of ABA and ...each method by ...
Automatic Building and Using Parallel Resources for SMT from Comparable Corpora
... Building parallel resources for corpus based machine translation, especially Statistical Machine Translation (SMT), from comparable corpora has recently received wide attention in the field Machine ...
Set Theoretic Alignment for Comparable Corpora
... estimate document comparability by computing the coefficient on a subset of translated source sentences, discarding those containing large amounts of named entities or numbers, and taking the average of these ...