[PDF] Top 20 Automatic Parallel Fragment Extraction from Noisy Data
Has 10000 "Automatic Parallel Fragment Extraction from Noisy Data" found on our website. Below are the top 20 most common "Automatic Parallel Fragment Extraction from Noisy Data".
Automatic Parallel Fragment Extraction from Noisy Data
... In this work we are concerned with finding large phrases, 3 since very small phrases tend to be ex- tractible even when data is noisy. Bad alignments tend to cause conflicts when extracting large phrases ... See full document
5
Automatic Extraction of Synonyms for German Particle Verbs from Parallel Data with Distributional Similarity as a Re-Ranking Feature
... the extraction of synonyms for German particle verbs based on a word-aligned German-English parallel corpus: by translating the particle verb to a pivot, which is then translated back, a set of synonym ... See full document
8
Accurate Parallel Fragment Extraction from Quasi–Comparable Corpora using Alignment Model and Translation Lexicon
... for fragment extrac- ...external parallel data for alignment ex- tracts more fragments than only using the com- parable sentences, and the average size is slightly ...allel data is helpful to ... See full document
7
Investigations on Translation Model Adaptation Using Monolingual Data
... monolingual data in the source language. These automatic translations are filtered using the sentence-length normalized log score of Moses, ...The automatic translations were added to the ... See full document
10
Automatic Bilingual Phrase Extraction from Comparable Corpora
... extract parallel phrases from compa- rable corpora using a ...The data used to train the classifier is automatically derived from parallel ...automatically from parallel ... See full document
10
Analysis of an Automatic Text Content Extraction Approach in Noisy Video Images
... The wavelet transform has become a useful computational tool for noise reduction in signal. For many signals, the low- frequency content is the most important part. It is what gives the signal its identity. On the other ... See full document
8
Crowdsourcing High Quality Parallel Data Extraction from Twitter
... selected data that are ...of parallel sen- tences, we observe that using the crowdsourced corpus yields better scores than the automatically extracted corpora, comparable to experts annota- ...Twitter ... See full document
11
Improving Neural Machine Translation Using Noisy Parallel Data through Distillation
... gual data in the source and target languages from multilingual news portals such as Agence France- Presse (AFP), BBC news, Euronews ...by automatic document and sen- tence alignment techniques ... See full document
10
A Bayesian framework for extracting human gait using strong prior knowledge
... people from monocular video sequences in complex, real- world environments is an important and difficult problem, going beyond simple tracking, whose sat- isfactory solution demands an appropriate balance between ... See full document
33
Multimodal Comparable Corpora as Resources for Extracting Parallel Data: Parallel Phrases Extraction
... for parallel data at the sentence level (Zhao and Vogel, 2002; Utiyama and Isa- hara, 2003; Munteanu and Marcu, 2005; Abdul- Rauf and Schwenk, ...considerably, from noisy parallel ... See full document
7
Automatic Extraction of Parallel Speech Corpora from Dubbed Movies
... the extraction of a speech parallel corpus based on any language pair from dubbed ...pairs from movies, we only need raw movie data, and do not require any ... See full document
5
Automatic extraction of faults and fractal analysis from remote sensing data
... Radar data are one of the sources for generating DEMs and hence are the basic input in the process of fault ...layers from the DEM with the slope and aspect ... See full document
8
SYSTRAN Participation to the WMT2018 Shared Task on Parallel Corpus Filtering
... Similar to (Legrand et al., 2016) our model ex- tracts context information from source and target sentences and then computes simple dot-products to estimate word alignments. The objective func- tion is computed ... See full document
5
Paraphrase Fragment Extraction from Monolingual Comparable Corpora
... provides a way to pay people small amounts of money to perform tasks that are simple for humans but difficult for computers. Examples of these Hu- man Intelligence Tasks (or HITs) range from label- ing images to ... See full document
9
Bootstrapping Generators from Noisy Data
... generators from indepen- dently edited data and text ...biographies from Wikipedia infoboxes, while Wiseman et ...documents from a database of basketball games where the input is always the ... See full document
12
Surface Reconstruction from Noisy and Sparse Data
... filtering noisy point clouds, specifically those con- structed from merged depth maps as obtained from a range scanner or multiple view stereo (MVS), applying techniques that have previously been ... See full document
110
Research on Runtime Environment of Spacecraft Testing System
... in parallel pattern, this paper presented the runtime environment architecture of testing system based on ...between parallel test tasks, this paper proposed a collaborative strategy between parallel ... See full document
7
Automatic Discovery of Non Compositional Compounds in Parallel Data
... The NCCs proposed for the V objective function were much more likely to be validated than those proposed for I, because the predictive value func- tion v ~ is much easier to estimate a p[r] ... See full document
12
Formality Style Transfer for Noisy, User generated Conversations: Extracting Labeled, Parallel Data from Unlabeled Corpora
... pairs from an unlabeled cor- pus by using an auxiliary ...for noisy/user-generated text, which often lack datasets of matching vocab- ulary and ...scripts from 5 TV-shows, all TV-shows together, and ... See full document
6
Named Entity Extraction from Noisy Input: Speech and OCR
... We explore the effects of word error rate from ASR and OCR, performance as a function of the amount of training data, and for speech, the effect of out-of-vocabulary errors and the loss [r] ... See full document
9
Related subjects