Top 20 documents for "The RWTH Aachen University Filtering System for the WMT 2018 Parallel Corpus Filtering Task"

Our website has 10,000 documents matching "The RWTH Aachen University Filtering System for the WMT 2018 Parallel Corpus Filtering Task". The top 20 are listed below.

The RWTH Aachen University Filtering System for the WMT 2018 Parallel Corpus Filtering Task

... ParaCrawl corpus down to an amount that can be handled by stronger, computationally more complex, ...tered corpus. Although a big part of the corpus is removed (58M sentences or 60% of the original ...

The RWTH Aachen University English-German and German-English Unsupervised Neural Machine Translation Systems for WMT 2018

... The RWTH Aachen University has participated in the WMT 2018 German → English and English → German unsupervised news translation ...

The RWTH Aachen University Supervised Machine Translation Systems for WMT 2018

... at RWTH Aachen University for the German → English, English → Turkish and Chinese → English translation tasks of the EMNLP 2018 Third Conference on Machine Translation (WMT ...English ...

Noisy Parallel Corpus Filtering through Projected Word Embeddings

... the WMT 2019 parallel corpus filtering shared task is to select the 5 million words of parallel sentences producing the highest-quality machine translation system, given a set ...

NRC Parallel Corpus Filtering System for WMT 2019

... shared task on parallel corpus filtering was essentially the same as last year’s edition (Koehn et ...noisy corpus crawled from the web using ParaCrawl (Koehn et ...of parallel ...

Findings of the WMT 2019 Shared Task on Parallel Corpus Filtering for Low Resource Conditions

... translation system quality is computationally intractable due to the high cost of training these systems to evaluate different weight ...high-quality parallel corpora, while low-quality sentence pairs are ...

The RWTH Aachen University English-German and German-English Machine Translation System for WMT 2017

... German WMT 2017 evaluation ...given parallel data, back-translated synthetic data, two LSTM layers in the ...rapid corpus has been filtered to remove the most unlikely ...JTR system, which ...

The University of Helsinki Submission to the WMT19 Parallel Corpus Filtering Task

... the corpus filtering task organizers decided to pose the problem under more challenging conditions by focusing on low-resource scenarios, as opposed to previous year German–English (Koehn et ...

NICT’s Corpus Filtering Systems for the WMT18 Parallel Corpus Filtering Task

... NMT has shown large gains in quality over Statistical machine translation (SMT) and set several new benchmarks (Bojar et al., 2017). However, NMT is much more sensitive to domain (Wang et al., 2017) and noise ...

UTFPR at WMT 2018: Minimalistic Supervised Corpora Filtering for Machine Translation

... In this contribution, we presented the UTFPR systems submitted to the WMT 2018 parallel corpus filtering task. Our supervised systems discern between good and bad ...

The RWTH Aachen Machine Translation System for WMT 2012

... We trained phrase-based translation systems for French→English and hierarchical phrase-based translation systems for English→French. Corpus statistics for the French-English parallel data are given in Table ...

Alibaba Submission to the WMT18 Parallel Corpus Filtering Task

... The parallel corpus is an essential resource for machine translation and multilingual natural language ...of parallel corpus is also very important in MT system training (Koehn and ...

An Unsupervised System for Parallel Corpus Filtering

... the WMT 2018 Parallel Corpus Filtering shared task which addresses the problem of cleaning noisy parallel ...The task of mining and cleaning parallel sentences ...

The RWTH Aachen University Machine Translation Systems for WMT 2019

... the RWTH Aachen University’s submission to the WMT 2019 news translation ...data filtering, preprocessing and synthetic data creation were ...De→En system performs on par with our ...

STACC, OOV Density and N-gram Saturation: Vicomtech’s Participation in the WMT 2018 Shared Task on Parallel Corpus Filtering

... The task of cleaning noisy data from parallel corpora has been tackled by various researchers over the ...of corpus creation from web data, to filter dubious sentence ...tered corpus. In ...

Learning Bilingual Sentence Embeddings via Autoencoding and Computing Similarities with a Multilayer Perceptron

... the WMT 2018 parallel corpus filtering task (Koehn et ...al., 2018). Data The task is to score each line of a very noisy, web-crawled corpus of 104M ...

The ILSP/ARC submission to the WMT 2018 Parallel Corpus Filtering Shared Task

... NMT system trained over the 100M ...SMT system shows an increase of 15.2%, while the NMT system shows a huge increase of ...NMT system trained on the 10M corpus is lower than that of ...

Findings of the WMT 2018 Shared Task on Parallel Corpus Filtering

... translation system quality is computationally intractable due to the high cost of training these systems to evaluate different weight ...quality parallel corpora, while bad sentence pairs are either ...

MAJE Submission to the WMT2018 Shared Task on Parallel Corpus Filtering

... shared task have to submit a file with quality scores, one per line, corresponding to the sentence pairs on the 1 billion word German-English Paracrawl ...al., 2018) MT systems with these corpora, and ...

Prompsit’s submission to WMT 2018 Parallel Corpus Filtering shared task

... the WMT 2018 parallel corpus filtering shared ...crafted filtering rules and an automatic classifier that selects those sentences that are mutual trans- ...performing system ...
