• No results found

monolingual data

Improving Translation Model by Monolingual Data

Improving Translation Model by Monolingual Data

... lel data (500 thousand to 5 million sentences) and its effect on reverse self-training ...of monolingual data was always 5 million ...test data word forms covered by the training ...parallel ...

7

Graph based Semi Supervised Learning of Translation Models from Monolingual Data

Graph based Semi Supervised Learning of Translation Models from Monolingual Data

... Statistical phrase-based translation learns translation rules from bilingual corpora, and has traditionally only used monolin- gual evidence to construct features that rescore existing translation candidates. In this ...

11

Exploiting Monolingual Data at Scale for Neural Machine Translation

Exploiting Monolingual Data at Scale for Neural Machine Translation

... of data are then merged together to get the bilingual ...clean data and 18M Paracrawl data, which are denot- ed as WMT and WMTPC respectively for ease of ...The monolingual data we use ...

10

Exploiting Source side Monolingual Data in Neural Machine Translation

Exploiting Source side Monolingual Data in Neural Machine Translation

... Dong et al. (2015) propose a multi-task learn- ing method for translating one source language into multiple target languages in NMT so that the en- coder network can be shared when dealing with sev- eral sets of ...

11

Utilizing Monolingual Data in NMT for Similar Languages: Submission to Similar Language Translation Task

Utilizing Monolingual Data in NMT for Similar Languages: Submission to Similar Language Translation Task

... utilize monolingual data in addition to parallel data be- cause the provided parallel data is very small in ...lized monolingual data of both languages and also trained an NMT ...

5

Language Modeling for Code Switching: Evaluation, Integration of Monolingual Data, and Discriminative Training

Language Modeling for Code Switching: Evaluation, Integration of Monolingual Data, and Discriminative Training

... Our contributions in this work are four-fold: (a) We propose a new, vocabulary-size inde- pendent evaluation scheme for LM in general, motivated by ASR. This evaluation scheme is ranking-based and also suits CS LM; (b) ...

11

Improving Neural Machine Translation Models with Monolingual Data

Improving Neural Machine Translation Models with Monolingual Data

... of monolingual target data into the source language, and treating this synthetic data as additional train- ing ...domain monolingual data, back-translated into the source language, can ...

11

Copied Monolingual Data Improves Low Resource Neural Machine Translation

Copied Monolingual Data Improves Low Resource Neural Machine Translation

... parallel data where monolingual data has the most ...additional monolingual data is used ...lingual data in low-resource settings, which makes sense because it relies on ...

9

Using Monolingual Data in Neural Machine Translation: a Systematic Study

Using Monolingual Data in Neural Machine Translation: a Systematic Study

... target data, which often abounds, is used in these two ...parallel data through back-translation - a tech- nique that fails to fully take advantage of exist- ing ...of monolingual data, as ...

12

Efficient Extraction of Pseudo Parallel Sentences from Raw Monolingual Data Using Word Embeddings

Efficient Extraction of Pseudo Parallel Sentences from Raw Monolingual Data Using Word Embeddings

... in-domain monolingual data, while the baseline method extracted only 121k sentence pairs due presumably to the use of the coverage constraint that might remove source sentences with a high OOV ...

7

Learning a Phrase based Translation Model from Monolingual Data with Application to Domain Adaptation

Learning a Phrase based Translation Model from Monolingual Data with Application to Domain Adaptation

... given data to improve the translation ...domain monolingual data, but it does not take full advantage of the in-domain source-side monolingual ...

10

Investigations on Translation Model Adaptation Using Monolingual Data

Investigations on Translation Model Adaptation Using Monolingual Data

... Unsupervised training is widely used in other ar- eas, in particular large vocabulary speech recogni- tion. The statistical models in speech recognition use a generative approach based on small units, usu- ally ...

10

Syntax is clearer on the other side   Using parallel corpus to extract monolingual data

Syntax is clearer on the other side Using parallel corpus to extract monolingual data

... This paper describes the elaboration of a training corpus containing Hungarian sentences that are labelled according to a syntactic criterion, namely the syntactic role of a very common multifunc- tional word volt ...

8

Iterative Back Translation for Neural Machine Translation

Iterative Back Translation for Neural Machine Translation

... a data-hungry approach, requiring a large amount of parallel data to reach reasonable per- formance (Koehn and Knowles, ...parallel data, ...obtain monolingual data in either the source ...

7

The FLORES Evaluation Datasets for Low Resource Machine Translation: Nepali–English and Sinhala–English

The FLORES Evaluation Datasets for Low Resource Machine Translation: Nepali–English and Sinhala–English

... lel data is available and for which relatively large amounts of monolingual data are freely ...MT. Data and code to reproduce our experiments are available at ...

14

The LMU Munich Unsupervised Machine Translation Systems

The LMU Munich Unsupervised Machine Translation Systems

... parallel data, it needs to make use of on-the-fly ...the monolingual data and this pair of back- translated sample/gold standard sample is used to train the model in a traditional ...

9

CUNI Submissions in WMT18

CUNI Submissions in WMT18

... allel data, or we use all available synthetic ...of monolingual backtranslated data and half of original parallel ...76.5% monolingual data for Estonian and ...

7

Unsupervised Pretraining for Neural Machine Translation Using Elastic Weight Consolidation

Unsupervised Pretraining for Neural Machine Translation Using Elastic Weight Consolidation

... ization. Their approach has two main drawbacks: first, during the fine-tuning phase, they still require the original monolingual data which might not be available anymore in a life-long learning scenario. ...

6

NICT’s Unsupervised Neural and Statistical Machine Translation Systems for the WMT19 News Translation Task

NICT’s Unsupervised Neural and Statistical Machine Translation Systems for the WMT19 News Translation Task

... This paper describes the unsupervised neural (NMT) and statistical machine translation (SMT) systems built for the participation of the National Institute of Information and Communications Technology (NICT) to the WMT19 ...

8

Measuring the behavioral impact of machine translation quality improvements with A/B testing

Measuring the behavioral impact of machine translation quality improvements with A/B testing

... 2,625,162 monolingual French sentences for the training. The monolingual text was parsed and cleaned in the same manner as the aligned ...and monolingual training data, tuning data, and ...

5

Show all 10000 documents...

Related subjects