• No results found

[PDF] Top 20 Building a multilingual parallel corpus for human users

Has 10000 "Building a multilingual parallel corpus for human users" found on our website. Below are the top 20 most common "Building a multilingual parallel corpus for human users".

Building a multilingual parallel corpus for human users

Building a multilingual parallel corpus for human users

... any parallel corpus, and especially one including many languages, requires an opportunistic approach, be- cause a balanced mix of texts is hard to achieve even for written ...primary users are humans ... See full document

6

Building The Sense-Tagged Multilingual Parallel Corpus

Building The Sense-Tagged Multilingual Parallel Corpus

... Sense-annotated parallel corpora play a crucial role in natural language ...a corpus for Asian languages using English as a pivot, which is the first such corpus for these languages (Chinese, ... See full document

7

Building a Parallel Multilingual Corpus (Arabic-Spanish-English)

Building a Parallel Multilingual Corpus (Arabic-Spanish-English)

... multi-lingual parallel corpus (Arabic-Spanish-English) aligned on the sentence level and tagged on the POS ...A multilingual parallel corpus which brings together Arabic, Spanish and ... See full document

6

Building a Multilingual Named Entity-Annotated Corpus Using Annotation Projection

Building a Multilingual Named Entity-Annotated Corpus Using Annotation Projection

... Arabic-Spanish parallel cor- pus aligned at sentence level and POS ...aligned corpus. The work of Volk et al. (2010) on combining parallel treebanks and geo-tagging showed similar results to what we ... See full document

7

Data Collection and IPR in Multilingual Parallel Corpora. Dutch Parallel Corpus

Data Collection and IPR in Multilingual Parallel Corpora. Dutch Parallel Corpus

... Dutch Parallel Corpus (DPC) project has reached an ...finalized corpus is a ten-million-word high-quality sentence-aligned bidirectional parallel corpus of Dutch, English and French, ... See full document

6

Comparing two acquisition systems for automatically building an English—Croatian parallel corpus from multilingual websites

Comparing two acquisition systems for automatically building an English—Croatian parallel corpus from multilingual websites

... as parallel documents by the human ...of parallel documents in the whole sample is ...more parallel documents than non-parallel docu- ...resulting corpus, a similarity ... See full document

7

The AMARA Corpus: Building Parallel Language Resources for the Educational Domain

The AMARA Corpus: Building Parallel Language Resources for the Educational Domain

... AMARA corpus: a new multilingual corpus, covering a wide range of educational ...The corpus was derived from the AMARA platform, where volunteers contribute subtitles and translations for ... See full document

7

A SURVEY REPORT ON THE EXISTING METHODS OF BUILDING A PARALLEL CORPUS

A SURVEY REPORT ON THE EXISTING METHODS OF BUILDING A PARALLEL CORPUS

... English-Chinese parallel corpus that is aligned at document and sentence ...The parallel data is taken from bilingual Websites containing good quality content in the two language from multiple ... See full document

7

Building and Annotating the Linguistically Diverse NTU-MC (NTU-Multilingual Corpus)

Building and Annotating the Linguistically Diverse NTU-MC (NTU-Multilingual Corpus)

... make multilingual resources machine readable for NLP ...the corpus would be to use the parallel texts (English, Malay, Chinese and Tamil) distributed by National Environment Agency of Singapore (NEA) ... See full document

10

Challenges in Building a Multilingual Alpine Heritage Corpus

Challenges in Building a Multilingual Alpine Heritage Corpus

... a multilingual heritage corpus of alpine ...a corpus of 25 million words, 10% of which is a parallel French-German ...this corpus, and we sketch how a large gazetteer of Swiss ... See full document

7

The JRC-Acquis: A Multilingual Aligned Parallel Corpus with 20+ Languages

The JRC-Acquis: A Multilingual Aligned Parallel Corpus with 20+ Languages

... allow users to easily make use of the different sections, we at- tempted to identify these sections using regular expres- sions, and mark them up as body, signature and annex (see Figures 1 and ... See full document

6

A multilingual parallel parsed corpus as gold standard for grammatical inference evaluation

A multilingual parallel parsed corpus as gold standard for grammatical inference evaluation

... Unless indicated otherwise, fulltext items are protected by copyright with all rights reserved. The copyright exception in section 29 of the Copyright, Designs and Patents Act 1988 allows the making of a single copy ... See full document

5

Building the multilingual TUT parallel treebank

Building the multilingual TUT parallel treebank

... a parallel treebank where Italian language is ...to parallel corpora, ...the corpus on which the schema has been ...of Parallel–TUT and briefly summarize the ... See full document

10

Building an Annotated Japanese Chinese Parallel Corpus   A Part of NICT Multilingual Corpora

Building an Annotated Japanese Chinese Parallel Corpus A Part of NICT Multilingual Corpora

... A parallel corpus is a collection of articles, paragraphs, or sentences in two different ...a parallel corpus contains translation correspondences between the source text and its translations ... See full document

6

Building Parallel Monolingual Gan Chinese Dialects Corpus

Building Parallel Monolingual Gan Chinese Dialects Corpus

... dialect corpus and corresponding computational models have been released for Catalan, Russian, Slovene, ...free corpus has been released for the similar, varieties or dialects in Mandarin ...the ... See full document

6

The Multilingual Affective Soccer Corpus (MASC): Compiling a biased parallel corpus on soccer reportage in English, German and Dutch

The Multilingual Affective Soccer Corpus (MASC): Compiling a biased parallel corpus on soccer reportage in English, German and Dutch

... The reports are saved as plain text files in UTF-8 coding in separate folders according to which sub- corpus and category (WIN, LOSS, TIE) they belong to. The metadata for the three main subcorpora is split into ... See full document

5

Language Independent Bilingual Terminology Extraction from a Multilingual Parallel Corpus

Language Independent Bilingual Terminology Extraction from a Multilingual Parallel Corpus

... We presented a bilingual terminology extraction module that starts from sub-sentential alignments in parallel corpora and applied it on three differ- ent parallel corpora that are part of the same auto- ... See full document

9

Multilingual Lexicon Bootstrapping   Improving a Lexicon Induction System Using a Parallel Corpus

Multilingual Lexicon Bootstrapping Improving a Lexicon Induction System Using a Parallel Corpus

... We address the task of improving the qual- ity of lexicon bootstrapping, i.e., of ex- panding a semantic lexicon on a given cor- pus. A main problem of iterative boot- strapping techniques is the fact that lex- icon ... See full document

5

Self Supervised Neural Machine Translation

Self Supervised Neural Machine Translation

... We present a joint architecture to select data and train NMT systems simultaneously using the emerging NMT system itself to select the data. This is a form of self-supervision alternating be- tween two tasks that support ... See full document

7

EuroSense: Automatic Harvesting of Multilingual Sense Annotations from Parallel Text

EuroSense: Automatic Harvesting of Multilingual Sense Annotations from Parallel Text

... Extending sense annotations to multiple languages is a demanding endeavor, especially when man- ual intervention is required. Despite the fact that sense-annotated corpora for a number of languages have been around for ... See full document

7

Show all 10000 documents...