aligned parallel

Statistical Machine Translation with Word and Sentence Aligned Parallel Corpora

... The parameters of statistical translation models are typically estimated from sentence-aligned parallel corpora. We show that significant improvements in the alignment and translation quality of such mod- ...

8

Parsing Aligned Parallel Corpus by Projecting Syntactic Relations from Annotated Source Corpus

... approaches either use examples from the same language, e.g., (Bod et al., 2003; Streiter, 2002), or they try to imitate the parse of a given sentence using the parse of the corresponding sentence in some other language ...

8

Automatic word alignment tools to scale production of manually aligned parallel texts

... We only have preliminary data to report for the Arabic-English experiment. We ran GIZA++ on 164,984 Arabic tokens (corresponding to 201,031 English tokens) of Arabic-English parallel treebank text. The genre ...

5

The JRC-Acquis: A Multilingual Aligned Parallel Corpus with 20+ Languages

... The main interest of the JRC in exploiting this highly multilingual parallel corpus stems from the fact that it includes the ‘new’ EU languages. For some of these, only few linguistic resources are available. In ...

6

Bilingual Word Embeddings from Parallel and Non parallel Corpora for Cross Language Text Classification

... In many languages, sparse availability of resources causes numerous challenges for textual analysis tasks. Text classification is one of such standard tasks that is hindered due to limited availability of label ...

11

DOMCAT: A Bilingual Concordancer for Domain Specific Computer Assisted Translation

... Bai et al. (2009) proposed a normalized frequency criterion to extract translation equivalents from sentence aligned parallel corpus. This criterion takes lexical-level contexture effect into account, so it ...

6

Mechanisms of perception of laid lines in Japanese paper

... In the manufacture of Japanese paper, the device that corresponds to a wire for dehydrating a fiber suspension to form a web in modern papermaking is called a screen, or su, in Japanese. It consists of bamboo or ...

8

Distant annotation of Chinese tense and modality

... cur in the past while unbounded events tend to occur in the present (e.g., “believe, know, like”), a generalization first articulated in (Smith and Erbaugh, 2005). Maintaining consistency among annotators when ...

9

Evolution of Microstructure in a Cu Cr in situ Composite Produced by Thermo Mechanical Processing

... grains parallel to the drawing axis, as presented in Figure ...fibres aligned parallel to the drawing axis, with fibre aspect ratio increasing with increasing ...

8

Bilingual Word Embeddings from Non Parallel Document Aligned Data Applied to Bilingual Lexicon Induction

... sentence-aligned parallel data and/or readily-available translation dictionaries to induce bilingual word embeddings (BWEs) that are consistent and closely aligned over languages in the same se- ...

7

On the orientation of plane tensegrity cytoskeletons under biaxial substrate stretching

... always aligned parallel with the maximum stretch direction, which corresponds to the unique global minimum of the total potential energy, independently of the stretch or prestrain ...

14

Syntactic Constraints on Phrase Extraction for Phrase Based Machine Translation

... A typical phrase-based machine translation (PBMT) system uses phrase pairs extracted from word-aligned parallel corpora. All phrase pairs that are consistent with word alignments are collected. The ...

6

Discovery of Discourse Related Language Contrasts through Alignment Discrepancies in English German Translation

... The data source in our analysis is a sentence- and word-aligned parallel corpus, the same type of resource that is typically used for training MT systems. We begin by defining a set of surface patterns ...

9

Automatic Creation of a Sentence Aligned Sinhala Tamil Parallel Corpus

... This paper presents the first ever study on automatically creating a sentence aligned parallel corpus for Sinhala and Tamil. Sinhala and Tamil are both under-resourced languages, and research implementing ...

9

Handling Named Entities and Compound Verbs in Phrase Based Statistical Machine Translation

... and aligned as parts of consecutive phrases, since PB-SMT (or any other approaches to SMT) does not generally treat MWEs as special ...being aligned one-to-one; the alignments of the words inside source ...

9

Building a Hierarchically Aligned Chinese English Parallel Treebank

... Hierarchically Aligned Chinese-English Parallel Treebank (HACEPT) where we manually do word-level and phrase-level alignments simultaneously on parallel phrase-based parse ...Chinese-English ...

10

Using Manual and Parallel Aligned Corpora for Machine Translation Services within an On line Content Management System

... Manually Aligned Small Corpora vs. Automatically Aligned Large Corpora We decided to make selective experiments on corpora involving following language pairs: English, Romanian and ...

6

Developing Parallel Sense tagged Corpora with Wordnets

... tagged parallel corpora develop ...(i) parallel corpora: FuSe (Cyrus, 2006), SMULTRON (Volk et ...English parallel corpus (Padó and Erk, 2010), Europarl corpus (Koehn, 2005), and OPUS ...

10

Using Parallel Texts and Lexicons for Verbal Word Sense Disambiguation

... a parallel treebank and manually created bilingual valency lexicon described herein significantly outperformed the previous results, where only monolingual data and features have been ...

9

An Aligned French Chinese corpus of 10K segments from university educational material

... We have presented a bilingual parallel corpus of nearly 10K aligned French-Chinese segments, produced over three years in the course of the MACAU project. This corpus is released under a free license ...

5
