[PDF] Top 20 Automated Alignment in Multilingual Corpora
Has 10000 "Automated Alignment in Multilingual Corpora" found on our website. Below are the top 20 most common "Automated Alignment in Multilingual Corpora".
Nominal Expressions in Multilingual Corpora: Definites and Demonstratives
... In order to avoid disagreement related to the use of the annotation tool, we would also like to propose improvements to the MMAX interface, especially by analyzing the user feedback we collected from the annotators. ... See full document
8
Language comparison through sparse multilingual word alignment
... Parallel corpora have received a lot of attention since the advent of statistical machine translation (Brown et ...text corpora which contain texts in three or more languages (OPUS 9 , Europarl 10 , ... See full document
9
CATS: A Tool for Customized Alignment of Text Simplification Corpora
... parallel corpora consisting of original sentences and their manually simplified counterparts are very scarce and small in size, which impedes building supervised automated TS systems with sufficient ... See full document
9
Unit 5 Multilingual corpora pdf
... comparable corpora are supposed to be useful for different purposes ...parallel corpora is ...automatic alignment of parallel corpora is not a trivial task for some language pairs ... See full document
5
Word Order Typology through Multilingual Word Alignment
... (McDonald et al., 2013) version 2 contains sub- corpora in French, Italian, Spanish and Brazilian Portuguese. In all of these, noun-adjective or- der is dominant, which casts further doubts on our result. The key ... See full document
7
Parallel Global Voices: a Collection of Multilingual Corpora with Citizen Media Stories
... Most importantly for generating the parallel resources, at the exporting stage, we parsed the links of each document and stored information about its translation counterparts, thus creating a set of document pairs. We ... See full document
6
Using Parallel Corpora to enrich Multilingual Lexical Resources
... The alignment between pairs of translated documents is used to map words from each language into a single bilingual vector space, a technique first developed for CLIR (Dumais et ... See full document
6
Sampling-based Multilingual Alignment
... 3.1 Introducing alingual corpora As stated previously, one of the main advantage of the method is that it can align any number of lan- guages simultaneously. Fig. 1 shows examples in three languages. More ... See full document
5
Named Entity Transliteration and Discovery from Multilingual Comparable Corpora
... While generative models are often robust, they tend to make independence assumptions that do not hold in data. The discriminative learning framework argued for in (Roth, 1998; Roth, 1999) as an alter- native to ... See full document
7
Uzbek English and Turkish English Morpheme Alignment Corpora
... In MT modelling, our linguistic-rule-based alignment annotation may not always be the ideal approach for all MT models. To satisfy different MT model preferences, users can modify current annotation data by ... See full document
6
GlobalTrait: Personality Alignment of Multilingual Word Embeddings
... the multilingual setting, which makes it essential to use the relatively larger size of data in English to help rec- ognize traits in other ...a multilingual model (Liu, Perez, and Nowson 2016; Siddique and ... See full document
8
Semantic Indexing of Multilingual Corpora and its Application on the History Domain
... One of the most straightforward applications from the semantic indexing of corpora is cross-lingual text retrieval. The task of cross-lingual text retrieval consists of, given an input query, retrieving the texts ... See full document
8
Building Open Javanese and Sundanese Corpora for Multilingual Text to Speech
... resulting corpora under a liberal license on OpenSLR 1 . We collaborated with two universities in Indonesia to con- duct data collections locally. For Javanese we worked with Universitas Gadjah Mada (UGM), Faculty ... See full document
5
Sentiment Analysis on Monolingual, Multilingual and Code Switching Twitter Corpora
... A. Balahur, M. Turchi, R. Steinberger, J. M. Perea- Ortega, G. Jacquet, D. Kucuk, V. Zavarella, and A. E. Ghali. 2014. Resource Creation and Evalu- ation for Multilingual Sentiment Analysis in Social Media Texts. ... See full document
7
Learning Common Grammar from Multilingual Corpus
... We assume hidden commonalities in syntax across languages, and try to extract a common grammar from non-parallel multilingual corpora. For this purpose, we propose a generative model for multilingual ... See full document
5
Improved Unsupervised Sentence Alignment for Symmetrical and Asymmetrical Parallel Corpora
... model-optimal alignment composed of the smallest possible correspondences, namely 1-to- 0/0-to-1 and 1-to-1, and then merges those cor- respondences into larger ...on corpora containing large blocs of ... See full document
9
Latent Domain Word Alignment for Heterogeneous Corpora
... In the following experiments, we use three hetero- geneous English-Spanish corpora consisting of 1M , 2M and 4M sentence pairs respectively. These cor- pora combine two parts. The first part respectively 0.7M, ... See full document
11
MULTEXT: Multilingual Text Tools and Corpora
... MULTEXT Multilingual Text Tools and Corpora M U L T E X T M u l t i l i n g u a l T e x t T o o l s and Corpora N a n c y I d e a n d J e a n V 6 r o n i s LABORATOIRE PAROLE ET L A N G A G E CNRS & U[.] ... See full document
5
Related subjects