Top 20 Data Cleaning for Word Alignment
Data Cleaning for Word Alignment
... its word alignment. Secondly, the aforementioned phrase alignment (Marcu and Wong, 02) considers the n : m mapping directly bilingually generated by some concepts without word ...
9
Word Alignment Based Parallel Corpora Evaluation and Cleaning Using Machine Learning Techniques
... Corpus cleaning in practice has often been limited to applying a set of handwritten rules (regular expressions) to detect blatantly obvious cases where two sentences are not parallel (Rueppel et ...corpora ...
8
Word Alignment with Synonym Regularization
... 100k data sets in ...the word alignment model, the synonym information incorporated in the synonym pair model is used directly for training word alignment ...the word ...
5
Hierarchical Search for Word Alignment
... Automatic word alignment is generally accepted as a first step in training any statistical machine translation ...Generative alignment models like IBM Model-4 (Brown et ...training data to ...
10
Alignment Model Adaptation for Domain Specific Word Alignment
... Using these parameters, we build two adaptation models and a translation dictionary on the training data, which are applied to the testing set. The evaluation results on our testing set are shown in Table 1. From ...
8
Are ACT’s Scores Increasing with Better Translation Quality?
... training data and validated by ...the alignment information is irrelevant (not equal to a connective), it then compares the word position (word index) of the source connective ...
6
Confidence Measure for Word Alignment
... training data are bilingual sentence pairs with word alignment, from which we obtained phrase translation ...MaxEnt word alignment as well as the alignment with ...
9
To Wash Your Body, or Purify Your Soul: Physical Cleansing Would Strengthen the Sense of High Moral Character
... the data set. The percentage of the data categorize as outliers do not exceed ...The data from a participant is not included in the analysis because the participant shows high error rates (21%) in ...
6
Improving Word Alignment using Word Similarity
... for word similarity probabilities, but an automatic method would be ...the word similarity model, which can automatically be trained from monolingual data, and then consider a more practical variant, ...
6
On Complex Word Alignment Configurations
... annotating word alignments is a non-trivial task, different sets of alignment guidelines have been ...en-de data is aligned following the style guide of the Blinker Project, specified in Melamed ...
8
Improving Word Alignment of Rare Words with Word Embeddings
... Bilingual word embedding models like (Zou et ...of data is required, which is not available in low-resource language ...monolingual data is more ...
7
Beyond Parallel Data: Joint Word Alignment and Decipherment Improves Machine Translation
... During learning, we run Model 1 without decipherment for 5 iterations. Then we perform joint word alignment and decipherment for another 5 iterations with Model 1 and 5 iterations with HMM. We tune ...
9
Discriminative Word Alignment with a Function Word Reordering Model
... In this paper, we introduce a new approach to improving the modeling of reordering in alignment. Instead of relying on monolingual parses, we condition our reordering model on the behavior of function ...
11
Word Alignment Combination over Multiple Word Segmentation
... training data on all ...combined alignment (C+P+I, and then projected onto C, P, I ...of word alignment on all segmentations by our proposed word alignment ...
5
Word Order Typology through Multilingual Word Alignment
... of word order using multilingual word alignment and high-precision annotation transfer in a corpus with 1144 translations in 986 languages of the New ...ent word order features. Beyond ...
7
Multi Word Expression Sensitive Word Alignment
... Table 4 shows the results where ‘baseline’ indicates no BMWE grouping nor prior, and ‘baseline2’ represents a BMWE grouping but without the prior. Although ‘baseline2’ (BMWE grouping) shows a drop in performance in ...
9
Large scale Word Alignment Using Soft Dependency Cohesion Constraints
... the word alignment quality of generative models is still far from satisfactory for SMT ...discriminative alignment models incorporating linguistically motivated features have become increasingly ...
10
Improving Word Alignment Using Linguistic Code Switching Data
... LCS data, or how to predict when an utterance will switch to another language (Chan et ...single word which uses only the English alphabet; approaches based only on the character set cannot tell these words ...
9
Joint Prediction of Word Alignment with Alignment Types
... The alignment problem is viewed as a search problem over a log-linear space with features (sub-models) coming from the IBM Model ...semi-supervised word alignment technique that integrates ...
14
Word-Transliteration Alignment
... to data sparseness (even if we use a longer list of names for training, the problem still ...the word “Michelangelo” and the transliteration “ ” in Example (11): ...
16