• No results found

[PDF] Top 20 Bayesian Semi Supervised Chinese Word Segmentation for Statistical Machine Translation

Has 10000 "Bayesian Semi Supervised Chinese Word Segmentation for Statistical Machine Translation" found on our website. Below are the top 20 most common "Bayesian Semi Supervised Chinese Word Segmentation for Statistical Machine Translation".

Bayesian Semi Supervised Chinese Word Segmentation for Statistical Machine Translation

Bayesian Semi Supervised Chinese Word Segmentation for Statistical Machine Translation

... 2005), word segmentations are integrated into MT sys- tems during model training and ...a Bayesian semi- supervised CWS approach motivated by (Goldwa- ter et ...a word model and two ... See full document

8

Adapting Chinese Word Segmentation for Machine Translation Based on Short Units

Adapting Chinese Word Segmentation for Machine Translation Based on Short Units

... In Chinese texts, words composed of single or multiple characters are not separated by spaces, unlike most western ...Therefore Chinese word segmentation is considered an important first step ... See full document

7

An Empirical Study Of Semi Supervised Chinese Word Segmentation Using Co Training

An Empirical Study Of Semi Supervised Chinese Word Segmentation Using Co Training

... the word-based ...The word-based segmenter implemented in this work is less power- ful, and it needs a good dictionary to achieve good ...the word-based model suffers a lot when the train- ing data ... See full document

10

Co regularizing character based and word based models for semi supervised Chinese word segmentation

Co regularizing character based and word based models for semi supervised Chinese word segmentation

... and word bound- aries is very hard (Jiao et ...enhance supervised CWS models, in semi- supervised ...few semi-supervised CWS models have been pro- ...a Bayesian ... See full document

6

Bilingually Motivated Domain Adapted Word Segmentation for Statistical Machine Translation

Bilingually Motivated Domain Adapted Word Segmentation for Statistical Machine Translation

... for Chinese or kana for ...existing statistical word aligner to obtain a set of candidate ...to word alignment (Melamed, 2000). We then mod- ify the segmentation of the respective ... See full document

9

A Semi Supervised Batch Mode Active Learning Strategy for Improved Statistical Machine Translation

A Semi Supervised Batch Mode Active Learning Strategy for Improved Statistical Machine Translation

... target word, we look up the corresponding source phrase that produced it, and use this information to compute a number of features from the translation phrase table and target language model ...target ... See full document

9

Nonparametric Word Segmentation for Machine Translation

Nonparametric Word Segmentation for Machine Translation

... In statistical machine translation, the smallest unit is usually the word, defined as a token delimited by ...a word alignment, then extracts phrase pairs from this word ...(e.g., ... See full document

9

Empirical Study of Unsupervised Chinese Word Segmentation Methods for SMT on Large scale Corpora

Empirical Study of Unsupervised Chinese Word Segmentation Methods for SMT on Large scale Corpora

... Unsupervised word segmentation (UWS) can provide domain-adaptive segmenta- tion for statistical machine translation (SMT) without annotated data, and bilin- gual UWS can even optimize ... See full document

7

Can Word Segmentation be Considered Harmful for Statistical Machine Translation Tasks between Japanese and Chinese?

Can Word Segmentation be Considered Harmful for Statistical Machine Translation Tasks between Japanese and Chinese?

... as Chinese, Japanese, Korean, Thai, Lao and Viet- namese. Therefore, word segmentation for such languages is usually the first important step in most Natural Language Processing (NLP) applica- tions ... See full document

10

Semi Supervised Chinese Word Segmentation Using Partial Label Learning With Conditional Random Fields

Semi Supervised Chinese Word Segmentation Using Partial Label Learning With Conditional Random Fields

... In this research we employ a sausage constraint to encode the knowledge for Chinese word seg- mentation. However, a sausage constraint does not reflect the legal label sequence. For exam- ple, in Figure 1 ... See full document

9

Refining Word Segmentation Using a Manually Aligned Corpus for Statistical Machine Translation

Refining Word Segmentation Using a Manually Aligned Corpus for Statistical Machine Translation

... explicit word de- limiters often have to be segmented for sta- tistical machine translation ...the word segmentation (WS) schemes of these annotated corpora are handcrafted for general ... See full document

11

Learning New Semi Supervised Deep Auto encoder Features for Statistical Machine Translation

Learning New Semi Supervised Deep Auto encoder Features for Statistical Machine Translation

... improved translation quality of n-gram translation model by using a bilingual neural LM, where transla- tion probabilities are estimated using a continu- ous representation of translation units in ... See full document

11

Graph based Semi Supervised Model for Joint Chinese Word Segmentation and Part of Speech Tagging

Graph based Semi Supervised Model for Joint Chinese Word Segmentation and Part of Speech Tagging

... A statistical analysis of the segmentation and tag- ging results of the supervised joint model (Base- line II) and our model is carried out to comprehend the influence of the graph-based ... See full document

10

Semi supervised Chinese Word Segmentation based on Bilingual Information

Semi supervised Chinese Word Segmentation based on Bilingual Information

... bilingual semi- supervised Chinese word segmentation (CWS) method that leverages the nat- ural segmenting information of English ...phrase-based translation sub-model to s- core ... See full document

10

Nonparametric Bayesian Semi supervised Word Segmentation

Nonparametric Bayesian Semi supervised Word Segmentation

... For Chinese, we first used a standard dataset from the SIGHAN Bakeoff 2005 (Emerson, 2005) for the labeled and test data, and Chinese gi- gaword version 2 (LDC2009T14) for the unlabeled ...simplified ... See full document

12

Chinese Unknown Word Translation by Subword Re segmentation

Chinese Unknown Word Translation by Subword Re segmentation

... phrase-based translation has led to great progress in statistical machine translation ...phrase translation ta- ...a translation table con- sisting of source phrases, target ... See full document

8

Character Cluster Based Segmentation using Monolingual and Bilingual Information for Statistical Machine Translation

Character Cluster Based Segmentation using Monolingual and Bilingual Information for Statistical Machine Translation

... Due to the issue mentioned in section 2.1, we apply character clustering (CC) technique on target text in order to reduce the search space. After performing CC, it will yield several character clusters 𝑇which can be ... See full document

8

HMM Revises Low Marginal Probability by CRF for Chinese Word Segmentation

HMM Revises Low Marginal Probability by CRF for Chinese Word Segmentation

... a Chinese word segmentation system for CIPS-SIGHAN 2010 Chinese language processing ...(OOV) word, but ex- ternal information of word ...of word information to in- crease ... See full document

5

Improving Statistical MT through Morphological Analysis

Improving Statistical MT through Morphological Analysis

... of translation. As expected, full lemmatization performed better than word-to- word translation, with an an improvement of about ...improves translation quality by reducing data ... See full document

8

Semi Supervised Neural Machine Translation with Language Models

Semi Supervised Neural Machine Translation with Language Models

... Artetxe et al. (2017) goes further and use adversarial loss to train their translation system. They build a single shared encoder and a single shared decoder, using both denoising autoen- coder loss and ... See full document

8

Show all 10000 documents...