• No results found

[PDF] Top 20 Nonparametric Word Segmentation for Machine Translation

Has 10000 "Nonparametric Word Segmentation for Machine Translation" found on our website. Below are the top 20 most common "Nonparametric Word Segmentation for Machine Translation".

Nonparametric Word Segmentation for Machine Translation

Nonparametric Word Segmentation for Machine Translation

... The problem of segmentation for machine trans- lation has been studied extensively in recent lit- erature. Most of the work used some linguistic knowledge about the source and the target lan- guages (Nießen ... See full document

9

A Character level Decoder without Explicit Segmentation for Neural Machine Translation

A Character level Decoder without Explicit Segmentation for Neural Machine Translation

... chine translation system can directly handle trans- lation at the level of characters without any word ...neural machine translation to translate at the level of characters, and that it actu- ... See full document

11

Refining Word Segmentation Using a Manually Aligned Corpus for Statistical Machine Translation

Refining Word Segmentation Using a Manually Aligned Corpus for Statistical Machine Translation

... into word aligned text to help improve automatic WA and translation ...imum translation units and translation relations; tagging adds contextual, syntactic and language- specific features to ... See full document

11

Morphology aware Word Segmentation in Dialectal Arabic Adaptation of Neural Machine Translation

Morphology aware Word Segmentation in Dialectal Arabic Adaptation of Neural Machine Translation

... other segmentation techniques like BPE and ...neural machine translation framework which sup- ports sentence piece tokenization with its two vari- ant BPE and SR (unigram language model) as well as ... See full document

7

Adapting Chinese Word Segmentation for Machine Translation Based on Short Units

Adapting Chinese Word Segmentation for Machine Translation Based on Short Units

... the segmentation principle (3) (Section ...“known word + unlexicalized item” sequences and extracted the unlexicalized items with the frequency above ... See full document

7

Character Cluster Based Segmentation using Monolingual and Bilingual Information for Statistical Machine Translation

Character Cluster Based Segmentation using Monolingual and Bilingual Information for Statistical Machine Translation

... novel segmentation approach for Phrase-Based Statistical Machine Translation (PB-SMT) to languages where word boundaries are not obviously marked by using both monolingual and bilingual ... See full document

8

Challenging Language Dependent Segmentation for Arabic: An Application to Machine Translation and Part of Speech Tagging

Challenging Language Dependent Segmentation for Arabic: An Application to Machine Translation and Part of Speech Tagging

... Arabic word segmentation has shown to signifi- cantly improve output quality in NLP tasks such as machine translation (Habash and Sadat, 2006; Almahairi et ...cal segmentation in the ... See full document

7

Pseudo Word for Phrase Based Machine Translation

Pseudo Word for Phrase Based Machine Translation

... questioning word as basic translational unit is to directly question word segmentation on languages where word bounda- ries are not orthographically ...to-English translation task where ... See full document

9

Chinese Unknown Word Translation by Subword Re segmentation

Chinese Unknown Word Translation by Subword Re segmentation

... phrase-based translation has led to great progress in statistical machine translation ...phrase translation ta- ...a translation table con- sisting of source phrases, target phrases, ... See full document

8

Grammar checker features in modern Tamil natural language processing

Grammar checker features in modern Tamil natural language processing

... Character analyzer splits words, sentences, paragraphs and files into individual characters and calculates the number of characters. Characters splitting methods using string data type is giving wrong characters ... See full document

5

Can Word Segmentation be Considered Harmful for Statistical Machine Translation Tasks between Japanese and Chinese?

Can Word Segmentation be Considered Harmful for Statistical Machine Translation Tasks between Japanese and Chinese?

... Phrase translation tables gen- erated with our method are not limited to words, but also contain phrases, fragments and short sentences that may not be included in the EDR bilingual lexi- ... See full document

10

Bilingually Motivated Domain Adapted Word Segmentation for Statistical Machine Translation

Bilingually Motivated Domain Adapted Word Segmentation for Statistical Machine Translation

... of word segmentation in SMT and showed that the segmentation proposed by word alignments can be used in SMT to achieve competitive results com- pared to using monolingual ...use word ... See full document

9

Correction of OCR Word Segmentation Errors in Articles from the ACL Collection through Neural Machine Translation Methods

Correction of OCR Word Segmentation Errors in Articles from the ACL Collection through Neural Machine Translation Methods

... for machine translation have proved to be very ...a machine translation inpired approach – N-best list ranking using neural sequence labelling models – for grammatical error ...correction. ... See full document

6

Empirical Study of Unsupervised Chinese Word Segmentation Methods for SMT on Large scale Corpora

Empirical Study of Unsupervised Chinese Word Segmentation Methods for SMT on Large scale Corpora

... The first bilingual corpus: OpenMT06 was used in the NIST open machine translation 2006 Eval- uation 2 . We removed the United Nations cor- pus and the traditional Chinese data sets from the constraint ... See full document

7

Target side Word Segmentation Strategies for Neural Machine Translation

Target side Word Segmentation Strategies for Neural Machine Translation

... It is important to note that the amount of dis- tinct target symbols in the setups ranges between 43K-46K; 50K for top-50K-voc systems. There are no massive vocabulary size differences. We always apply 50K BPE ... See full document

12

Using a maximum entropy model to build segmentation lattices for MT

Using a maximum entropy model to build segmentation lattices for MT

... to machine translation systems for a variety of tasks, including translating automatic speech recog- nition transcriptions and translating from morpho- logically complex languages (Bertoldi et ...the ... See full document

9

Nonparametric Model for Inupiaq Word Segmentation

Nonparametric Model for Inupiaq Word Segmentation

... monolingual nonparametric word ...bilingual word segmentation do not benefit when two languages are ...the nonparametric monolingual word segmentation on F, BF and average ... See full document

8

Bayesian Semi Supervised Chinese Word Segmentation for Statistical Machine Translation

Bayesian Semi Supervised Chinese Word Segmentation for Statistical Machine Translation

... Our MT system uses a phrase-based decoder and the log-linear model described in (Zens and Ney, 2004). Features in the log-linear model in- clude translation models in two directions, a lan- guage model, a ... See full document

8

Unsupervised Word Segmentation Improves Dialectal Arabic to English Machine Translation

Unsupervised Word Segmentation Improves Dialectal Arabic to English Machine Translation

... The inconsistency in the orthographic spelling of the same word can increase data sparseness. Thus, we normalize the Arabic text in the collected re- sources by applying the reduced orthographic nor- malization ... See full document

10

Target Foresight Based Attention for Neural Machine Translation

Target Foresight Based Attention for Neural Machine Translation

... icantly better than baseline, but Model2 is significantly better with p<0.05 and Model3 is significantly better with p<0.01. Given that simply introducing an additional layer (“+2- Layer”) does not produce any ... See full document

11

Show all 10000 documents...