• No results found

word segmentation

Convolutional Neural Network with Word Embeddings for Chinese Word Segmentation

Convolutional Neural Network with Word Embeddings for Chinese Word Segmentation

... Character-based sequence labeling frame- work is flexible and efficient for Chi- nese word segmentation (CWS). Recently, many character-based neural models have been applied to CWS. While they obtain good ...

10

Improving Word Alignment by Adjusting Chinese Word Segmentation

Improving Word Alignment by Adjusting Chinese Word Segmentation

... adjust word segmentation so as to decrease the effect of lexicalization differences to improve word alignment ...Chinese word segmentation according to their translation derived from ...

8

Active Learning for Chinese Word Segmentation

Active Learning for Chinese Word Segmentation

... unknown word may appear many times and a few sentences containing the unknown word may be selected for man- ual annotation at the same time according to the uncertainty ...

10

Word Alignment Combination over Multiple Word Segmentation

Word Alignment Combination over Multiple Word Segmentation

... new word alignment combination approach on language pairs where one language has no explicit word ...combining word alignments of dif- ferent models (Xiang et ...combine word alignments over ...

5

Synthetic Word Parsing Improves Chinese Word Segmentation

Synthetic Word Parsing Improves Chinese Word Segmentation

... grained word segmentation ...Chinese word segmentation performance (especially on pseudo-OOVs) with- out introducing any new feature ...internal word structure ...

6

Which Is Essential for Chinese Word Segmentation: Character versus Word

Which Is Essential for Chinese Word Segmentation: Character versus Word

... In detail, Goh’s method is still a one-fold character-based classification one for Chi- nese word segmentation. However, they adopted FMM/BMM feature in learning except for character-based n-gram features. ...

12

Incorporating Word Attention into Character Based Word Segmentation

Incorporating Word Attention into Character Based Word Segmentation

... to word segmentation, especially Chi- nese, because of the ability to minimize the effort in feature ...or word-based, for utilizing word-level informa- ...lizing word information to ...

11

Nonparametric Model for Inupiaq Word Segmentation

Nonparametric Model for Inupiaq Word Segmentation

... We present how to use English translation for unsupervised word segmentation of low resource languages. The inference uses a dynamic programming algorithm for efficient blocked Gibbs sampling. We apply the ...

8

Chinese Word Segmentation by Classification of Characters

Chinese Word Segmentation by Classification of Characters

... Chinese Word Segmentation Bakeoff [Sproat and Emerson 2003] intended to evaluate the accuracy of different segmenters by standardizing the training and testing ...

16

Word Boundary Decision with CRF for Chinese Word Segmentation

Word Boundary Decision with CRF for Chinese Word Segmentation

... In this work, we analyze the relationship between WBD (Huang et al., 2007) and 4-tag character tagging approach for Chinese word segmentation. There are two main differences between them: One is category ...

7

Thai Word Segmentation Verification Tool

Thai Word Segmentation Verification Tool

... We showed that our new tool, with its new data structures in the form of hash table, worked more rapidly than the previous version, both for open- ing files and for responding to users. Moreover, finding and replacing ...

7

Word Segmentation in the Spoken Dutch Corpus

Word Segmentation in the Spoken Dutch Corpus

... automatic word segmentation (AWS) as a starting point for the manual checking stage (sections 2 and ...verified word segmentation (MWS) start- ing from the ...

6

Unsupervised Word Segmentation Without Dictionary

Unsupervised Word Segmentation Without Dictionary

... Word Segmentation. With the potential words and MI values indicating their likelihood, we proceeded to segment the text of a large corpus into words. For the Taiwanese Bible, we had to take care of the ...

5

Word Segmentation for Urdu OCR System

Word Segmentation for Urdu OCR System

... million word corpus (Ijaz et ...for word segmentation er- rors, by adding missing spaces between words and replacing spaces with Zero Width Non- Joiner (ZWNJ) within ...of word grams, the 18 ...

7

Chinese Word Segmentation as Character Tagging

Chinese Word Segmentation as Character Tagging

... automatic word segmentation of Chinese text generally adopt their own working definitions of what a word is, or simply rely on native speakers’ subjective ...a word is in ...the ...

20

A New Unsupervised Approach to Word Segmentation

A New Unsupervised Approach to Word Segmentation

... This article proposes ESA, a new unsupervised approach to word segmentation. ESA is an iterative process consisting of three phases: Evaluation, Selection, and Adjustment. In Eval- uation, both the ...

34

Urdu Word Segmentation

Urdu Word Segmentation

... This work presents a preliminary effort on word segmentation problem in Urdu. It is a multi- dimensional problem. Each dimension requires a deeper study and analysis. Each sub-problem has been touched in ...

9

Multi Grained Chinese Word Segmentation

Multi Grained Chinese Word Segmentation

... Traditionally, word segmentation (WS) adopts the single-granularity formal- ism, where a sentence corresponds to a single word ...Chinese word boundaries is only 76%, indicat- ing ...

12

Improving Cross Domain Chinese Word Segmentation with Word Embeddings

Improving Cross Domain Chinese Word Segmentation with Word Embeddings

... Chinese Word Segmentation (CWS) remains a challenge despite recent progress in neural-based ...semi-supervised word-based approach to im- proving cross-domain CWS given a baseline ...ploys ...

10

Nonparametric Word Segmentation for Machine Translation

Nonparametric Word Segmentation for Machine Translation

... involving word segmentation or morphological analysis of the source and/or target ...Arabic word “fktbwha” and its En- glish translation “so they wrote ...preferred segmentation of “fktbwha” ...

9

Show all 7528 documents...

Related subjects