[PDF] Top 20 Chinese Word Segmentation Based on Contextual Entropy
Has 10000 "Chinese Word Segmentation Based on Contextual Entropy" found on our website. Below are the top 20 most common "Chinese Word Segmentation Based on Contextual Entropy".
Feature based Neural Language Model and Chinese Word Segmentation
... fore word segmentation is a very basic and impor- tant pre-process for Chinese language ...Traditional word segmentation approaches are lexicon-driven (Liang, ...predefined ... See full document
7
Chinese Named Entity Recognition and Word Segmentation Based on Character
... Chinese word segmentation and NER are two of the most fundamental problems in Chinese information processing and have attracted more and more ...Maximum Entropy (Ng and Low, 2005) and ... See full document
5
Covering Ambiguity Resolution in Chinese Word Segmentation Based on Contextual Information
... Model in Word Covering Ambiguity Resolution in Chinese Word Segmentation Based on Contextual Information Xiao LUO; Maosong SUN National Lab of Intelligent Tech and Systems Tsinghua University, Beijing[.] ... See full document
7
Subword Based Tagging for Confidence Dependent Chinese Word Segmentation
... for Chinese word segmentation to improve the existing character-based ...maximum entropy (MaxEnt) and the conditional random fields (CRF) ... See full document
8
Semi supervised Chinese Word Segmentation based on Bilingual Information
... supervised Chinese word segmentation (CWS) method that leverages the nat- ural segmenting information of English ...sub-model based on character-based alignment to obtain ex- plicit ... See full document
10
A MMSM based Hybrid Method for Chinese MicroBlog Word Segmentation
... 泉基金会主席(Yanlin Zhuang act as chairman of the Xiquan Zhuang Fund)” for example, The person name “庄炎林(Yanlin Zhuang)” and “庄 希泉(Xiquan Zhuang)” do not exist in the system dictionary. The word-based MMSM model ... See full document
7
Co regularizing character based and word based models for semi supervised Chinese word segmentation
... and word bound- aries is very hard (Jiao et ...the segmentation as the hidden variable in machine ...the segmentation result- s by interpolating the statistics-based features de- rived from ... See full document
6
Unsupervised Segmentation of Chinese Text by Use of Branching Entropy
... in Chinese text, we pre-processed the test data by segmenting sentences at punctuation locations to form text ...because Chinese words with a length of more than 5 characters are ... See full document
8
Multi Grained Chinese Word Segmentation
... Penn Chinese Treebank (CTB) (Xue et ...matching based on lexicon dictionaries (Liu and Liang, 1986), to path searching from segmentation graphs based on language modeling scores and other ... See full document
12
Neural Word Segmentation Learning for Chinese
... Neural Network Models. Most modern CWS methods followed (Xue, 2003) treated CWS as a sequence labeling problems (Zhao et al., 2006b). Recently, researchers have tended to explore neu- ral network based approaches ... See full document
12
Word Boundary Decision with CRF for Chinese Word Segmentation
... for Chinese to English (Chang et al., 2008), segmentation errors would cause translation mistakes ...certain word, ...a word. It achieves much better performances than traditional ... See full document
7
Improving Word Alignment by Adjusting Chinese Word Segmentation
... adjust word segmentation so as to decrease the effect of lexicalization differences to improve word alignment ...adjust Chinese word segmentation according to their translation ... See full document
8
Synthetic Word Parsing Improves Chinese Word Segmentation
... In recent years, some related works about im- proving OOV problem in CWS have been ongo- ing. Sun et al. (2012) presented a joint model for Chinese word segmentation and OOVs detection. Their models ... See full document
6
Active Learning for Chinese Word Segmentation
... model based on a dictionary or a labelled data set. Among them, character-based classification has drawn most attention recently and been further implemented with sequence labelling algorithms (Tseng et ... See full document
10
A Pragmatic Chinese Word Segmentation Approach Based on Mixing Models
... Though many methods have been proposed and many improvements have been achieved, as a challenge task, word segmentation is not well-performed. The disambiguation and the out-of-vocabulary (OOV) ... See full document
24
Chinese Segmentation with a Word Based Perceptron Algorithm
... Penn Chinese Treebank Corpus (CTB), the Hong Kong City Uni- versity Corpus (CU) and the Peking University Cor- pus ...Penn Chinese Treebank Corpus is currently un- available, we excluded this ... See full document
8
Chinese Word Segmentation based on analogy and majority voting
... other. Any two of these similar substrings and input D form an analogical equation. In general, not all solutions of the equations occur in the training cor- pus. Consequently, only the solutions which occur in the ... See full document
6
A Character Based Joint Model for Chinese Word Segmentation
... The last 2,000, 600, 400, and 300 sentences for AS, MSR, CITYU, and PKU are extracted from the original training corpora as their cor- responding development sets. The statistics for new data sets are shown in Table 4. ... See full document
9
A compression based algorithm for Chinese word segmentation
... To infer word boundaries, a general adaptive text compression technique is used that predicts upcoming characters on the basis of their preceding context.. Spaces are inserted into posit[r] ... See full document
20
Related subjects