[PDF] Top 20 A compression based algorithm for Chinese word segmentation
Has 10000 "A compression based algorithm for Chinese word segmentation" found on our website. Below are the top 20 most common "A compression based algorithm for Chinese word segmentation".
A compression based algorithm for Chinese word segmentation
... To infer word boundaries, a general adaptive text compression technique is used that predicts upcoming characters on the basis of their preceding context.. Spaces are inserted into posit[r] ... See full document
20
A MMSM based Hybrid Method for Chinese MicroBlog Word Segmentation
... researches, Chinese word seg- mentation has achieved quite high precisions for formal style ...of segmentation is not so satisfying for MicroBlog ...for Chinese word segmentation ... See full document
7
Feature based Neural Language Model and Chinese Word Segmentation
... tributed word representation of neural language model have been proved very useful in NLP ...shares word representations across the tasks of language modeling, part-of-speech tag- ging, chunking, named ... See full document
7
Subword Based Tagging for Confidence Dependent Chinese Word Segmentation
... We proposed a subword-based tagging for Chinese word segmentation to improve the existing character-based tagging. The subword-based tagging was implemented using the maximum ... See full document
8
Co regularizing character based and word based models for semi supervised Chinese word segmentation
... bold scores indicate that our model does achieve significant gains over these two semi-supervised models. This outcome can further reveal that us- ing the agreements from these two views to regu- larize the learning can ... See full document
6
Chinese Segmentation with a Word Based Perceptron Algorithm
... of word-based models, we adapt the perceptron discriminative learning algorithm to the CWS ...the segmentation problem to a tag sequence learning problem, but defines fea- tures on segmented ... See full document
8
Chinese Word Segmentation by Mining Maximized Substrings
... and tested on CTB7 with different configurations. The row “Baseline” is baseline system as in Ta- ble 5. “+Basic&Freq” represents the system “MaxSub-U” with only basic and frequency fea- tures activated, and STS ... See full document
9
Context Based Chinese Word Segmentation using SVM Machine Learning Algorithm without Dictionary Support
... both word-based and character-based CWS methods. Word-based approaches treat the word as the basic unit, and POS and other word-based lin- guistic resources are ... See full document
9
A Pragmatic Chinese Word Segmentation Approach Based on Mixing Models
... Named Entity Recognition (NER) is one of the common message understanding tasks. The objective is to identify and categorize all members of certain categories of "proper names". In MUC-7, there are seven ... See full document
24
Discriminative Pruning of Language Models for Chinese Word Segmentation
... As shown in equation (13), the "importance" of each bigram depends on the base model. Ini- tially, the base model is set to the unigram model. With bigrams added in, it becomes a growing bigram model. Thus, W B * ... See full document
8
Word Boundary Decision with CRF for Chinese Word Segmentation
... for Chinese to English (Chang et al., 2008), segmentation errors would cause translation mistakes ...certain word, ...a word. It achieves much better performances than traditional ... See full document
7
Improving Word Alignment by Adjusting Chinese Word Segmentation
... adjust word segmentation so as to decrease the effect of lexicalization differences to improve word alignment ...adjust Chinese word segmentation according to their translation ... See full document
8
Synthetic Word Parsing Improves Chinese Word Segmentation
... In recent years, some related works about im- proving OOV problem in CWS have been ongo- ing. Sun et al. (2012) presented a joint model for Chinese word segmentation and OOVs detection. Their models ... See full document
6
Active Learning for Chinese Word Segmentation
... model based on a dictionary or a labelled data set. Among them, character-based classification has drawn most attention recently and been further implemented with sequence labelling algorithms (Tseng et ... See full document
10
Chinese Named Entity Recognition and Word Segmentation Based on Character
... Chinese word segmentation and named entity recognition (NER) are both important tasks in Chinese information ...both Chinese NER and word segmentation tasks, and turns out ... See full document
5
Semi supervised Chinese Word Segmentation based on Bilingual Information
... ter segmentation, although most such studies have focused on statistical machine translation ...consecutive Chinese characters either to construct a Chinese word dictionary for ... See full document
10
A Trainable Rule Based Algorithm for Word Segmentation
... The rule-based algorithm we developed to improve word segmentation is very effective for segmenting Chinese; in fact, the rule sequences combined with a very simple initial segmentation,[r] ... See full document
8
Adaptive Chinese Word Segmentation with Online Passive Aggressive Algorithm
... Due to the exponential size of the output space, sequence labeling problems tend to be more challenging than the conventional classifi- cation problems. Many algorithms have been proposed and the progress has been ... See full document
5
A Stochastic Finite State Word Segmentation Algorithm for Chinese
... Most languages that use Roman, Greek, Cyrillic, Armenian, or Semitic scripts, and m a n y that use Indian-derived scripts, mark orthographic word boundaries; however, languages written i[r] ... See full document
28
Related subjects