[PDF] Top 20 Unknown Word Detection for Chinese by a Corpus-based Learning Method
Has 10000 "Unknown Word Detection for Chinese by a Corpus-based Learning Method" found on our website. Below are the top 20 most common "Unknown Word Detection for Chinese by a Corpus-based Learning Method".
Unknown Word Detection for Chinese by a Corpus-based Learning Method
... of unknown word ...irregular unknown words, such as proper names etc. With this unknown word resolution procedure, an error reduction rate of ...the word segmentation ...Sinica ... See full document
18
A Chunking Strategy Towards Unknown Word Detection in Chinese Word Segmentation
... detect unknown words by chunking one or more word atoms together according to the word formation patterns of the word ...error-driven learning approach is proposed to learn useful ... See full document
12
Pruning False Unknown Words to Improve Chinese Word Segmentation
... Our method is based on the report by (Goh et ...and unknown word detection at the same ...on unknown word detection, then the recall for unknown words is ... See full document
12
Histogram Equalization on Statistical Approaches for Chinese Unknown Word Extraction
... to unknown word extraction can be roughly divided into two categories, rule-based approaches and statistical ...the corpus. In addition, statistical approaches and rule-based approaches ... See full document
20
Domain Adaptation and Attention Based Unknown Word Replacement in Chinese to Japanese Neural Machine Translation
... approximation method, we set the number of negative samples to 2,000, and the objective function was optimized via mini-batch stochastic gradient ...initial learning rate was ...the learning rate ... See full document
9
A MMSM based Hybrid Method for Chinese MicroBlog Word Segmentation
... New Word Dictionary which is updated weekly, and the second is the Sina Popular Word List, which is updated ...for Chinese MicroBlog morphological analysis in- cludes Chinese word ... See full document
7
Automatic Grammatical Error Detection for Chinese based on Conditional Random Field
... of learning and using Chinese, foreigners may have grammatical errors due to negative migration of their native ...tion method of grammatical errors is not mature enough. Based on the ... See full document
6
Fast Online Training with Frequency Adaptive Learning Rates for Chinese Word Segmentation and New Word Detection
... for Chinese word segmentation and new word ...including word-based features and enriched edge (label-transition) features, for the joint ...a word segmentation system on ... See full document
10
Augmented Parsing of Unknown Word by Graph-Based Semi-Supervised Learning
... propagation method to deal with unknown words. Graph-based label propagation methods have made a remarkable improvement in several natural language processing tasks, ...2008), Chinese ... See full document
9
Combining Finite State and Corpus-based Techniques for Unknown Word Prediction
... for unknown German ...metrics based on frequency counts obtained from a ...the corpus which is a drawback in comparison with the virtually unlimited data in the web our method has access ...a ... See full document
5
Unknown Word Extraction for Chinese Documents
... of unknown words in each three steps of processes, i.e. detection process, extraction process, and verification ...The detection process detects the occurrences of unknown words for better ... See full document
7
Automatic Corpus Based Thai Word Extraction with the C4 5 Learning Algorithm
... The induction algorithm proceeds by evaluating content of a series of attributes and iteratively building a tree from the attribute values with the leaves of the decision tree being the value of the goal attribute. At ... See full document
6
A Study on the Use of Conjunction by American Chinese Learners based on HSK Corpus
... It is found that conjunction redundancy is most frequently happened in the use of“ 所以 ”. English emphasizes hypotaxis while Chinese emphasizes parataxis. In Chinese writing, the relationship between clauses ... See full document
9
Using Chinese Gigaword Corpus and Chinese Word Sketch in linguistic Research
... to corpus-based computational approaches to linguistic analysis are to acquire enough data to show linguistic distribution, and to design efficient tools for extracting linguistically significant ... See full document
8
Word Alignment Annotation in a Japanese-Chinese Parallel Corpus
... in Chinese usually consists of two conjunctions, appearing ...its Chinese translation “虽然晚了,但是 (it is late, but)”, the Japanese conjunction “が (but)” corresponds to the Chinese conjunction “虽然 (but)” ... See full document
5
Investigating the Applicability of current Machine Learning based Subjectivity Detection Algorithms on German Texts
... second corpus is the MPQA corpus 1 , which is a 16000-sentence corpus made up of news arti- cles which are tagged with a complex set of sub- jectivity ...things. Based on these fine-grained ... See full document
8
CAD Based Method for Detection of Breast Cancer
... Performance evaluation in algorithms design is an important step that is commonly neglected. What constitutes an “acceptable” result differs significantly, and is often based on visual subjective opinion with very ... See full document
15
Automatic Event Trigger Word Extraction in Chinese Event
... The method of extracting event trigger word based on trigger word table includes the following two processes: construct trigger word set; calculate weight ...the word collection ... See full document
5
Chinese Word Segmentation with Conditional Support Vector Inspired Markov Models
... unsupervised learning outcomes from unlabeled ...unlabeled corpus for CWS, such as some statistics information on co-occurrence of sub- sequences in the whole text has been extracted from unlabeled data and ... See full document
7
A Corpus based Approach for Spanish Chinese Language Learning
... In order to use our annotated-corpus and search tool, the appropriate levels for Spanish foreign lan- guage learners are level B2 (intermediate level) and level C (including C1 and C2) (advanced level). Level B2 ... See full document
10
Related subjects