A Statistical Method for Extracting Uninterrupted and Interrupted Collocations from Very Large Corpora
Related documents
Domain Adaptation for Statistical Machine Translation with Domain Dictionary and Monolingual Corpora. Proceedings of the 22nd International Conference on Computational Linguistics
This paper proposes a learning and extracting method of word sequence correspondences from non-aligned parallel corpora with Support Vector Machines, which have high ability of
Automatic Acquisition of Hyponyms from Large Text Corpora. Marti A. Hearst, Computer Science Division
In this paper, we proposed the signal processing method for extracting the scratching time, and applied the method to the piezo-ceramics sensors, the strain gauge,
Extracting pairs of corresponding phrases together with their word to word links, the biphrases, from sentence aligned bilingual corpora using statistical and heuristic models
Extracting Statistical Graph Features for Accurate and Efficient Time Series Classification. Daoyuan Li, University of Luxembourg, Luxembourg
4. the current availability of large-scale corpora, tools for natural language processing and automatic text annotation, and statistical methods to extract linguistic
As the input for our first experiment we used data of eight Aranea corpora, with four of them representing the “large” languages (English, French, German, and Russian), and