large text corpora

Towards a Structured Representation of Generic Concepts and Relations in Large Text Corpora

... Existing work on pre-defined relation extraction have implemented methods of supervised, semi-supervised, bootstrapped and unsupervised classification (Zhao and Grishman, 2005), (Kambhatla, 2004) (Bunescu and Mooney, ...

9

Automatic Acquisition of Hyponyms from Large Text Corpora

... Automatic Acquisition of Hyponyms from Large Text Corpora. Marti A. Hearst, Computer Science Division, 571 Evans[.] ...

7

Massive Disambiguation of Large Text Corpora With Flexible Categorial Grammar

... MASSIVE DISAMBIGUATION OF LARGE TEXT CORPORA WITH FLEXIBLE CATEGORIAL GRAMMAR. Ton van der WOUDEN (CFLEX/INL), Dirk HE[.] ...

5

Hearst Patterns Revisited: Automatic Hypernym Detection from Large Text Corpora

... Distributional models: For the distributional baselines, we employ the large, sparse distributional space of Shwartz et al. (2017), which is computed from UkWaC and Wikipedia, and is known to have strong ...

6

Enhancing the possibilities of corpus based investigations: Word sense disambiguation on query results of large text corpora

... For each snippet we generate bag-of-words vectors using contexts of 10, 40, 80 or all words around the word of interest. Hence, for context size 10 we use the ten words before the token, the token itself and the ten ...

5

Exploratory Relation Extraction in Large Text Corpora

... easy adaption to changing domains (Chiticariu et al., 2013; Chiticariu et al., 2010). The lack of tools to assist rule developers in exploring and choosing between different automatically generated rules has been stated ...

10

Graph-based exploration and clustering analysis of semantic spaces

... from large text corpora (Google news, Amazon reviews), and “human built” word networks derived from the well-known lexical databases: WordNet and Moby ...

26

Visualization Based Sequential Pattern Text Mining

... A sequential pattern in data mining is a finite series of elements such as A → B → C → D where A, B, C, and D are elements of the same domain. The mining of sequential patterns is designed to find patterns of discrete ...

6

Improving LSTM based Video Description with Linguistic Knowledge Mined from Text

... noisy text in the caption ...use large text corpora to learn vector-space representations of words that capture fine-grained semantic and syntactic ...

6

Large Scale Paraphrasing for Natural Language Understanding

... Further, we propose a semantics for paraphrasing by classifying each paraphrase pair with one of the entailment relation types defined by natural logic (MacCartney, 2009). Natural logic is used to perform inference ...

7

Automatic Construction of Large Readability Corpora

... a text has a wide range of applications, including support for student reading material selection (Petersen and Ostendorf, 2009) or help for clinical patients (Feng et ...of text simplification, evaluating ...

10

Scaling Distributional Similarity to Large Corpora

... It is a general property of Machine Learning that increasing the volume of training data increases the accuracy of results. This is no more evident than in Natural Language Processing (NLP), where massive quantities of ...

8

Categorization of Large Corpora of Malicious Software

... Anubis is a tool for analyzing the behavior of Windows PE-executable (binaries) files. Execution of Anubis results in the generation of a report file in HTML, XML, text, and PDF formats that contains very detailed ...

43

A Flexible Infrastructure for Large Monolingual Corpora

... We describe an infrastructure for managing large monolingual language resources. Since 1995, we have accumulated a German text corpus of more than 300 Million words with approx. 6 Million different word ...

5

Bootstrapping Large Sense Tagged Corpora

... The generation algorithm is evaluated in two ways. First, this algorithm was used to create the GenCor corpus, which was employed during the English all words task, with significant improvement measured over the ...

5

Finding Parts in Very Large Corpora

... Pattern A headlight windshield ignition shifter dashboard radiator brake tailpipe pipe airbag speedometer converter hood trunk visor vent wheel occupant engine tyre Pattern B trunk wheel[r] ...

8

Manipulating Large Corpora for Text Classification

... larger corpora such as web pages in Internet applications (Mladenic and Grobelnik, 1998), (McCallum, 1999), (Dumais and Chen, ...the large collection of categories more ...

8

TMU Transformer System Using BERT for Re-ranking at BEA 2019 Grammatical Error Correction on Restricted Track

... a large corpus, such as Bidirectional Encoder Representations from Transformers (BERT), in a form suitable for the learner’s grammatical ...learner corpora with grammatical errors for ...

6

Discovery of Treatments from Text Corpora

... The preceding section shows that if we are able to discover features in the data, we can estimate their AMCEs by randomly assigning texts to respondents. We now present a statistical model for discovering those ...

10

Automatic Corpora Construction for Text Classification

... candidate corpora belong to the same ...conduct corpora denoising. After clustering, the large clusters are preserved while the smaller ones are removed as the ...

7
