• No results found

large corpora

Word Sense Disambiguation Using Statistical Models of Roget’s Categories Trained on Large Corpora

Word Sense Disambiguation Using Statistical Models of Roget’s Categories Trained on Large Corpora

... Word Sense Disambiguation Using Statistical Models of Roget's Categories Trained on Large Corpora Word Sense Disambiguation Using Statistical Models of Roget's Categories Trained on Large Corpora Davi[.] ...

7

Fast and Accurate Misspelling Correction in Large Corpora

Fast and Accurate Misspelling Correction in Large Corpora

... various corpora in English and Ital- ...for large Levenshtein distances, being more than 30 times faster than a linear algorithm, and several hundred times faster than ...

9

Scaling to Very Very Large Corpora for Natural Language Disambiguation

Scaling to Very Very Large Corpora for Natural Language Disambiguation

... We show the results from sample selection for confusion set disambiguation in Figure 4. The line labeled "sequential" shows test set accuracy achieved for different percentages of the one billion word training ...

8

Discovering Relations among Named Entities from Large Corpora

Discovering Relations among Named Entities from Large Corpora

... Discovering the significant relations embedded in documents would be very useful not only for infor- mation retrieval but also for question answering and summarization. Prior methods for relation discov- ery, however, ...

8

CSNIPER   Annotation by query for Non canonical Constructions in Large Corpora

CSNIPER Annotation by query for Non canonical Constructions in Large Corpora

... This annotation-by-query approach of querying, assessing, evaluating and annotating allows multiple distributed raters to incrementally improve query re- sults and achieve high quality annotations. In this paper, we show ...

6

A Statistical Method for Extracting Uninterrupted and Interrupted Collocations from Very Large Corpora

A Statistical Method for Extracting Uninterrupted and Interrupted Collocations from Very Large Corpora

... A Statistical Method for Extracting Uninterrupted and Interrupted Collocations from Very Large Corpora A Statistical Method for Extracting Uninterrupted and Interrupted Collocations from Very Large Co[.] ...

6

Deducing Linguistic Structure from the Statistics of Large Corpora

Deducing Linguistic Structure from the Statistics of Large Corpora

... Deducing Linguistic Structure from the Statistics of Large Corpora D e d u c i n g L i n g u i s t i c S t r u c t u r e f r o m t h e S t a t i s t i c s of Large C o r p o r a E r i c Brill~ D a v i[.] ...

8

The Acquisition of Lexical Semantic Knowledge from Large Corpora

The Acquisition of Lexical Semantic Knowledge from Large Corpora

... The Acquisition of Lexical Semantic Knowledge from Large Corpora T h e A c q u i s i t i o n o f L e x i c a l S e m a n t i c K n o w l e d g e from Large C o r p o r a J a m e s Pustejovsky C o m p[.] ...

6

Tagging of very large corpora: Topic Focus Articulation

Tagging of very large corpora: Topic Focus Articulation

... hajicova dvi Tagging of very large corpora Topic Focus Articulation Eva Bur??ov? and Eva Haji?ov? and Petr Sgall Institut of Formal and Applied Linguistics, Faculty of Mathematics and Physics Charles[.] ...

6

In tool Learning for Selective Manual Annotation in Large Corpora

In tool Learning for Selective Manual Annotation in Large Corpora

... We present a novel approach to the selec- tive annotation of large corpora through the use of machine learning. Linguis- tic search engines used to locate potential instances of an infrequent phenomenon do ...

6

Scaling Distributional Similarity to Large Corpora

Scaling Distributional Similarity to Large Corpora

... Accurately representing synonymy using distributional similarity requires large vol- umes of data to reliably represent infre- quent words. However, the na¨ıve nearest- neighbour approach to comparing context ...

8

Portuguese Text Generation from Large Corpora

Portuguese Text Generation from Large Corpora

... Statistical approaches to NLG may however face a num- ber of difficulties. Among these, there is the issue of data sparseness, a problem that is particularly evident in cases such as our target language - Brazilian ...

5

Manipulating Large Corpora for Text Classification

Manipulating Large Corpora for Text Classification

... a large collection of data and propose a method for text classifi- cation which manipulates data using two well-known machine learning techniques, Naive Bayes(NB) and Support Vector Ma- ...handle large ...

8

Seeded Discovery of Base Relations in Large Corpora

Seeded Discovery of Base Relations in Large Corpora

... 3.3 Pruning clusters After clustering relation phrases with AP, we prune the resulting partition by evaluating the number of different relation instances appearing in each cluster, as we[r] ...

9

Finding Parts in Very Large Corpora

Finding Parts in Very Large Corpora

... Pattern A headlight windshield ignition shifter dashboard radiator brake tailpipe pipe airbag speedometer converter hood trunk visor vent wheel occupant engine tyre Pattern B trunk wheel[r] ...

8

A survey of machine learning approaches to analysis of large corpora

A survey of machine learning approaches to analysis of large corpora

... Part-of-Speech tagging, parsing, and semantic analysis take place at the level of word and sentence: each sentence in a text can be analysed independently. Language in real use in spoken dialogues exhibits structure ...

9

Categorization of Large Corpora of Malicious Software

Categorization of Large Corpora of Malicious Software

... Fig(5) MySql database: Querying analysis results of a malware to measure the threat level of a malware.. 29. Conclusion[r] ...

43

TMU Transformer System Using BERT for Re ranking at BEA 2019 Grammatical Error Correction on Restricted Track

TMU Transformer System Using BERT for Re ranking at BEA 2019 Grammatical Error Correction on Restricted Track

... a large corpus, such as Bidirectional Encoder Representations from Transformers (BERT), in a form suitable for the learner’s grammat- ical ...learner corpora with grammat- ical errors for ...

6

Advertisments

Advertisments

... Reversible Grammar in NLP The Balancing Act Computational Phonology Third Workshop on Very Large Corpora Fourth Workshop on Very Large Corpora Empirical Methods in NLP Fifth Workshop on [r] ...

9

Advertisements

Advertisements

... Third Workshop on Very Large Corpora, at ACL-95, Cambridge, MA, 30 June 1995 $ Fourth Workshop on Very Large Corpora, at Coling-96, Copenhagen, Denmark 4 August 1996 5 Fifth Workshop on [r] ...

14

Show all 10000 documents...

Related subjects