• No results found

[PDF] Top 20 Corpora and Data Preparation for Information Extraction

Has 10000 "Corpora and Data Preparation for Information Extraction" found on our website. Below are the top 20 most common "Corpora and Data Preparation for Information Extraction".

Corpora and Data Preparation for Information Extraction

Corpora and Data Preparation for Information Extraction

5

Information Extraction from Text Corpora: Using Filters on Collocation Sets

Information Extraction from Text Corpora: Using Filters on Collocation Sets

... text corpora of different Indo-European languages, following an integrated ap- proach which combines corpus data with lexicographic information like linguistic categories or semantic attrib- ... See full document

5

Inter-sentential Relations in Information Extraction Corpora

Inter-sentential Relations in Information Extraction Corpora

... Many information extraction systems are constrained to extracting binary relations that are asserted within a single sentence (single-sentence relations) and this limits the proportion of relations they can ... See full document

5

A Semi-Supervised Information Extraction Framework for Large Redundant Corpora

A Semi-Supervised Information Extraction Framework for Large Redundant Corpora

... the data the user wishes to query which generally avoid the lim- itations and complexity of most Information Extractions ...extracts information from large corpora with a high degree of ... See full document

62

Mining Metalinguistic Activity in Corpora to Create Lexical Resources Using Information Extraction Techniques: the MOP System

Mining Metalinguistic Activity in Corpora to Create Lexical Resources Using Information Extraction Techniques: the MOP System

... large-scale corpora has made it possible to mine specific knowledge from free or semi-structured text, resulting in what many con- sider by now a reasonably mature NLP technolo- ...in Information ... See full document

8

Statistical modelling of MT output corpora for information extraction

Statistical modelling of MT output corpora for information extraction

... The output of state-of-the-art machine translation (MT) systems could be useful for certain NLP tasks, such as Information Extraction (IE). However, some unresolved problems in MT technology could seriously ... See full document

15

Using Dialogue Corpora to Extend Information Extraction Patterns for Natural Language Understanding of Dialogue

Using Dialogue Corpora to Extend Information Extraction Patterns for Natural Language Understanding of Dialogue

... dialogue corpora can be used to extend coverage of Information Extraction (IE) templates in a Spoken Dialogue ...dialogue corpora found on the ... See full document

5

Corpora and Data Preparation

Corpora and Data Preparation

5

ACCURAT Toolkit for Multi Level Alignment and Information Extraction from Comparable Corpora

ACCURAT Toolkit for Multi Level Alignment and Information Extraction from Comparable Corpora

... comparable corpora has received greater attention in light of the scarcity of parallel data for under-resourced ...lexicon extraction (Morin and Prochasson, ... See full document

6

Exploiting Multiply Annotated Corpora in Biomedical Information Extraction Tasks

Exploiting Multiply Annotated Corpora in Biomedical Information Extraction Tasks

... training data, can one make better use of the multiply annotated data? This paper will set out to answer two questions with em- pirical evidence presented from biomedical IE ...annotated data in ... See full document

5

Agriculture Information Extraction Using Data Analytics in Weka

Agriculture Information Extraction Using Data Analytics in Weka

... based information stockpiling frameworks we have gone over a gigantic measure of storehouse of ...this information is not extremely accommodating until we realize what we can do with ...huge ... See full document

8

An Integrated Approach to Heterogeneous Data for Information Extraction

An Integrated Approach to Heterogeneous Data for Information Extraction

... personal information extraction, such as biographical information and occupation, and those kinds of information are necessary to further construct a social network (a kind of semantic web) ... See full document

10

A Comparable Corpus Based on Aligned Multilingual Ontologies

A Comparable Corpus Based on Aligned Multilingual Ontologies

... resulting corpora can serve as a reference, a re- search resource, for information extraction tasks re- lated to ontology learning (term extraction, concept formation, instantiation, ... See full document

7

Multimodal Comparable Corpora as Resources for Extracting Parallel Data: Parallel Phrases Extraction

Multimodal Comparable Corpora as Resources for Extracting Parallel Data: Parallel Phrases Extraction

... parallel data in comparable corpora is a promising approach for over- coming the lack of parallel texts in statis- tical machine translation and other NLP ...comparable corpora of texts as resources ... See full document

7

Self Supervised Neural Machine Translation

Self Supervised Neural Machine Translation

... select data and train NMT systems simultaneously using the emerging NMT system itself to select the ...on data representa- tion, an adequate function for the selection pro- cess, and studying how to avoid ... See full document

7

Bilingual Lexicon Extraction from Comparable Corpora Enhanced with Parallel Corpora

Bilingual Lexicon Extraction from Comparable Corpora Enhanced with Parallel Corpora

... comparable corpora are gener- ally constructed via the consultation of specialized Web ...comparable corpora and CliniWeb 2 for the English part, and (D´ejean and Gaussier, 2002) use documents extracted ... See full document

8

Biomedical Term Extraction: NLP Techniques in Computational Medicine

Biomedical Term Extraction: NLP Techniques in Computational Medicine

... term extraction in the BioNLP domain, starting form a description of the basic techniques used to the methodology followed in the creation of a multilingual corpus of medical texts for medical term ... See full document

9

Automatic Extraction of Subcategorization from Corpora

Automatic Extraction of Subcategorization from Corpora

... However, since there are disagreements between the dictionaries and there are classes found in the corpus data that are not contained in either dictionary, we report results relative bot[r] ... See full document

8

French English Terminology Extraction from Comparable Corpora

French English Terminology Extraction from Comparable Corpora

... the corpora are ...sciences corpora of 8 millions words and a reference bilingual terminological database of 180 words with high frequencies in the corpus: from 100 to ... See full document

12

Modeling Missing Data in Distant Supervision for Information Extraction

Modeling Missing Data in Distant Supervision for Information Extraction

... ing data in distant supervision, because this is a case where data is not missing at random ...of extraction from text and the process by which propositions are observed or missing in both the ... See full document

12

Show all 10000 documents...

Related subjects