• No results found

annotated data

EXCOTATE: An Add on to MMAX2 for Inspection and Exchange of Annotated Data

EXCOTATE: An Add on to MMAX2 for Inspection and Exchange of Annotated Data

... large data sets, such as the annotation of preposition senses ...stores data in an xml-standoff ...already annotated data require manual re-annotation or ...

6

Cheap checking for cloud computing : statistical analysis via annotated data streams

Cheap checking for cloud computing : statistical analysis via annotated data streams

... send data off to the cloud, and request some analysis to be performed, what guarantee do we get that the processing has been done to our satisfac- tion? The provider has an economic incentive to cut cor- ners: to ...

15

Just “OneSeC” for Producing Multilingual Sense Annotated Data

Just “OneSeC” for Producing Multilingual Sense Annotated Data

... neously producing high-quality resources. For ex- ample, Delli Bovi et al. (2017) exploited an exter- nal WSD system, i.e., Babelfy (Moro et al., 2014), and the richer context provided by aligned sen- tences, to carry ...

11

A Comparison Of Emotion Annotation Schemes And A New Annotated Data Set

A Comparison Of Emotion Annotation Schemes And A New Annotated Data Set

... standard data sets and well developed methodologies, the recognition of more nuanced affect has received less attention, and in particular, there are very few publicly available gold standard annotated ...

6

Representing Multimodal Linguistic Annotated data

Representing Multimodal Linguistic Annotated data

... Many tools and frameworks are available for handling rich media data. The practice of taking advantage of such a rich tool offer will not change. What can change however is the habit of developing a new language ...

7

Are You a Racist or Am I Seeing Things? Annotator Influence on Hate Speech Detection on Twitter

Are You a Racist or Am I Seeing Things? Annotator Influence on Hate Speech Detection on Twitter

... our annotated data set (see Table 2), it is to be expected the largest overlap occurs with tweets annotated as negative for hate ...our data set generally differs from the distribution in ...

5

Building Domain Specific Taggers without Annotated (Domain) Data

Building Domain Specific Taggers without Annotated (Domain) Data

... rapidly develop taggers for new domains without using the time and effort to develop annotated data. In this work, we use the Wall Street Journal (WSJ) corpus (Marcus et al, 1993) and large amounts of ...

9

Training a Neural Network in a Low Resource Setting on Automatically Annotated Noisy Data

Training a Neural Network in a Low Resource Setting on Automatically Annotated Noisy Data

... The model proposed by Bekker and Goldberger (2016) assumes that all clean labels pass through a noisy channel. One does only observe the noisy labels. The model of the noise channel, as well as the clean labels, are ...

7

WebCAGe – A Web Harvested Corpus Annotated with GermaNet Senses

WebCAGe – A Web Harvested Corpus Annotated with GermaNet Senses

... arately for the three word classes of adjectives, nouns, and verbs. Table 3 shows that precision and recall for all three word classes that occur for Wiktionary examples, external webpages, and Wikipedia articles lies ...

10

Named Entity Recognition with Partially Annotated Training Data

Named Entity Recognition with Partially Annotated Training Data

... First, to get an idea of the difficulty of NER in each language, we report scores from models trained on gold data without perturbation (Gold). Then we re- port results from an Oracle Weighting scheme (Ora- cle ...

11

Forecasting pedestrian trajectory with machine annotated training data

Forecasting pedestrian trajectory with machine annotated training data

... A similar annotation process is proposed in [33], in which pedestrians are detected and tracked using [5]. However, automated detectors do not perform on par with human annotators, and make different errors to humans, ...

7

Annotated Gigaword

Annotated Gigaword

... have annotated this collection with syntactic and discourse structure, for release to the community through the Linguistic Data Consortium (LDC) as a static, large-scale resource for knowledge acqui- sition ...

6

Finding Microaggressions in the Wild: A Case for Locating Elusive Phenomena in Social Media Posts

Finding Microaggressions in the Wild: A Case for Locating Elusive Phenomena in Social Media Posts

... MA data and, given that prior studies have shown substantial gender disparity online, with women receiving more negative be- haviors (Duggan, 2017), this choice has the po- tential for highest ...

11

Nested Named Entity Recognition

Nested Named Entity Recognition

... We believe this has largely been for practical, not ideological, reasons. Most corpus designers have chosen to skirt the issue entirely, and have annotated only the topmost entities. The widely used CoNLL (Sang ...

10

Developing language technology tools and resources for a resource poor language: Sindhi

Developing language technology tools and resources for a resource poor language: Sindhi

... The research presented in this paper was done in col- laboration with my advisors, Prof. Dipti M. Sharma and Dr. Manish Shrivastava. The Part-of-Speech an- notation was done in collaboration with Dr. Pinkey Nainwani and ...

8

The Annotated Transformer

The Annotated Transformer

... In this experimental paper, I propose an ex- ercise in open-source NLP. The goal is to tran- scribe a recent paper into a simple and under- standable form. The document itself is pre- sented as an annotated paper. ...

9

Creating Lithuanian and Latvian Speech Corpora from Inaccurately Annotated Web Data

Creating Lithuanian and Latvian Speech Corpora from Inaccurately Annotated Web Data

... Then, audio is extracted from each video file and processed by the LIUM SpkDiarization toolkit(Rouvier et al., 2013), which segments audio into smaller parts and groups them into clusters (that should correspond to ...

5

Extract offender information from text

Extract offender information from text

... unlabeled data and each edge in the graph is ...predicted data of a trained classifier model is used to retrain its model so that it is able to train itself with new unlabeled ...unlabeled data as ...

131

Annotated bibliography

Annotated bibliography

... A symposium on brain stem tumors in childhood was held in December 1995 at New York University. Several papers from that symposium are published in this and subsequent issues of Pediatric Neurosurgery and will be ...

5

Annotated HB

Annotated HB

... It was tragic, all right, but George and Hazel couldn't think about it very hard. Hazel had a perfectly average intelligence, which meant she couldn't think about anything except in sho[r] ...

6

Show all 10000 documents...

Related subjects