• No results found

Quality of the Evaluation Dataset

SPADE: Evaluation Dataset for Monolingual Phrase Alignment

SPADE: Evaluation Dataset for Monolingual Phrase Alignment

... 1 https://www.ldc.upenn.edu/ 2 https://catalog.ldc.upenn.edu/LDC2018T09 (will be effective since March 2018) ments among these phrases as shown in Figure 1, resulted in 15, 721 alignments that at least an annotator ...

5

Polish evaluation dataset for compositional distributional semantics models

Polish evaluation dataset for compositional distributional semantics models

... 1.4 Motivation and organisation of the paper Studying approaches to various natural language processing (henceforth NLP) problems, we have observed that the availability of language re- sources (e.g. training or testing ...

9

Quality Assessment of the French OpenStreetMap Dataset

Quality Assessment of the French OpenStreetMap Dataset

... of quality control processes represent an integrant part of ongoing researches in the conception of an evaluation model of geometric imprecision in under qualified vector databases using, or not, reference ...

20

Visualisation of quality information for geospatial and remote sensing data:providing the GIS community with the decision support tools for geospatial dataset quality evaluation

Visualisation of quality information for geospatial and remote sensing data:providing the GIS community with the decision support tools for geospatial dataset quality evaluation

... the quality information facet. While a ‘quality seal’ icon could be utilised for this facet, such symbology could potentially be interpreted as certification and mislead the label ...

381

Discourse Coherence in the Wild: A Dataset, Evaluation and Methods

Discourse Coherence in the Wild: A Dataset, Evaluation and Methods

... 1 Introduction Discourse coherence is an important aspect of text quality. It encompasses how sentences are con- nected as well as how the entire document is orga- nized to convey information to the reader. Devel- ...

10

Comprehensive Multi Dataset Evaluation of Reading Comprehension

Comprehensive Multi Dataset Evaluation of Reading Comprehension

... 4.3 Synthetic Augmentations Table 6 shows the performance of the baseline model on various development sets and heuris- tically generated questions. The More Wrong Choice augmentation is omitted since a high enough ...

7

A High Quality Multilingual Dataset for Structured Documentation Translation

A High Quality Multilingual Dataset for Structured Documentation Translation

... XML accuracy, matching, and BLEU For each output text segment, we use the etree mod- ule to check if it is a valid XML structure by wrap- ping it with a dummy root node. Then the XML accuracy score is the number of the ...

12

Dataset for the First Evaluation on Chinese Machine Reading Comprehension

Dataset for the First Evaluation on Chinese Machine Reading Comprehension

... • Question Restriction: Only five questions can be ex- tracted within one passage. 3.4. Human Annotation Apart from the automatically generated large-scale training data, we also provide human-annotated validation and ...

5

Applying different genomic evaluation approaches on QTLMAS2010 dataset

Applying different genomic evaluation approaches on QTLMAS2010 dataset

... the quality of the SNP map to capture the whole genetic variation, which would depend on several factors such as Linkage Dise- quilibrium (LD) between loci and the coverage of the whole ...the quality of ...

7

EVALution MAN: A Chinese Dataset for the Training and Evaluation of DSMs

EVALution MAN: A Chinese Dataset for the Training and Evaluation of DSMs

... its evaluation. In this paper, we introduce a dataset for training and evaluating DSMs on semantic relations discrimination between words, in Mandarin, ...the dataset followed EVALution 1.0, which is ...

5

Crowd-Sourcing A High-Quality Dataset for Metaphor Identification in Tweets

Crowd-Sourcing A High-Quality Dataset for Metaphor Identification in Tweets

... a dataset for metaphor identification, that is able to rapidly achieve large coverage over the different usages of metaphor in a given corpus while maintaining high ...first dataset of tweets annotated for ...

17

Towards building a standard dataset for Arabic keyphrase extraction evaluation

Towards building a standard dataset for Arabic keyphrase extraction evaluation

... irst dataset of keyphrases for an Arabic document collection, obtained by means of ...the quality of our ...the dataset features, some lessons learned, and ideas for future ...

5

A Dataset and Evaluation Metrics for Abstractive Compression of Sentences and Short Paragraphs

A Dataset and Evaluation Metrics for Abstractive Compression of Sentences and Short Paragraphs

... reference dataset for abstractive sentence and short paragraph ...sion quality as found in this ...automatic evaluation metrics and human judgments of meaning preservation and grammaticality in the ...

11

Towards building a standard dataset for Arabic keyphrase extraction evaluation

Towards building a standard dataset for Arabic keyphrase extraction evaluation

... V. Conclusion and Future Work We reported on our irst efort in building a new KP dataset for Arabic documents by means of crowdsourc- ing. Being our irst efort in building such a corpus, there is plenty of ...

5

On the Creation of a Fuzzy Dataset for the Evaluation of Fuzzy Semantic Similarity Measures

On the Creation of a Fuzzy Dataset for the Evaluation of Fuzzy Semantic Similarity Measures

... benchmark dataset as a basis for the SFWD dataset would ensure that the same level of quality is ...existing dataset would require the addition of fuzzy components, which would then need to be ...

8

Empirical evaluation of software maintainability based on a manually validated refactoring dataset

Empirical evaluation of software maintainability based on a manually validated refactoring dataset

... It is hard to ensure the systematic reproducibility of the human evaluations as the classification of refactoring instances is prone to human subjectivity. There is no way we can guarantee that the re-validation of the ...

50

Global retrieval of ATSR cloud parameters and evaluation (GRAPE): dataset assessment

Global retrieval of ATSR cloud parameters and evaluation (GRAPE): dataset assessment

... which some are likely applicable to other, similar algorithms. Chiefly, these include improved identification and treatment of multi-layer and mixed-phase cloud systems; improved identification of cloud and description ...

24

daq, an Ontology for Dataset Quality Information

daq, an Ontology for Dataset Quality Information

... the Dataset Quality vocabulary is to enable data publishers to easily describe dataset quality so that, in turn, con- sumers can easily find out which datasets are fit for their intended ...

8

A New Dataset and Evaluation for Belief/Factuality

A New Dataset and Evaluation for Belief/Factuality

... For the actual evaluation, we used files which also had been hand-annotated for ACE entities. How- ever, we did not have a gold annotation for entity- focused belief, as this study is still contributing to- wards ...

10

Data Quality in the National Minimum Dataset (NMDS)

Data Quality in the National Minimum Dataset (NMDS)

... Accident flag, ACC claim number, purchaser code, admission date, external cause date ¾ If the accident flag is Y then the ACC claim. number field must not be blank[r] ...

28

Show all 10000 documents...

Related subjects