human judgments

What makes a good conversation? How controllable attributes affect human judgments

... via human judgments of overall quality, the relationship between quality and these individual factors is less ... large-scale human evaluation to measure the effect of these control parameters on ...

22

Do dependency parsing metrics correlate with human judgments?

... with human judgment of parse output quality across five languages: Croatian, Danish, English, German, and ... the human judgments, we asked professional linguists with dependency annotation ...

6

Fluency, Adequacy, or HTER? Exploring Different Human Judgments with a Tunable MT Metric

... with human judgments of translation quality, it has several flaws, including the use of only a single reference translation and the measuring of similarity only by exact word matches between the hypoth- ...

10

Taking MT evaluation metrics to extremes: beyond correlation with human judgments

... where human assessments were collected on a continuous adequacy ...preference judgments (see Section ...with human judgments collected using a discrete scale following the post-editing effort ...

45

Improving Correlation with Human Judgments by Integrating Semantic Similarity with Second–Order Vectors

... a human curated taxonomy into a second–order vector represen- ... with human judgments for both similarity and relatedness, and that our method compares favorably to various different word ...

10

Not So Latent Dirichlet Allocation: Collapsed Gibbs Sampling Using Human Judgments

... our human-learned topic models have unique features such as fixed sparsity and a tendency for topics to be constructed around concepts which models such as LDA typically fail to ...

8

Vectorial Semantic Spaces Do Not Encode Human Judgments of Intervention Similarity

... grammaticality judgments and online reaction times), and across operators (symmetric and asymmetric) show a consistent lack of correlation between measurements collected in experiments that manipulated the ...

10

Re-evaluating the Role of BLEU in Machine Translation Research

... Over the past five years progress in machine translation, and to a lesser extent progress in natural language generation tasks such as summarization, has been driven by optimizing against n-gram-based evaluation ...

8

A Dataset and Evaluation Metrics for Abstractive Compression of Sentences and Short Paragraphs

... on human compression quality as found in this ... and human judgments of meaning preservation and grammaticality in the compression task, and analyze the impact of the linguistic units used and ...

11

A Generative Model of Vector Space Semantics

... graded human judgments of phrase similarity given only positive examples of matching pairs, or distributional representations of pairs as training data; when trained in this fashion, the model outperforms ...

9

Approximating a Deep Syntactic Metric for MT Evaluation and Tuning

... with human judgments but it is computationally costly and hard to adapt to other languages because it relies on a deep-syntactic analysis of the system output and the refer- ... to human ...

7

Squibs: Evaluating Human Pairwise Preference Judgments

... with human judgments have been developed, especially in Machine Translation, to relieve some of the ... the human evaluation to be ... present human judgments of preferences for their ...

9

Structured vs Flat Semantic Role Representations for Machine Translation Evaluation

... The human adequacy judgments were obtained by showing all three MT outputs together with the Chinese source input to a human ... The human reader was instructed to order the sentences from ...

11

Expressivism and the use of moral language

... different category from moral judgments. The purpose of this dissertation is to see how the two most prominent expressivists in recent years, Simon Blackburn and Allan Gibbard set out to explain the surface. I ...

75

Possession, Indefeasibility and Human Rights

... primacy. There was no law of ownership, merely a law of possession. 79 Legal effect was to be given to the de-facto state of affairs. To do otherwise would amount to an injustice. 80 However, this state of affairs was ...

15

Truth in Economic Subjectivism

... If economic judgments can correspond to facts and thus instantiate truth, then other normative judgments, such as moral judgments, can be similarly consistent with realism. It ...

7

Prediction of relevant biomedical documents: a human microbiome case study

... Laboratory experiments in academia have often shown dramatic gains from using relevance feedback [4]. Once ranked retrieval was used commercially, however, relevance feedback was either not used, or used in a very ...

12

Human rights law as social control

... Court's judgments by the Committee of Ministers of the Council of ... Europe. Judgments of the Court are ultimately declaratory commands similar to other forms of legal judgments: they are ...

34

Filtering and Measuring the Intrinsic Quality of Human Compositionality Judgments

... In this task, we built three datasets, in French (fr), Portuguese (pt) and English (en), containing human-annotated compositionality scores for 2-word NCs. Annotators were native speakers using an online ...

6

Reranking Bilingually Extracted Paraphrases Using Monolingual Distributional Similarity

... Occasionally, paraphrases are context-dependent, meaning the relevance of the paraphrase depends on the context in a sentence. Bilingual methods can capture limited context through syntactic constraints if the POS tags ...

10
