• No results found

Semantics, Pragmatics, and Context in Grounded Human Language Understanding

N/A
N/A
Protected

Academic year: 2021

Share "Semantics, Pragmatics, and Context in Grounded Human Language Understanding"

Copied!
151
0
0

Loading.... (view fulltext now)

Full text

(1)

Semantics, Pragmatics, and Context in Grounded Human Language Understanding

Roger Levy

Computational Psycholinguistics Laboratory (CPL) Dept. of Brain & Cognitive Sciences

Massachusetts Institute of Technology

(2)

Collaborators

Helena Aparicio Curtis Chen Elizabeth Coppock

(3)

The challenge of language understanding

(4)

The challenge of language understanding

• How do humans communicate so well with language?

(5)

The challenge of language understanding

• How do humans communicate so well with language?

(6)

The challenge of language understanding

• How do humans communicate so well with language?

S

VP

NP

PP on the beach N

dogs Det

the V

discussed

NP N women Det

The

Ambiguity

S

VP

PP on the beach NP

N dogs Det

the V

discussed NP

N women Det

The

(7)

The challenge of language understanding

• How do humans communicate so well with language?

Environmental noise

S

VP

NP

PP on the beach N

dogs Det

the V

discussed

NP N women Det

The

Ambiguity

S

VP

PP on the beach NP

N dogs Det

the V

discussed NP

N women Det

The

(8)

The challenge of language understanding

• How do humans communicate so well with language?

Environmental noise

Memory Limitations

S

VP

NP

PP on the beach N

dogs Det

the V

discussed

NP N women Det

The

Ambiguity

S

VP

PP on the beach NP

N dogs Det

the V

discussed NP

N women Det

The

(9)

The challenge of language understanding

• How do humans communicate so well with language?

Environmental noise

Incomplete knowledge of one’s interlocutors

Memory Limitations

S

VP

NP

PP on the beach N

dogs Det

the V

discussed

NP N women Det

The

Ambiguity

S

VP

PP on the beach NP

N dogs Det

the V

discussed NP

N women Det

The

(10)

The challenge of language understanding

• How do humans communicate so well with language?

And how can we get machines to do the same?

Environmental noise

Incomplete knowledge of one’s interlocutors

Memory Limitations

S

VP

NP

PP on the beach N

dogs Det

the V

discussed

NP N women Det

The

Ambiguity

S

VP

PP on the beach NP

N dogs Det

the V

discussed NP

N women Det

The

(11)

Amazing things humans do with language

(12)

Amazing things humans do with language

The brightly milted porcupine daxed a dinner party-ready nest out of temble.

(13)

Amazing things humans do with language

The brightly milted porcupine daxed a dinner party-ready nest out of temble.

Colorless green ideas sleep furiously. —Chomsky, 1957

(14)

Amazing things humans do with language

The brightly milted porcupine daxed a dinner party-ready nest out of temble.

Colorless green ideas sleep furiously. —Chomsky, 1957

*Furiously sleep ideas green colorless.

(15)

Amazing things humans do with language

The brightly milted porcupine daxed a dinner party-ready nest out of temble.

Colorless green ideas sleep furiously. —Chomsky, 1957

*Furiously sleep ideas green colorless.

(16)

Amazing things humans do with language

The brightly milted porcupine daxed a dinner party-ready nest out of temble.

Colorless green ideas sleep furiously. —Chomsky, 1957

*Furiously sleep ideas green colorless.

The teacher spoon-fed me the example.

(17)

Amazing things humans do with language

The brightly milted porcupine daxed a dinner party-ready nest out of temble.

Colorless green ideas sleep furiously. —Chomsky, 1957

*Furiously sleep ideas green colorless.

The teacher spoon-fed me the example.

(18)

Amazing things humans do with language

(19)

Amazing things humans do with language

Point to the frog on the left.

(20)

Amazing things humans do with language

Point to the frog on the left.

(21)

Amazing things humans do with language

Point to the rabbit.

Point to the frog on the left.

(22)

Amazing things humans do with language

Point to the rabbit.

Point to the frog on the left.

(23)

Amazing things humans do with language

Point to the rabbit.

Point to the box.

Point to the frog on the left.

(24)

Amazing things humans do with language

Point to the rabbit.

Point to the box.

Point to the frog on the left.

(25)

Amazing things humans do with language

Point to the rabbit.

Point to the box.

Is the rabbit in the box?

Point to the frog on the left.

(26)

Amazing things humans do with language

Point to the rabbit.

Point to the box.

Is the rabbit in the box?

Point to the frog on the left.

(27)

Amazing things humans do with language

Point to the rabbit.

Point to the box.

Is the rabbit in the box?

Point to the rabbit in the box.

Point to the frog on the left.

(28)

Amazing things humans do with language

Point to the rabbit.

Point to the box.

Is the rabbit in the box?

Point to the rabbit in the box.

Point to the frog on the left.

(29)

Vignettes

• Unknown words and pragmatic inference

• The nature of semantic scales and comparatives

• Syntax & inferring comparison classes for semantic scales

• Putting it all together: Complex descriptions and pragmatic inference in context

(30)

Vignettes

• Unknown words and pragmatic inference

• The nature of semantic scales and comparatives

• Syntax & inferring comparison classes for semantic scales

• Putting it all together: Complex descriptions and pragmatic inference in context

(31)

• Carey & Bartlett (1978) "fast mapping"

• Even after a single exposure, there is some learning (a

Unknown words and pragmatic inference

You see those two trays over there. Bring me the chromium one. Not the red one, the

chromium one.

Teacher 3–4 year old child

(32)

Unspoken alternatives in pragmatic inference

(33)

Unspoken alternatives in pragmatic inference

Bob says “hat”

(34)

Unspoken alternatives in pragmatic inference

Bob says “hat”

(35)

Unspoken alternatives in pragmatic inference

Bob says “hat”

Alternative:

“scarf”

(36)

Unspoken alternatives in pragmatic inference

Bob says “hat”

Alternative:

“scarf”

(37)

Unspoken alternatives in pragmatic inference

Bob says “hat”

Alternative:

“scarf”

(38)

Unspoken alternatives in pragmatic inference

Bob says “hat”

Alternative:

“scarf”

Scalar implicature (SI)

(39)

Scalar implicature: traditional account

1 2 3

(40)

Scalar implicature: traditional account

Bob said hat – either snowman 2 or 3 could be possible

1 2 3

(41)

Scalar implicature: traditional account

Bob said hat – either snowman 2 or 3 could be possible

But scarf is an alternative* to hat

1 2 3

(42)

Scalar implicature: traditional account

Bob said hat – either snowman 2 or 3 could be possible

But scarf is an alternative* to hat

If Bob had said scarf, I would have chosen snowman 3 – and Bob knows that, and I know that Bob knows that, and Bob knows that I know that Bob knows that, and...

1 2 3

(43)

Scalar implicature: traditional account

Bob said hat – either snowman 2 or 3 could be possible

But scarf is an alternative* to hat

If Bob had said scarf, I would have chosen snowman 3 – and Bob knows that, and I know that Bob knows that, and Bob knows that I know that Bob knows that, and...

1 2 3

(44)

Scalar implicature: traditional account

Bob said hat – either snowman 2 or 3 could be possible

But scarf is an alternative* to hat

If Bob had said scarf, I would have chosen snowman 3 – and Bob knows that, and I know that Bob knows that, and Bob knows that I know that Bob knows that, and...

• Thus I can conclude that Bob didn't mean snowman 3

• I should conclude that Bob meant snowman 2

1 2 3

(45)

Can unknown words drive scalar inference?

Bob says “hat”

(46)

Can unknown words drive scalar inference?

Bob says “hat”

(47)

Can unknown words drive scalar inference?

Bob says “hat”

(48)

Can unknown words drive scalar inference?

Bob says “hat”

Alternative:

???

(49)

Does the traditional account go through?

Bob said hat – either snowman 2 or 3 could be possible

But scarf is an alternative* to hat

If Bob had said scarf, I would have chosen snowman 3 – and Bob knows that, and I know that Bob knows that, and Bob knows that I know that Bob knows that, and..

1 2 3

(50)

Does the traditional account go through?

1 2 3

Bob said hat – either snowman 2 or 3 could be possible

But ???? is an alternative* to hat

If Bob had said ????, I would have chosen snowman 3 – and Bob knows that, and I know that Bob knows that, and Bob knows that I know that Bob knows that, and...

• Thus I can conclude that Bob didn't mean snowman 3

• I should conclude that Bob meant snowman 2

(51)

Does the traditional account go through?

1 2 3

Bob said hat – either snowman 2 or 3 could be possible

But ???? is an alternative* to hat

If Bob had said ????, I would have chosen snowman 3 – and Bob knows that, and I know that Bob knows that, and Bob knows that I know that Bob knows that, and...

(52)

Does the traditional account go through?

1 2 3

Bob said hat – either snowman 2 or 3 could be possible

But ???? is an alternative* to hat

If Bob had said ????, I would have chosen snowman 3 – and Bob knows that, and I know that Bob knows that, and Bob knows that I know that Bob knows that, and...

• Thus I can conclude that Bob didn't mean snowman 3

• I should conclude that Bob meant snowman 2

(53)

Potential accounts

(54)

Potential accounts

If the label for a certain feature is not in

common ground, then it might not enter the

computations

underlying SI.

(55)

Potential accounts

If the label for a certain feature is not in

common ground, then it might not enter the

computations underlying SI.

NO alternative

(56)

Potential accounts

If the label for a certain feature is not in

common ground, then it might not enter the

computations underlying SI.

If speakers generate and use new lexical entries from one exposure, then nonce objects may drive

SI like familiar objects.

NO alternative

(57)

Potential accounts

If the label for a certain feature is not in

common ground, then it might not enter the

computations underlying SI.

If speakers generate and use new lexical entries from one exposure, then nonce objects may drive

SI like familiar objects.

Novel alternative NO

alternative

(58)
(59)

Do nonce objects drive scalar implicature?

(60)

Do nonce objects drive scalar implicature?

Condition 1:

Familiar feature

(61)

Do nonce objects drive scalar implicature?

Condition 1:

Familiar feature

Condition 2:

Nonce feature

(62)

Do nonce objects drive SI?

(63)

Do nonce objects drive SI?

(64)

Do nonce objects drive SI?

(65)

Do nonce objects drive SI?

Hypothesis 1 No. There is no alternative label in

common ground.

(66)

Do nonce objects drive SI?

Hypothesis 1 No. There is no alternative label in

common ground.

(67)

Do nonce objects drive SI?

Hypothesis 2 Yes. People can reason about novel

lexical entries.

Hypothesis 1 No. There is no alternative label in

common ground.

(68)

Do nonce objects drive SI?

Hypothesis 2 Yes. People can reason about novel

lexical entries.

Hypothesis 1 No. There is no alternative label in

common ground.

(69)

Do nonce objects drive SI?

Hypothesis 2 Yes. People can reason about novel

lexical entries.

Hypothesis 1 No. There is no alternative label in

common ground.

Results

(70)

Do nonce objects drive SI?

Hypothesis 2 Yes. People can reason about novel

lexical entries.

Hypothesis 1 No. There is no alternative label in

common ground.

Results

(71)

Vignettes

• Unknown words and pragmatic inference

• The nature of semantic scales and comparatives

• Syntax & inferring comparison classes for semantic scales

• Putting it all together: Complex descriptions and pragmatic inference in context

(72)

Degree semantics for scalar adjectives

Mary is tall

(73)

Degree semantics for scalar adjectives

The meaning of a scalar adjective like big or tall does two things:

Mary is tall

(74)

Degree semantics for scalar adjectives

The meaning of a scalar adjective like big or tall does two things:

1. Projects a referent onto some value on a scale

Mary is tall

(75)

Degree semantics for scalar adjectives

The meaning of a scalar adjective like big or tall does two things:

1. Projects a referent onto some value on a scale

Mary

Mary is tall

(76)

Degree semantics for scalar adjectives

The meaning of a scalar adjective like big or tall does two things:

1. Projects a referent onto some value on a scale

Mary

height

Mary is tall

(77)

Degree semantics for scalar adjectives

The meaning of a scalar adjective like big or tall does two things:

1. Projects a referent onto some value on a scale

Mary

height(⟦Mary⟧)

Mary is tall

(78)

Degree semantics for scalar adjectives

The meaning of a scalar adjective like big or tall does two things:

1. Projects a referent onto some value on a scale 2. Predicates that that value is greater than some

threshold θ

Mary

height

height(⟦Mary⟧)

Mary is tall

(79)

Degree semantics for scalar adjectives

The meaning of a scalar adjective like big or tall does two things:

1. Projects a referent onto some value on a scale 2. Predicates that that value is greater than some

threshold θ

Mary

height(⟦Mary⟧)

Mary is tall

(80)

Degree semantics for scalar adjectives

The meaning of a scalar adjective like big or tall does two things:

1. Projects a referent onto some value on a scale 2. Predicates that that value is greater than some

threshold θ

Mary

height

height(⟦Mary⟧)

θ

Mary is tall TRUE

(81)

Degree semantics for scalar adjectives

The meaning of a scalar adjective like big or tall does two things:

1. Projects a referent onto some value on a scale 2. Predicates that that value is greater than some

threshold θ

Mary

height(⟦Mary⟧)

Mary is tall

(82)

Degree semantics for scalar adjectives

The meaning of a scalar adjective like big or tall does two things:

1. Projects a referent onto some value on a scale 2. Predicates that that value is greater than some

threshold θ

Mary

height

height(⟦Mary⟧)

θ

Mary is tall

(83)

Degree semantics for scalar adjectives

The meaning of a scalar adjective like big or tall does two things:

1. Projects a referent onto some value on a scale 2. Predicates that that value is greater than some

threshold θ

Mary

height(⟦Mary⟧)

Mary is tall FALSE

(84)

Degree semantics for scalar adjectives

The meaning of a scalar adjective like big or tall does two things:

1. Projects a referent onto some value on a scale 2. Predicates that that value is greater than some

threshold θ

Mary

height

height(⟦Mary⟧)

Mary is tall

(85)

Degree semantics for scalar adjectives

The meaning of a scalar adjective like big or tall does two things:

1. Projects a referent onto some value on a scale 2. Predicates that that value is greater than some

threshold θ

Mary

Mary is tall

(86)

Degree semantics for scalar adjectives

The meaning of a scalar adjective like big or tall does two things:

1. Projects a referent onto some value on a scale 2. Predicates that that value is greater than some

threshold θ

Mary

height

height(⟦Mary⟧)

θ

Mary is tall

(87)

Degree semantics for scalar adjectives

The meaning of a scalar adjective like big or tall does two things:

1. Projects a referent onto some value on a scale 2. Predicates that that value is greater than some

threshold θ

Mary

height(⟦Mary⟧)

Mary is tall FALSE

(88)

Degree semantics also underlie comparatives

(89)

Degree semantics also underlie comparatives

The left circle is bigger than

the right circle.

(90)

Degree semantics also underlie comparatives

Point to the bigger circle.

The left circle is bigger than

the right circle.

(91)

Degree semantics also underlie comparatives

Point to the bigger circle.

The left circle is bigger than the right circle.

What exactly does this mean?!?

(92)

Two theories of the comparative

1. "bigger" requires that there are two referents in the context of the comparison class

the bigger circle the biggest circle

(93)

Two theories of the comparative

1. "bigger" requires that there are two referents in the context of the comparison class

the bigger circle

the biggest circle

😀

(94)

Two theories of the comparative

1. "bigger" requires that there are two referents in the context of the comparison class

the bigger circle

the biggest circle

😀

🙁

(95)

Corpus data

comparative superlative

0 5000 10000 15000

count

big large long short small tall thin wide

the [Adj] of [Num]

(96)

Corpus data

comparative superlative

2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 0

5000 10000 15000

cardinality

count

big large long short small tall thin wide

the [Adj] of [Num]

(97)

Corpus data

comparative superlative

0 5000 10000 15000

count

big large long short small tall thin wide

the [Adj] of [Num]

(98)

A revealing example from the wild

https://www.livestrong.com/article/550451-is-it-better-to-work-the-back-with-biceps-or-triceps/

Another muscle group to consider pairing your back workout with is the triceps. "Back and triceps workouts are a great way to ensure that you get

indirect workload on the biceps, but get the direct work on the triceps while still working on the bigger of the three muscles — the back," explains

Carneiro.

(99)

Two theories of the comparative

the bigger circle

the biggest circle

😀

🙁

(100)

Two theories of the comparative

1. "bigger" requires that there are two referents in the context of the comparison class

the bigger circle

the biggest circle

😀

🙁

(101)

Two theories of the comparative

1. "bigger" requires that there are two referents in the context of the comparison class

the bigger circle

the biggest circle

😀

🙁

the bigger circle

the biggest circle

(102)

Two theories of the comparative

1. "bigger" requires that there are two referents in the context of the comparison class

the bigger circle

the biggest circle

😀

🙁

the bigger circle

the biggest circle

😀

(103)

Two theories of the comparative

1. "bigger" requires that there are two referents in the context of the comparison class

the bigger circle

the biggest circle

😀

🙁

the bigger circle

the biggest circle

😀

🤔

(104)

Two theories of the comparative

1. "bigger" requires that there are two referents in the context of the comparison class

2. "bigger" requires that there are two granularities in the context of the comparison class

the bigger circle

the biggest circle

😀

🙁

the bigger circle

the biggest circle

😀

🤔

(105)

Theory of granularity inference

(106)

Theory of granularity inference

(107)

Experiment results

coarse progressive

2 3 4 2 3 4

−1.5

−1.0

−0.5 0.0 0.5 1.0

z−score

* *

*

* **

(108)

Experiment results

coarse progressive

2 3 4 2 3 4

−1.5

−1.0

−0.5 0.0 0.5 1.0

cardinality

z−score

comparative superlative

* *

*

* **

(109)

Vignettes

• Unknown words and pragmatic inference

• The nature of semantic scales and comparatives

• Syntax & inferring comparison classes for semantic scales

• Putting it all together: Complex descriptions and pragmatic inference in context

(110)

What else the degree semantics doesn’t say

(111)

What else the degree semantics doesn’t say

• How do we know the comparison class?

(112)

What else the degree semantics doesn’t say

• How do we know the comparison class?

• How does tall elephant turn out to mean something different from tall mouse?

(113)

What else the degree semantics doesn’t say

• How do we know the comparison class?

• How does tall elephant turn out to mean something different from tall mouse?

• How can the same individual be evaluated as either tall or not tall in different contexts?

(114)

What else the degree semantics doesn’t say

• How do we know the comparison class?

• How does tall elephant turn out to mean something different from tall mouse?

• How can the same individual be evaluated as either tall or not tall in different contexts?

(115)

What else the degree semantics doesn’t say

• How do we know the comparison class?

• How does tall elephant turn out to mean something different from tall mouse?

• How can the same individual be evaluated as either tall or not tall in different contexts?

Stephen Curry is tall.

(116)

What else the degree semantics doesn’t say

• How do we know the comparison class?

• How does tall elephant turn out to mean something different from tall mouse?

• How can the same individual be evaluated as either tall or not tall in different contexts?

Stephen Curry is tall.

(117)

What else the degree semantics doesn’t say

• How do we know the comparison class?

• How does tall elephant turn out to mean something different from tall mouse?

• How can the same individual be evaluated as either tall or not tall in different contexts?

Stephen Curry is tall.

(118)

What else the degree semantics doesn’t say

• How do we know the comparison class?

• How does tall elephant turn out to mean something different from tall mouse?

• How can the same individual be evaluated as either tall or not tall in different contexts?

Stephen Curry is tall.

(Stephen Curry is 6’2”; this is the 12th

Stephen Curry is a tall basketball player.

(119)

Your friend says: That’s a big great dane.

You and your friend see the following:

Your friend runs far ahead of you, and you see him in the distance:

What do you think your friend meant? It is big relative to other ________.

Context Basic-Level
 Syntax Predicate NP


Noun Subordinate Category

(120)

Your friend says: That dog is big.

You and your friend see the following:

Your friend runs far ahead of you, and you see him in the distance:

What do you think your friend meant? It is big relative to other ________.

Context Subordinate-level
 Syntax Subject NP


Noun Basic-level Category

(121)

Results

Basic−Level Context Subordinate−Level Context

0.0 0.5 1.0

Proportion of basiclevel responses

NP

basic

"one"

subordinate

(122)

Vignettes

• Unknown words and pragmatic inference

• The nature of semantic scales and comparatives

• Syntax & inferring comparison classes for semantic scales

• Putting it all together: Complex descriptions and pragmatic inference in context

(123)

Definites

Point at the bag

(124)

Definites

Point at the bag

which one?!

32

(125)

Definites

Point at the bag

Property of definites: require a unique referent

(126)

Definites

Point at the bag

34

Request violates a cooperative norm for communication rooted in the semantics of the definite determiner

(127)

Definites

Point at the bag

Reference Failure

(128)

Haddock Descriptions

36

The rabbit in the bag

Haddock (1987)

(129)

Haddock Descriptions

The rabbit in [the bag]

(130)

Haddock Descriptions

[The rabbit in [the bag]]

38 Haddock (1987)

(131)

Haddock Descriptions

The rabbit in the bag

(132)

Puzzle

40

The rabbit in the bag

(133)

Puzzle

Why doesn’t the embedded definite fail to refer?

(134)

A semantic account for Haddock descriptions

(135)

Click on the rabbit in the big [***]

Experiment: threshold uncertainty condition

(136)

Click on the rabbit in the big [***]

Experiment: threshold uncertainty condition

Definitely a big box!

(137)

Click on the rabbit in the big [***]

Experiment: threshold uncertainty condition

Definitely a big box!

(138)

Click on the rabbit in the big [***]

Experiment: threshold uncertainty condition

Definitely a big box!

Not necessarily a big bag!

(139)

Click on the rabbit in the big [***]

Experiment: informativity violation condition

(140)

Click on the rabbit in the big [***]

Experiment: informativity violation condition

Wouldn't have needed to say "big"!

(141)

Click on the rabbit in the big [***]

Experiment: informativity violation condition

Wouldn't have needed to say "big"!

(142)

Click on the rabbit in the big [***]

Threshold uncertainty and informativity violation

(143)

Click on the rabbit in the big [***]

Threshold uncertainty and informativity violation

(144)

Modeling with Bayesian pragmatics

(145)

Modeling with Bayesian pragmatics

(146)

Modeling with Bayesian pragmatics

Listener

(147)

Model derives qualitative patterns of human response

(148)

Vignettes

• Unknown words and pragmatic inference

• The nature of semantic scales and comparatives

• Syntax & inferring comparison classes for semantic scales

• Putting it all together: Complex descriptions and pragmatic inference in context

(149)

Zeroing in on truly human-like language

Theory of linguistic knowledge

Language Datasets

Computational Models

Psychological Experimentation Human

Language

(150)

Collaborators

Helena Aparicio Curtis Chen Elizabeth Coppock

(151)

Thank you for listening!

References

Related documents

The proposed rule would require the configuration log to contain the information under API 21.1.5.4 (meter identifier, date and time collected, contract hour, atmospheric pressure

Turkish Standard Formwork Striking Time Cement Type Sides of beams, walls and columns Slabs Props of beams and large opening slabs forms Normal Portland Cement.. 3 days

Gona has yielded important information on several critical time periods in human evolution, including the first stone tools in the world, and now the new pelvis discovery providing

Occupational burnout and its causes among practicing nurses in hospitals affiliated to Babol University of Medical Sciences.. Journal of Babol University of

DYNAPOL® UB systems are suitable for the manufacture of industrial coatings, particularly coil coatings with high formability, good surface hardness and

sed as controls incubated at 24±1°c under photoperiod cycle of 16/8 hr light /dark. Nodal plantlets of 10mm size were used as controls incubated at 24±1°c under photoperiod cycle

psychology/counselling; African Caribbeans and mental health; mixed methods in psychology; qualitative research; diversity and Counselling/Clinical Psychology; Counselling