Semantics, Pragmatics, and Context in Grounded Human Language Understanding

(1)

Semantics, Pragmatics, and Context in Grounded Human Language Understanding

Roger Levy

Computational Psycholinguistics Laboratory (CPL) Dept. of Brain & Cognitive Sciences

Massachusetts Institute of Technology

(2)

Collaborators

Helena Aparicio Curtis Chen Elizabeth Coppock

(3)

The challenge of language understanding

(4)

The challenge of language understanding

• How do humans communicate so well with language?

(5)

The challenge of language understanding

• How do humans communicate so well with language?

(6)

The challenge of language understanding

• How do humans communicate so well with language?

S

VP

NP

PP on the beach N

dogs Det

the V

discussed

NP N women Det

The

Ambiguity

S

VP

PP on the beach NP

N dogs Det

the V

discussed NP

N women Det

The

(7)

The challenge of language understanding

• How do humans communicate so well with language?

Environmental noise

S

VP

NP

PP on the beach N

dogs Det

the V

discussed

NP N women Det

The

Ambiguity

S

VP

PP on the beach NP

N dogs Det

the V

discussed NP

N women Det

The

(8)

The challenge of language understanding

• How do humans communicate so well with language?

Memory Limitations

S

VP

NP

PP on the beach N

dogs Det

the V

discussed

NP N women Det

The

Ambiguity

S

VP

PP on the beach NP

N dogs Det

the V

discussed NP

N women Det

The

(9)

The challenge of language understanding

• How do humans communicate so well with language?

Incomplete knowledge of one’s interlocutors

S

VP

NP

PP on the beach N

dogs Det

the V

discussed

NP N women Det

The

Ambiguity

S

VP

PP on the beach NP

N dogs Det

the V

discussed NP

N women Det

The

(10)

The challenge of language understanding

• How do humans communicate so well with language?

And how can we get machines to do the same?

Incomplete knowledge of one’s interlocutors

S

VP

NP

PP on the beach N

dogs Det

the V

discussed

NP N women Det

The

Ambiguity

S

VP

PP on the beach NP

N dogs Det

the V

discussed NP

N women Det

The

(11)

Amazing things humans do with language

(12)

Amazing things humans do with language

The brightly milted porcupine daxed a dinner party-ready nest out of temble.

(13)

Amazing things humans do with language

Colorless green ideas sleep furiously. —Chomsky, 1957

(14)

Amazing things humans do with language

*Furiously sleep ideas green colorless.

(15)

Amazing things humans do with language

(16)

Amazing things humans do with language

The teacher spoon-fed me the example.

(17)

Amazing things humans do with language

The teacher spoon-fed me the example.

(18)

Amazing things humans do with language

(19)

Amazing things humans do with language

Point to the frog on the left.

(20)

Amazing things humans do with language

✔

(21)

Amazing things humans do with language

Point to the rabbit.

✔

(22)

Amazing things humans do with language

✘

✔

(23)

Amazing things humans do with language

Point to the box.

✘

✔

(24)

Amazing things humans do with language

Point to the box.

✘

✔

(25)

Amazing things humans do with language

Point to the box.

Is the rabbit in the box?

✘

✔

(26)

Amazing things humans do with language

Point to the box.

✘

✔

(27)

Amazing things humans do with language

Point to the box.

Point to the rabbit in the box.

✘

✔

(28)

Amazing things humans do with language

Point to the box.

Point to the rabbit in the box.

✘

✔

(29)

Vignettes

• Unknown words and pragmatic inference

• The nature of semantic scales and comparatives

• Syntax & inferring comparison classes for semantic scales

• Putting it all together: Complex descriptions and pragmatic inference in context

(30)

Vignettes

(31)

• Carey & Bartlett (1978) "fast mapping"

• Even after a single exposure, there is some learning (a

Unknown words and pragmatic inference

You see those two trays over there. Bring me the chromium one. Not the red one, the

chromium one.

Teacher 3–4 year old child

(32)

Unspoken alternatives in pragmatic inference

(33)

Unspoken alternatives in pragmatic inference

Bob says “hat”

(34)

Unspoken alternatives in pragmatic inference

Bob says “hat”

(35)

Unspoken alternatives in pragmatic inference

Bob says “hat”

Alternative:

“scarf”

(36)

Unspoken alternatives in pragmatic inference

Bob says “hat”

Alternative:

“scarf”

(37)

Unspoken alternatives in pragmatic inference

Bob says “hat”

Alternative:

“scarf”

(38)

Unspoken alternatives in pragmatic inference

Bob says “hat”

Alternative:

“scarf”

Scalar implicature (SI)

(39)

Scalar implicature: traditional account

1 2 3

(40)

Scalar implicature: traditional account

• Bob said hat – either snowman 2 or 3 could be possible

1 2 3

(41)

Scalar implicature: traditional account

• But scarf is an alternative* to hat

1 2 3

(42)

Scalar implicature: traditional account

• If Bob had said scarf, I would have chosen snowman 3 – and Bob knows that, and I know that Bob knows that, and Bob knows that I know that Bob knows that, and...

1 2 3

(43)

Scalar implicature: traditional account

1 2 3

(44)

Scalar implicature: traditional account

• Thus I can conclude that Bob didn't mean snowman 3

• I should conclude that Bob meant snowman 2

1 2 3

(45)

Can unknown words drive scalar inference?

Bob says “hat”

(46)

Can unknown words drive scalar inference?

Bob says “hat”

(47)

Can unknown words drive scalar inference?

Bob says “hat”

(48)

Can unknown words drive scalar inference?

Bob says “hat”

Alternative:

???

(49)

Does the traditional account go through?

• If Bob had said scarf, I would have chosen snowman 3 – and Bob knows that, and I know that Bob knows that, and Bob knows that I know that Bob knows that, and..

1 2 3

(50)

Does the traditional account go through?

1 2 3

• But ???? is an alternative* to hat

• If Bob had said ????, I would have chosen snowman 3 – and Bob knows that, and I know that Bob knows that, and Bob knows that I know that Bob knows that, and...

(51)

Does the traditional account go through?

1 2 3

(52)

Does the traditional account go through?

1 2 3

(53)

Potential accounts

(54)

Potential accounts

If the label for a certain feature is not in

common ground, then it might not enter the

computations

underlying SI.

(55)

Potential accounts

If the label for a certain feature is not in

common ground, then it might not enter the

computations underlying SI.

NO alternative

(56)

Potential accounts

If the label for a certain feature is not in

common ground, then it might not enter the

computations underlying SI.

If speakers generate and use new lexical entries from one exposure, then nonce objects may drive

SI like familiar objects.

NO alternative

(57)

Potential accounts

If the label for a certain feature is not in

common ground, then it might not enter the

computations underlying SI.

If speakers generate and use new lexical entries from one exposure, then nonce objects may drive

SI like familiar objects.

Novel alternative NO

alternative

(58)

(59)

Do nonce objects drive scalar implicature?

(60)

Condition 1:

Familiar feature

(61)

Condition 1:

Familiar feature

Condition 2:

Nonce feature

(62)

Do nonce objects drive SI?

(63)

(64)

(65)

Hypothesis 1 No. There is no alternative label in

common ground.

(66)

common ground.

(67)

Hypothesis 2 Yes. People can reason about novel

lexical entries.

common ground.

(68)

lexical entries.

common ground.

(69)

lexical entries.

common ground.

Results

(70)

lexical entries.

common ground.

Results

(71)

Vignettes

(72)

Degree semantics for scalar adjectives

Mary is tall

(73)

Degree semantics for scalar adjectives

• The meaning of a scalar adjective like big or tall does two things:

Mary is tall

(74)

Degree semantics for scalar adjectives

1. Projects a referent onto some value on a scale

Mary is tall

(75)

Degree semantics for scalar adjectives

Mary

Mary is tall

(76)

Degree semantics for scalar adjectives

Mary

height

Mary is tall

(77)

Degree semantics for scalar adjectives

Mary

height(⟦Mary⟧)

Mary is tall

(78)

Degree semantics for scalar adjectives

1. Projects a referent onto some value on a scale 2. Predicates that that value is greater than some

threshold θ

Mary

height

height(⟦Mary⟧)

Mary is tall

(79)

Degree semantics for scalar adjectives

threshold θ

Mary

height(⟦Mary⟧)

Mary is tall

(80)

Degree semantics for scalar adjectives

threshold θ

Mary

height

height(⟦Mary⟧)

θ

Mary is tall TRUE

(81)

Degree semantics for scalar adjectives

threshold θ

Mary

height(⟦Mary⟧)

Mary is tall

(82)

Degree semantics for scalar adjectives

threshold θ

Mary

height

height(⟦Mary⟧)

θ

Mary is tall

(83)

Degree semantics for scalar adjectives

threshold θ

Mary

height(⟦Mary⟧)

Mary is tall FALSE

(84)

Degree semantics for scalar adjectives

threshold θ

Mary

height

height(⟦Mary⟧)

Mary is tall

(85)

Degree semantics for scalar adjectives

threshold θ

Mary

Mary is tall

(86)

Degree semantics for scalar adjectives

threshold θ

Mary

height

height(⟦Mary⟧)

θ

Mary is tall

(87)

Degree semantics for scalar adjectives

threshold θ

Mary

height(⟦Mary⟧)

Mary is tall FALSE

(88)

Degree semantics also underlie comparatives

(89)

Degree semantics also underlie comparatives

The left circle is bigger than

the right circle.

(90)

Degree semantics also underlie comparatives

Point to the bigger circle.

The left circle is bigger than

the right circle.

(91)

Degree semantics also underlie comparatives

Point to the bigger circle.

The left circle is bigger than the right circle.

What exactly does this mean?!?

(92)

Two theories of the comparative

1. "bigger" requires that there are two referents in the context of the comparison class

the bigger circle the biggest circle

(93)

Two theories of the comparative

the bigger circle

the biggest circle

😀

(94)

Two theories of the comparative

the bigger circle

😀

🙁

(95)

Corpus data

comparative superlative

0 5000 10000 15000

count

big large long short small tall thin wide

the [Adj] of [Num]

(96)

Corpus data

2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 0

5000 10000 15000

cardinality

count

the [Adj] of [Num]

(97)

Corpus data

0 5000 10000 15000

count

the [Adj] of [Num]

(98)

A revealing example from the wild

https://www.livestrong.com/article/550451-is-it-better-to-work-the-back-with-biceps-or-triceps/

Another muscle group to consider pairing your back workout with is the triceps. "Back and triceps workouts are a great way to ensure that you get

indirect workload on the biceps, but get the direct work on the triceps while still working on the bigger of the three muscles — the back," explains

Carneiro.

(99)

Two theories of the comparative

the bigger circle

😀

🙁

(100)

Two theories of the comparative

the bigger circle

😀

🙁

(101)

Two theories of the comparative

the bigger circle

😀

🙁

the bigger circle

(102)

Two theories of the comparative

the bigger circle

😀

🙁

the bigger circle

😀

(103)

Two theories of the comparative

the bigger circle

😀

🙁

the bigger circle

😀

🤔

(104)

Two theories of the comparative

2. "bigger" requires that there are two granularities in the context of the comparison class

the bigger circle

😀

🙁

the bigger circle

😀

🤔

(105)

Theory of granularity inference

(106)

Theory of granularity inference

(107)

Experiment results

coarse progressive

2 3 4 2 3 4

−1.5

−1.0

−0.5 0.0 0.5 1.0

z−score

* *

*

* **

(108)

Experiment results

coarse progressive

2 3 4 2 3 4

−1.5

−1.0

−0.5 0.0 0.5 1.0

cardinality

z−score

* *

*

* **

(109)

Vignettes

(110)

What else the degree semantics doesn’t say

(111)

What else the degree semantics doesn’t say

• How do we know the comparison class?

(112)

What else the degree semantics doesn’t say

• How does tall elephant turn out to mean something different from tall mouse?

(113)

What else the degree semantics doesn’t say

• How can the same individual be evaluated as either tall or not tall in different contexts?

(114)

What else the degree semantics doesn’t say

(115)

What else the degree semantics doesn’t say

Stephen Curry is tall.

(116)

What else the degree semantics doesn’t say

(117)

What else the degree semantics doesn’t say

(118)

What else the degree semantics doesn’t say

(Stephen Curry is 6’2”; this is the 12th

Stephen Curry is a tall basketball player.

(119)

Your friend says: That’s a big great dane.

You and your friend see the following:

Your friend runs far ahead of you, and you see him in the distance:

What do you think your friend meant? It is big relative to other ________.

Context Basic-Level  Syntax Predicate NP 

Noun Subordinate Category

(120)

Your friend says: That dog is big.

You and your friend see the following:

Your friend runs far ahead of you, and you see him in the distance:

What do you think your friend meant? It is big relative to other ________.

Context Subordinate-level  Syntax Subject NP 

Noun Basic-level Category

(121)

Results

Basic−Level Context Subordinate−Level Context

0.0 0.5 1.0

Proportion of basic−level responses

NP

basic

"one"

subordinate

(122)

Vignettes

(123)

Definites

Point at the bag

(124)

Definites

Point at the bag

which one?!

32

(125)

Definites

Point at the bag

Property of definites: require a unique referent

(126)

Definites

Point at the bag

34

Request violates a cooperative norm for communication rooted in the semantics of the definite determiner

(127)

Definites

Point at the bag

Reference Failure

(128)

Haddock Descriptions

36

The rabbit in the bag

Haddock (1987)

(129)

Haddock Descriptions

The rabbit in [the bag]

(130)

Haddock Descriptions

[The rabbit in [the bag]]

38 Haddock (1987)

(131)

Haddock Descriptions

The rabbit in the bag

(132)

Puzzle

40

The rabbit in the bag

(133)

Puzzle

Why doesn’t the embedded definite fail to refer?

(134)

A semantic account for Haddock descriptions

(135)

Click on the rabbit in the big [*]**

Experiment: threshold uncertainty condition

(136)

Click on the rabbit in the big [*]**

Experiment: threshold uncertainty condition

Definitely a big box!

(137)

Click on the rabbit in the big [*]**

Experiment: threshold uncertainty condition

(138)

Click on the rabbit in the big [*]**

Experiment: threshold uncertainty condition

Not necessarily a big bag!

(139)

Click on the rabbit in the big [*]**

Experiment: informativity violation condition

(140)

Click on the rabbit in the big [*]**

Experiment: informativity violation condition

Wouldn't have needed to say "big"!

(141)

Click on the rabbit in the big [*]**

Experiment: informativity violation condition

Wouldn't have needed to say "big"!

(142)

Click on the rabbit in the big [*]**

Threshold uncertainty and informativity violation

(143)

Click on the rabbit in the big [*]**

Threshold uncertainty and informativity violation

(144)

Modeling with Bayesian pragmatics

(145)

Modeling with Bayesian pragmatics

(146)

Modeling with Bayesian pragmatics

Listener

(147)

Model derives qualitative patterns of human response

(148)

Vignettes

(149)

Zeroing in on truly human-like language

Theory of linguistic knowledge

Language Datasets

Computational Models

Psychological Experimentation Human

Language

(150)

Collaborators

Helena Aparicio Curtis Chen Elizabeth Coppock

(151)