American
Journal
of Computational Linguistics
Miaofiche 36 : 26How
DOES ASYSTEM
KNOW
WHEN
TOSTOP
INFERENCING?*
The Moore S c h o o l o f E l e c t r i c a l E n g i n e e r i n g
U n i v e r s i t y o f P e n n s y l v a n i a , P h i l a d e l p h i a 1 9 1 7 4
Abstract The problem
of
constmining t h eset of
hfemtces added t o a set of beliefs is considered. One method, based on finding a minimal unifyingstructure, is
frresented and discussed. The methodis
meant t o pnxrideinternal
criteria f o r inference cut-off.I. Introduction
Natural language processing systems t h a t
are
sensitive t o the semanticand logical content of processed sentences and to t h e p ~ g l r a t i c s
of
t h e i r use generally d r a w inferences. A set of fonmilas representing the meaning of asentence and the 'state of b e l i e f t of the system is augnented by other related
formulas (the inferences)
which
are retrieved and/or constructed during t h epmcessing. The problem t o be investigated here is:
How
can thi$ process becontmlled? Can reasonable criteria be found f o r restraining the addition of
inferences?
Top-down inferences
fol.luwing
from the meaning of l e x i c a litems
(oftenexpressed by decomposition i n t o primitives) are clearly bounded, i f no
interactions are allowed amng the generated sub-formulas. This process
(which we c a l l EXPANSION) w i l l not be discussed here. Rather, w e shall be
concerned with SYNlESIS, i.e., the addition of new formulas based on t h e
*
This
work was p a r t i a l l y suppored by NSF- GrantSCC
72-05465A01.zxsence
-
of
already generated lower-level formulas, 'v~hich we shallcall
k l i e f s .
In
particular, weare
concerned with infererces addgd because a s e tcf beliefs
is
recognized as fitting a plre-defined pattern.The question we ask is: Given
an
initial set of beliefs o v m a set of;ri&ives,
-
w h a t ' c r i t e ~ i o ncan
be us& to M t the pmcess of pattern matcljngr-d associated inference addition? The major structural- feature t h a t we use
% w i d e such a c r i t e r i o n
is
a partial order over the set ofpatterns.
Before
pursuing t h i ssuggestion
any further, let us examme saned
-3s a i t ~ ~ n a t i v e ap-pmaches to infence and iiiCsrence
c~r-off.
To
logicians, deductive inferenceinvolves
r u l e s by which fanmilas can3e added to a set
(which
k i ~ a l l ycantains
the d o n s ) in c e r t a i n waysp v i d e d other formulas are dLready
in
the
set.In
general, t h i s sortof
infexence
is
quite open-endedin
that onecan
keep applying the rules ofLnXerence and
ccrme
up
w i t h m r e and a o ~ e f a m u l a ddl
of
which
represent' p v a b l e t statements. Xhe terminaticm criterion f o r
a
particular invocationof the m&ankmmi&t be the a p p e m c e of
an
'intestingt farmola or theloss
of interest
of the
infemcer, butin
generalthe
stdement of the rulesof inference says n o h i n g about when to cease deriving fbmulas.
This
para-
from logic has been carriedover
into
k t i f i c b l 3telligence qmtenrs, wherethe
issueof
terndnaticm
is very real. Theusual
solution
has been to invoke theinferencer
under the very strictcontrol
of
a supervising pgran w*&- has itsown
g m l s progmnmd inw h i c h
mkes c-ain that appropriate criteria are applied to hdltthe
inferenchg.
This is
=st apparentin
systems writtenin
PLANNER-like
languages a c h has use~~pmgrammble me&anisms for conbmlling the p l ~ g f
In
the
work
uf
Schankand
Riege, (Sch,75)
(Ri
,
7 4inference
hasnore
of
the
flavor
of
b eassociation;
inferences
are
conceive$
of
as expan&gs p h ~ s
in'inference
space.
'
'Ibtermination
strategies
areq l o y e d :
(1)the (iisoovery
of
a
chain
of
inferences
leading fr\mone
of
the i i i d lbehkfs
to
another
thmw
a shared formila,or
'contact
p i n tin
i n f m c espace,
and
(2)the
association
of
numerical
f s t n n g t h s t tof o m u k s
so
thata
line
of
inference
can
be
discontinued if
thestrength
falls
below a certainthIesbld.
Smtegy (2)
is
scmwihat unsatisfyiiin
viaw
of
t ! eprwLtW.
arbi-brsariness
and
attendant
difficulties
in
evaluating the
mle
of
parttcularnumerical constants
in
the
total. behavim o fa
cc~lplex systep. Theseosnstants,
presumbly, have
l i t t l e
to do
withthe
m-iical stru~tuceof
t?e
foninlinferace
s c h a ~ ~ ,and
assuch
we wouldcall
them ' e x t a m lcriteria.
'
Astmtegy
like
(1)
above,on
the other hard,
is nure
tintexmalt
andis
to bepfm.'
A
gwl
of
'the present mrljis
to
fcmnulate areasonable
internalcriterion
for
i n f m c ecut-off
tjhich canbe
stated fcmnally as partof
theinference
rule.
To do
this,
we & E L impose a s t m c h mon
the setof
patternsto
beused
in
inferencing, and the r u l efor
addinginferences
w i l l be fc~rmktedin
ternsof
.this s t r u t m e .The
operatibnsto
be de-ibedbelow
ateexp-
mrre
f u l l yin
(R,75),
where
a desmiptionof
a ccquter i q l e m ~ n t a t his
alsopresented.
1 A Partial Order
for
the Patterm Set!he
inference
rule we areaiming
fopis
t o depend on the-
r e t
of inputbeliefs
and
the-
set
0s
patterns. The notion we are trying t o fcmnalizeis
''What does this set of beliefs suggest with respect t o this s e t
af
patterns?"The particular
classof
inferences weare
concernedwith
are those gottenby matching beliefs i n the input
set
againsta
pattern and augmenting thebeliefs
with
additional p p o s i t i o n s as dictated by the pattern. We want t ofind
the least instarlcesof
patterns
which cover (include) the set of inputbeiiefs. W w i l l take e
as
inferences a l l pmpositions (an arbitrarynunber)
wW& a . entailed by t h a t instance of the pattern.
Put
another way, theinference
operaticsl i s to jump t o ~onclusions.However,
itis
cmlyt o
jump t o those conclusion required to make the resultingset an instance of ,the
-
least possible pattern in the pattern set.The
key concept hereis
'least' in t h a t thls iswhat
c m t m l show
manyinferences are added. What would be a suitable d e r i n g relation far patterns
and ~aopositional
beliefs?
One w h i c h naturally suggests i t s e l f and w h i c h i sc~rrartly unclex- investigation
relies Qn
the relationsof
instantiation andsubstitution instance
and (2) S
5,
S' if Sa
S t .< {q l,...
,%I,
where the p 'sCarbbing these
two,
we say that€q,.
. .
,p,)-
i
& p i
's are
~ ~ o p o s i t i o n a l forms, i f there i s a substitution, s, f o r thevariables of {pl,..
.
,p3
such that {s(pl),-. ,s(p1)
< Iq , e n m 9%)-n
n -1 1and let
Q
=
I
(HAPPYJOHN)
,
(GIVEMR.
JONES
JOHNTOY),
(PIIRE;YT MR.JONES
JOHN)).
Then
P
5
Q under
thesubstitution
?x+JOHN, ?y+MR. JWS.The
t l e s s - t h m ~ u a J . 'relation
is also
definedlor
pairs of
p a t t a m :h t P A M
=
{(P
?x ?y),$Q
?y ? a ) }and
l&
PAT-2
=
{iR
?u
?v ?w),(Q
?w ?v),Pdu
?w)1.
Clearly,
PH-1 f P-2under
thesub9citution
?x+?u,
?3*?~, ?B-?V.This defi#itfon
of
-
<is
quite s t d g h t f c r r d andczn
be made to aceorodateexpressions
wdth
embeddings
and
m a t eVariables.
(Theseare
includedin the
implaentation.
1
that
the
relation
<-
canbe
-thoughtof an
informition-axrtent
caparison;
if
S-
< S tthen
S tcontains
atleast
askmch 'infomatiant asS
(and pcwsibly mre) either by virtue
of
variables t ~ v i n g been replaced byparticular
constants
or
byadditional
farmuLas
having beenadded
tothe
set.Given
5
far
rehting
pairsof
belief
sets,pairs
of
patterns,
aa.
bedief-set/pttem
pairs, wecan
now
fundateme
belief-set-extending
III.
The
Infemnce Operation:SYNTHESIZE
Given a
setof
P
of
patterns andan
input setBel of
beliefs,
SYN!EESTZE
returns
a setI
of
instantiated
patterns
fma
P
such t h a tthe
following
-Weecaditions
a 3 lhold:
(2) (Pairwise inccnparability)
If
p,q rI
then(3E
(MinWity)mere
areno
other instancesr
of patternsin P
w h i c h are
not
ih
I
and yetwhich
are to some elementof
I
.cI
and
for which
Bil-
<r.
The el-ts of
I
=
SYHTHESSrEbl) represent possible r n u m d . b extensionsof
Bel;n
I
represents
clearextensions of
Bel, namely the supersetof
Be1.
stained
in
all
rmmmll.extansions.
Let
P
{ pl=
{(A?XI,
(B
?g),
(C
?XI),
p2
=
I
(B 7x1,
(C?XI,
(D
?XI),
p3
=
{(A?XI,
(B
?XI,
(C?XI,
(G
? X I }
1
Represented graphically:
If
input betBel
=
{(AJOHN),
(CJOHN)3
then
SYNEESIZE(Bel)
=
I I
(A JOHN),(B
JOHN), (CJOHN))
3.
There is
onlyone
possible ndnimdL extenshn;(B JOHN)
is
inferred.If
input set
B e l=
€(B
JOHN), (C JOHN))then sYN'mSIZE:(Bell
=
{f
(A JOHN),(B JOHN),
(CJOHN)
1
There are two possible
rmrundl
. .
extensions, but the set of clear extensionscontains no inferences beyond the input set, Bel. (Had pl and pq shared
another clause, however, an inference would have been added.)
If the input set Be1
=
€(G
JOHN),(B JOHN11
then
SYNTHESIZE(l3el)I
{ (G JOHN), (A JOHN),(B
JOHN),(C
JOHN)}I ,
Pattern pa
is the least
pattern which wheh instantiated covers the inputs,and there
are
two
inferxed pru,positions :(A JOHN) and
(C
JOHN).me d e s c r i p t i ~ n given here has been necessmily brief and incomplete
A
mre farnodl trea-t ofSYNTHESIZE
i n t a m s of lattice-themetic operationsis
given in (R,75) and i s miz zed in (JR,75). One additional technicalpoint should be made: It often happens t h a t f o r a given input s e t there are
no single patt- instances which cover a l l t h e inputs, though patterns
-
exist
m s e instances cover subsets ofthe
inputs.In
such a case we usean
extended SYNTHFLSIZE operationt~hich is definedin
the same s p i r i t a sSYWHESIZE. (See (R,75).)
Even w i t b u t the firU
f
d
treatment, several things shouldnow
beclear. F i r s t , the actual nunibex of inferences dra.. (propositions added) for
a
particular input set may be small o r large (depending on t h e inputs andthe pattern set,) but it is bounded in a p h c i p l e d way because of the
definitim of SYNTHESIZE.
Second, the usual distinction between 'antecedent' and 'consequent'
clauses
in
the pattern i s not h t a i n e d ; a clause in the pattern may serveThird, if 'defined1 lexical
item
w e r e t obe associated
w i t h
tht: patterns, notingwhich
variablesare
to be bound as arguments uponinstantiation,
then
the SYN'IIESIZE functioncan
be used to canpute sumnarizhgexpressions.
a*ls
SYNTHESIZE
remsents a possiELeformalism
for lexicalIV.
An f5wmole of theberation
SYNTHESIZE
Far
the sake o fillustration,
l e tthe
primitives be:
(BENIGN
?x)(
- ?X
?y)
--
?X thr?ebtesls ?y(GIVE
?x ?ob?y)
--
?x gives ?ob to ?y(BELONG
?ob
?XI
--
?ob
belongsto
?x
~IlITEND ?x
?Q)
--
?x
intendsto
do?Q
(
L
E
I
U
R
N
?X ?ob ? y )--
?X ?ob to ?y(?*ljlS-n~sT
?x ? y )-
?x -pays i.nt-:-sz: ?y(These
primitives and thepatterns
below
mayappear
somwhat a r k i f i c u , but wehave
chosen
a sinrpleillustration
due to
thedifficulties
in
followingexamples
with
axrethan
a
fewclauses.)
k t
the
pattern
setconsist
of
thefollowing four
pattms:
E
(BENIGN ?x),
( B M N G?ob
?y),
(GIVE ?y?ob
?XI,
(INTEND ?x (REIURN ?x?ob
?yl)l
PAT-2
: ?xtakes-loan-%
---
?y :I
(BENIGN ?x),
(EELOM;?ob
iy)
,
(GIVE ?y ?ob ?x),
(INEND ?x (REIURI'V ?x
?ob
?y)), (PAYS-INEREST ?x ?y))pm-3: ?x roba
-
?y:{(NOT
(KEXLGN?XI),
(BElXlNG?ob
?y), (- ?x ?y),(GIVE
?y?ob
?XI,
(NOT
(INEND ?x(RFNRN
Tx ?ob ?y)))3
Pm-4:
?xplays-practid
joke
on
?y :A rough graphic 'pepsentation of the set of patterns is sham in
Figure 1.
PAT-2
(takes-loan-from)PAT-4
(plays-pctical-j ake )PAYS- BENIGN BELONG
GIVE
THREATEN INTEND NOT- NOT-INqEND
INTEREST
BENIGN
Figure 1
Ncrw
consider the folluwing situations: Situation 1. Input beliefsBe1
{(BELXING
HARRY),(GIVE
HARRY
b K L E CHIE))
smTHESIZE(kl)
=
{{(BFLXING
WUlfS;THARRY),
(BENIGNMOE), ( G N E
HARKY
WMOE),
a' (INTEND MOE (REMW MOE WUET
HlRRY))3
{(NOT
BENIGN MOE)), (BELXING
WLETHARRY),
(THREATENMOE
HARRY),(GIVE
HARRY MALLETMOE),
(NOT INTEND MOE
~FEruRN
m
WLImIwRRY)13
The m m m d
.
.
matched patterns are-
rob and b o r n , adding the(conjectural) inf-tim that either
Hary
was threatened, 0x1 Moe intends toSituation 2. Input beliefs :
Be1
=
{(GIEUEWVK
1000-&XLARS JOHNDCE) ,(PAYS-INTEEST JOHNWEBANK)}
sYNTHESIzE(~1)
=
{ { ( m G N JOHNWE), (BELONG 1000-DOWARS
BANK)
,
(GIWBANK
1000-DOLLARS JOHNWE),(INTEND JO-E (RETURN JOHMXlE 1000-DOLIARS BANK) 1,
(PAYS-INTEREST
JOHNDOEBANK)
I I
As a msult of matching the
-
loan pattern, we have added three clauses.Situation 3.
Input
beliefsBe1 ={(INTEND JOHNDOE (REXKN JOHNDOE 1000-DOLLARS
BANK)),
(PAYS-INTEREST JOHNDOE
BANK)
1Here
SYNTHESIZE(I3el) fiturns exactly the same set as was returnedi
n
Situation 2. Note,however,
that the roles of(1)
(GIVE
RAM( 1000-DOLLARS JOHNDOE)and (2) (INTEND JOHMXlE
(RFIURN
JOHNDOE
1000-DOLLARSBANK))
have
been reversed.In
Situation 2, (1) was an input and ( 2 ) was infemed,
whereas
in
Situation 3, (2) was input wd (1) inferred. The currespondingclauses
of
the loan pattke2*nwere
serving as antecedents on one occasion andconsequents on the other. This follows naturKLly fran the way SYNIPIESIZE was
defined.
In
this regard the reader rnay notice that sane input belief sets mightyield ' w a r r a n t e d t
or
'spu~?ious' inferences--jumping to too many cmclusicms.Hawever,
the
incmmntal addition of new patterns corrects this anom19i
n
anatural way: Patterns
which
formerlywere
'least covers' may cease to be soi
n
V . Using Definitions to S e t -Up the P g t t e m Space
We have been particularly interested i n using definitions of words
to set up pattern spaces in which
SYNTHESIZE
could wark as an inferencerand a lexical insertion technique. Special attention was payed t o the 'speech
actv verbs, and a b i e f sample list is presented below. (The symbol '?Prt denotes a predicate variable. Also, primitive predicates are capitalized,
while defined predicates are underlined. ) Again, the definitions are greatly oversimplified
for
i l l u s t r a t i v e purposes.(define
-
t e l l(?x
?y ?p ?t) (and(-RE
? t O ?t)(NOT
(KNOW
?y ?p ?to))(SAY ?x ?y ?p ?t)
(KNOW ?y ?p ?t)
(CAUSE (SAY ?x ?y ?p ?t)(KNW ?y ?p ?t))))
(define request
(?x
?y ?p ? t )(tels ?x ?y (W ?x ?p ?t) ? t ) ) (define m s e
(?x
?y?Pr
?t)(and (EELS-OBLIGA'F€D
?x
(?Pr
?XI
?t)(tell
-
?x ?y UNTENP?x (?Pr
?x) ? t ) ? t ) ) )(define camand ( ? x ?y ?Pr ?t)
(request ?x ?y (?Fb ?y) ?t 1) (define implare
(?x
?y ?Pr ?t )38
The expansion of
these items to patterns over the primitives yieldsa set
in
which, far
example, KNOW<
-
t e l l 5 request iccnmand.The
inputset Be1
=
{CBEFQRE
tl t2), (SAY JAMES MASTER (INTENDJAMES (OPEN JAMES
DOOR) t 2 ) t2),(FEELS-OBLIGATED
JAMES
(OPE3 JAMES DOOR) t 2 1)tmuld be synthesized to (pmmise JAMES MASER (O?EN
+
LX)OR) t2), with &eedinferences
(KNOW FASTER (DElfi) JLLCES(OPEN
J A I CCOi:? t2) t21, etc., asd i c t a t d by the pattern instance of W s e .
A m t ? d bas been pmwsed
far
'fred
bfez-znciri-
by attern-
inatchig inwhich inference
c u t - f
can be structurally ccmstrained: A pattsxis
matchedif
itis
one of the minioil =emswhose
instantiati~n c o r n &he iqputin.~~mati.cm--~ven
if
this necessitates addkgan
mbitmryanounf
of additionalinfmmtion.
Similarly, on
the
questionof
b w
m y i n f m c e s to &aw:'
Enom-a inferemes are drawn to enable a c o h m t pattm to be matched. The method we have proposed
is
general in that it nrakes no assunptionsabout the particular predicates to be used
in
the patterns and beliefs.(Of
course,
it
does makeas^^
about h t counts as a patternm
a belief.1
The i n f m c i n g a u l d be done by a general purpose cmponent
wfiich
accepts aset
of
patterns as a parameta.Th*,
ap g r w
designing a system for inference by patternmtch
need not" devise external criteria, and certainlynot
miteria
to be associated Wi'th 1 every pattern. Ram the criteria arehqlicit
in
the system as a Wle; any wtternswhich
can be describedin
a v w general patterndescription
language w i l l genemteits
awn setof
internal miteria
fur inferencecut-off.
References
(C,75> Clark, H.H. 1113ridging,'' in Proceedings of the Workshop on
Theoretical Issues in Natural Language Processing. W i d g e ,
Mass&usetts., June 1975.
(JR,75) Joshi, ~ . 1 ( . and Rosenschein, S. "A FomKLism for R e l a t i n g Meal and Pragmatic Information," in Proceedings o f the Workshop on
Theoretical Issues in N a t u r w l Language messing. Canilxidge,
Massachusetts 3 June 19,75.
(Ri,74) Keg?, C. Conceptual M e m r y . Ph.Dw Thesis S t a n f c x d University.
Stanford, CaliTn;l..nia. 1974.
GR,75) Rosenschein
,
S.
Struc-hg a Pattern Space, w i t h Applications toL e x i c a l - - Information and Event Intermetation. Doctoral
-- --
-D i s s e r t a t i o n . University of Pennsylvania. W l a d e l p h i a ,
Pennsylvania. 1975.
( S c h , 75) Schank, R.
,
Goldman, N.,
Rieger,
C.,
and Riesba&, C. "Inference and P a r a p m e by Computer"
Journal,of-the A.C.M. Volume 22,No. 3. July, 1975.
(W,75) Wilks