• No results found

Semantic Based Parsing and a Natural Language Interface for Interactive Data Management

N/A
N/A
Protected

Academic year: 2020

Share "Semantic Based Parsing and a Natural Language Interface for Interactive Data Management"

Copied!
14
0
0

Loading.... (view fulltext now)

Full text

(1)

American Journal

of

Computational

Linguistics

~ i c r o f f c h e

32 : 5 8

JOHN

F .

B U R G E R ,

ANTONIO

L E A L ,

AND

A R I E

SHOSHANI

System Development

Corporation

S a n t a

Monica,

California 90406

m

c

T

We describe a natural-language recognition system having both applied and

theoretical relevance. A t the applications l e v e l , the prwram w i l l g i v e a

natural ccmmunications interface facility to users of existing i n t e r a c t i v e

data management systems. A t the t h e o r e t i c a l l e v e l , our work shows t h a t the

useful infoxmation i n a natural-language expression (its "meaning") can be

obtained by

an

algorithm t h a t uses no formal d e s c r i p t i o n of synt-. The

construction of the parsing tree i s c o n t r o l l e d primarily by semantics i n t h e

form of

an

abstraction of the nmicxo-world" of the DMS's f u n c t i o n a l capabil-

ities and the o r g a n i z a t ~ o n and semantic relations of t h e data base content

material. A prototype is c u r r e n t l y implemented in LTSP 1.5 on tho IBM

370/145 computsr at System Development Corporation.

In a

recent

article

in S c i e n t i f i c ,

American, Dr. Alphonse Chapanis says, " T f

t r u l y interactive computer ( ; y s t m are ever to be created, they will ~omehow

have to cope w i t h the... errors and v i o l a t i o n s of format t h a t a r e the rule

(2)

dialogue produced by t w a persons interacting w i t h each other by t e l e t y p e -

writer to solve a problem as~igned to them by experimenters showed that :not

one grernaaatfcally correct sentence appears in t h e entire protocol. tl

Many existing language pmcessors ( w o o d s , Kellogg

,

Thcmpson

,

e t c . ) [ 2,3,4)

a r e limited to what Chapanis calls "Irmnaculate p r o s e , " that i s , "the sen-

tences that are fed into the computer are parsed in one way or another so

that the m e a n i n g of the ensemble can be inferred frm conventional rules of syntax," which are a £0- d e s c r i p t i o n of the language. In effect, users

are required to i n t e r a c t w i t h these s y s t e m in s m e formal language, or at

l e a s t i n a language that has a formal representation i n the computer system

t h a t a user's expression must conform to (we are t h i n k i n g , in t h e latter instance, of Vhampsonls REL, which has an extensible formal representation

facility). In a d d i t i o n , most natural-language question-answering systems,

including all referenced above, require that a user's data be restruct-wedl

and reorganized acwraing t o the p a r t i c u l a r data base requirements of the

natural-language system to be used.

A t the level of a r t i f i c i a l i n t e l l i g e n c e research [ti ,6

,?'I

,

Mere is same

interest

in

systems that recognize meaning i n natural-language expressions by methods that dd not m i r e compiler-like syntactic a n a l y s i ~ of an

expression prior to asmantic i n t e r p r e t a t i o n . We b e l i e v e it is possible,

practical, and feasible, using new lingufstic processing strategies, to

design a natural-language interface system that will permit flexible, intu-

itive

coaansmicatiba w i t h information management systems and other computer

(3)

no prejudice about t h e user's system funckians and can be joined to almost

any such system with relatively l i t t l e effort. I t i s , i n addition, able to

infer t h e meaning of free-form English expressions, as they pertain to the

h o s t system, w i t h o u t requiring any formal description or representation of

English.

THE SEMANTIC INTEREACE ALTERNATIVE

The syntactic inflexibiiity of existing natural-language

processors

limits

their usefulness i n interactive man-madine tasks. O u r approach does n o t

use a c o l l e c t i o n of syntax rules or equations as they are normally defined.

Instead, w e construct a dictionary in which w e define words

in

terms of their

possible meanings with respect to the particular data base and data manage-

ment system (DMS) w e want to use and according to the possible relations

t h a t can exist between data-base and I3MS elements ( e . g . , an averaging func-

t i o n

on a group

CKE

numbers) i n the limited "micro-world" of this precisely

organized data collection. Words appearing

in

a user's

expression

t h a t a r e

not

explicitly defined are ignored by the system i n processing the expres-

sion; an

example

would be t h e word "the," which is usually not meaningful in

a data management environment. Wa thus avoid the expressive rigidity that

formal syntactic methods h p o s a on tha user

and

the excesaivcs time and

resource consumption t h a t results from the catibinatorial explosions usually

produced by such rnethade.

We distinguish

in

their d e f i n i t i o n s beween two types of words: content

(4)

'meaningsw are the objects, events, and concepts t h a t make up the s u b j e c t s

being

referred

t o by

users,

More p r e c i s e l y , for data axetnagernent systems,

these meanings (or "concepts") a r e the f i e l d names and entz'y i d e n t i f i e r s f o r

*e data b-e and the names for available IHS operations such as averaging,

s d n g , s o r t i n g , comparing, etc. Function words serve as connectors of

content

words. Their use

i n

natural language i s to indicate khe manner

in

which neighboring

conltent

words ar'e intended to relate to

one another.

In

the example "the

salary

of the secretary

,"

used belaw, "salary" and

"secretary," are c o n t e n t words,

and "of"

is

a function word used to connect

theta.

Many

cmntent wor&

are context sensitive, In a p a r t i c u l a r d a t a base, f o r

b t m c m , the ward "salary" may refer t o t h e data-base f i e l d name SECSAL if

the saXW frs "of a secretary," but may a l s o indicate the f i e l d

name

CLKSAL

if

it

is

a

*salary

of a clerk." In recpgnition of

this

we therefore d e f i n e

eaah aontent word by a

set

of

one

or

more

pairs

of the form

( ( X I Y l ) (X2 Y2)

. . .

(Xn Yn))

where

the

X i

a d

Y i are " o o n c e p ~ " ( t h a t

is,

f i e l d

names,

etc.) as described above.

This

expression

may be interpreted as,

"if

the word so d e f i n e d i r j t

contactually related

in

a sehtance to Xl,

its

particular meaning in this

c e n t a c t

is

Y 1 ,

if it

i s r eo related b X2, it meme Y 2 , m d ao f o r t h . " This

particular

oontextual

mnaranfng af

the

word

is

c a l l a d its sense. Two c o n t e n t

warm

are

consrid=& t o bls artmantically related i f the i n t e r s e c t i o n o f the

X i ' a fmtn the

definition

of one wort! w i t h the Yi's from the d e f i n i t i o n of

(5)

To get a more i n t u i t i v e understanding of this process, suppose, again, t h a t

a data base contains

e n t r i e s

f o r both secretaries and clerks w i t h salaries

fox each. Suppose "Suzi&' is an instance of a secretary and om" is an

instance of a clerk. We then have three words defined as follms:

Suzie ( (SUZIE SECY) )

Torn ( (TOM C-LK) )

Salary ( ( sECY SECSAL) (CLK CLKSAL) )

Processing

me

phrase "Suzie

'

s salary" would i n t e r s e c t the Y i ( " (SECY)

"

)

from

t h e

d e f i n i t i o n

of

"Suzie" w i t h t h e Xi's ("SECY" and "CLK") from t h e

definition of "salary." The intersection is nan-empty ("(SECY)")

,

and, i n

discovering the semantic r e l a t i o n s h i p the sense "SECSALI-' is assigned t o the

word "salary." Similarly, "Tan's salary" assigns t h e sense "CLKSAL" t o

"salary. !I

A particular b p l m e n t a t i o n of the natural-language interface processor

operates for a p a r t i c u l a r DMS/data-base t a r g e t system. It c o n t a i n s a

particular &&ion- c r e a t e d for t h a t t a r g e t system. For a p a r t i c u l a r d i c -

tionary, the s e t o f a21 l i s t s 0 5 p a i r s as described above, therefore,

c o n s t i t u t e s the equivalent of a ~ a n c c p t q ~ a p h ox network for the p a r t i c u l a r

data b a a malogous to those U R Q ~ hy many of t h e more conventj-onall, parsers

Pox semantic analysis folluwing (or during) the syntactic phase of parsing.

In the analysis of a particular input by our system, two words i n context

a r e t e ~ t e d using t h e " i n t e r s e c t i o n " method described abave and, if they are

found to be s e m a n t i c a l l y r e l a t e d , they are considered candidates f o r

(6)

Function words are defined as operators or processors t h a t perform

this

semantic

t e s t . The d e f i n i t i o n of one f u n c t i o n w o r d d i f f e r s fm that of

another according to its slope (see belaw) and also

in

t h a t t h e operational d e f i n i t i o n of a function word can reject a connection even though t h e two

words may be samntically related. In the operational d e f i n i t i o n of t h e

function word may be a list of acceptable concepts or a rejection list of

unacceptable concepts. In most conceivable data bases, t h e phrase " s a l a r y

in t h e secretary" would be thus rejected by t h e function word "in. n

As the analysis of an i n p u t expression proceeds, a "clumpifig" of word and

phr as e meanings more and more explicitly normally,

processing of the e n t i r e sentence r e s u l t s in a tree s t r u c t u r e made up of the

connected senses of a l l the content words fran the sentence. T h i s r e s u l t we

term the sentence qraph even though t h e input expression may not be a

grammatically cmplete sentence. This sentence graph will be t r a n s l a t e d

i n t o statement.

We recognize t h a t the linear ordering of the words in an input expression

is not

entirely randm and t h a t c e r t a i n aspects of

me

function of syntax

must be taken into accorunt. This is done by means of a new and p w e r f u l

azgorithm b k d on what we c a l l the syntactic-semantic slope. L i n g u i s t s

generally recognize that whenever two units of meaning are combined, one is

semantically domfnant and t h e other subordinate, as a modifier is sub-

ordinate to the modified word. A f t e r coenbinatfon, t h e d d n a n t word may be

wed in m o s t cases to refar to the canjoined pair. Thus, a "red h e r r i n g "

(7)

"salary." If this relationship of dominance i s represented v e r t i c a l l y on a

ltrectangular graph (i.e., dominance on the Y - a x i s ) , and

if

t&e l i n e a r order-

ing of the words

in

the expression is represented

on

the X-axis in n o w 1

left---right: order, then the connection of an adjacent pair of content

words or phrases will describe a linear slope on the graph. The slope is positive eir negative as the dominating sub-unit is, respectively, to t h e

r i g h t or to the l e f t of the subordinate

sub-unit.

For example, the phrase

"red herring" makes a positive slope, thus:

HERRING

/

RED

and "the salary of t h e secre=" makes a negative slope:

S;71LARY

Thus, the ~ p e r a ~ o n a l meanings of f q n c t i a n words operate on the meanings of

nearby

content

words. Dominance

is

assigned, semantic relationships are

verified,

and the relationships so discovered are accepted or rejected. If

accepted, the two word-meanings are connected, and the acceptable sense is

assigned to t h e dumllnant word.

Eunction words may connect c o n t e n t words in "positive," "negative

,"

or

"peak" connections. me f o l l m i n g are examples of each mannax of connection:

1. "Of" is a negative operator, as in " t h e salary of the

(8)

2. " ' 8 "

is

a positive operator, as in "the s e c r e t a r y ' s salary":

3 . "And" is a peak operator, as in "Atlantic and P a c i f i c .

"

In

contrast w i t h positive and negative operators, peak operators add

a representation of their m semantics i n t o t h e structures t h e y

build ;

AND

\

A-IC PACIFIC

4. Between any two adjacent content words there is

an

implicit "empty"

operator t h a t

is

a positive operator, as in "red herring":

RED

In general, all prepositions are defined as negative operators. T h i s is

equivalent

Go

the rule

used by syntactic

processors.

The positive empty operator is equivalent to

the rule

N P + A x x r P 3 P

and athew,

while

vexbe and c o n j u n c t i o n s a r e defined as peak operators,

giving our atatemcnt o f rules such errs

s + N P v E ' N P

(9)

Each operator has t h e f a c i l i t y to accept or r e j e c t any semantic rejlation

accordin9 to t h e precise d e f i n i t i o n of t h e function word for t h e h o s t data

management s y s t e m .

Progressive connection of word meanings and previously connected groups or

"phrase meanings" results in a tree graph t h a t we c a l l the sentence qraph.

For example, t h e question "What

is ;t;he

surface displacement of

U S .

d i e s e l

submarines?" could, f o r a particular d a t a base, produce from t h e dictionary

a s t r i n g of content-word and funeion-word d e f i n i t i o n s that might be rep-

resented typographically l i k e this:

( (SUB SURE-DISC) ) <OF> ( (U

.

S. LOC) ( (DIESEL TYPE) ) ( (LOC SUBS)

(TYPE SUBS)

As a xesult of processing, these will assemble into a tree structured (using

the senseg of the words) l i k e this:

WHAT

/

sUm-D=sP

P

LOC

AsuBs

TYPE

U , S . DIESEL

Even though this t r e e , o r sentence graph, i s created as a r e s u l t o f semantic

relationships instead of Eonnal r u l e s of grammar, it still. c l o s e l y resembles

the "parse t r e e " produced by m o ~ t conventional syntactic language processors.

With

respect t o t h e user's target data management system, t h e sentence graph

(10)

straightforward translation into the formal query language of the EMS. In

SDCrs DS/3 lanwage, f o r example, the above q u e s t i o n would be expressed as PRINT SURF-DISP WHERE TYPE EQ DIESEL AND lXXl EQ U.S.

The response to the usex's question will thus be the response frclrn h i s DMS

t o the formal query statement.

The user's input in this hypothetical example i s proper i n fom and grammar.

H o w e v e r , it need not have been. The request

OBTAIN SURFACE DISP FOR US SUBS SUCH AS HAS TYPE EQ DIE=.

would produce exactly the same sentence graph and thexefore, exactly t h e

same f o m l query statement with the same response f r o m t h e DMS.

It is not l i k e l y t h a t a syntax-based parser would have anticipated t h e odd

laxxguage-use and grammar of this last request. Without a syntax rule t h a t would alluw for the phrase "such as has" such a parser would n o t l o o k at the

semantics involved and would be unable t o interpret the r e q u e s t . Our syntax

algorithm gets t h e same results that would be expected f m m the application

of syntax rules without the need t o a n t i c i p a t e each grammatical construct

expected from the user.

In overview, t h e parsing algorithm makes a series of positive, negative, and

peak connections based on the operational meanings of t h e function wards

(including t h e "empty" aperator) and on the relations between meanings of the

content wort%?. The algoridt-Xlm adheres to the following rules:

e 1 Connections between content words are possible only if

(11)

and i f this result i s not rejected by the operation of the f u n c t i o n

word p e r f o d n g t h e test. The f u n c t i o n word d e f i n i t i o n also deter-

m i n e s which w o r d supplies its X ' s and which its Y's for the t e s t ,

I t thus controls which w o r d has its sense d e t e d n e d if t h e t e s t

ia successful. Most of ten (though there are exceptions)

,

p o s i t i v e

operators use t h e X's f r o m t h e w o r d to the r i g h t and t h e Y ' s from

the word to the left of . b e operator. P o s i t i v e operators, these-

fore, determine the sense of the word

t o

t h e right. This is

i l l u s t r a t e d u s i n g , again, t h e secretaxy and her salary, Consider

the d e f i n i t i o n of "Suzie" and "salary" as shown on page 5 , The

phrase "Suzie's salazy" has two content w o r d s , "Suzie" and

"salary, " separated by the function word

,

" s , " This function

word i s a positive operator and, hence, applies t h e intersection

t e s t t o the X i from the definition of "salary" w i t h t h e Yi from

the d e f i n i t i o n of " ~ u z i e . " These values are, xespactively,

'

I (SECY CLK) " and

"

( k m )

.

"

The i n t e r s e c t i o n y i e l d s

"

(SECY)

,

"

which is acceptable to the

"

' s " o p e r a t o r , and the connection i s

made with "salary" as the dominant word. The sense of "salary"

is the Y i associated with "SECY" in t h e d e f i n i t i o n of "salary,"

hence, "SECSAL." T h i s selection process is reversed f o r negative

aperators, while peak operators employ both k i n d s of t e s t s , one

on each s i d e of t h e peak.

Rule 2: N o node i n a sentence graph may have m o r e .than one dominating node. That is to say, a l l connections m u s t r e s u l t i n trees, This

I s a canmon asswnptLon consistent with conventional syntax-driven

(12)

Rule 3: Given a subtree, a c o n s t i t u e n t on its left has the p o s s i b i l i t y

of conneation only to nodes of the subtree's positive a d j a c e n t

slope, and a c o n s t i t u e n t on the r i g h t can connect onLy t o the nodes i n the adjacent negative slope. I n t u i t i v e l y , this means that if

the nodes of a subtree are connected by "lines" that are "opaque

b a r i e r s r n then a c o n s t i t u e n t on either side of t h e s u b t r e e may

connect to it only on those nodes that it can rlsee.r' I t may not

connect t o nodes on the "inside" or the "fax s i d e " of the s u b t r e e .

This i s a powerful h e u r i s t i c r u l e t h a t eliminates t h e need t o t r y connections to many syntactically impossible portions of t h e sub-

tree. In effect this one rule, together w i t h the definitions of

the f u n c t i o n words, replaces all the syntax rules used by most

conventional parsers.

R u l e 4: I n order t o minimize disconnection of existing subtree

s t r u c t u r e s (badcup) and s t i l l consider a l l possible connections,

the system should, whenever possible, constrztct,subtrees s t a r t i n g

from t h e top and make new connections from belaw. This rule leads

to the following algorithm: Scan the consUtuents from left t o

right making negative connections, then scan from right to left

making positive connections. S c a n thus back and forth u n t i l no

more connections can be made. Then make any poasible peak aonnec- t i o n s and repeat the algorithm. Continue t h i s process u n t i l a l l

c o n s t i t u e n t s have been connected i n t o a single tree,

(13)

or

analysis of the expression. Therefore. there is no need to carry along

temporary multiple construction p o s s i b i l i t i e s , The algorithm may eirher

query t h e user at this point for disambiguation or W d w t the pxocesging and

inf o m reason,

I.

Chapanis, Alphonse. Interactive human cammunlcation, Scientific

American, May, 1975.

2. Woods, W. A, Trahsition network gr-ars for natural language analysis.

Cozmnunications of the ACM, October 13, 1970,

3. Kellogg, C. H,, et a l , The CONVEXGE natural language data management

system: current status and plans. ACM Sym~osium on Information Storaqe

and Ratrieval, University of Maryland, 1971.

4, Thompson, F, B . ; 'Lockman, P. C.; Dostert, B.; Deverill, R, S. REL:

a rapidly extensible language. Proceedings of 24th National Conference,

ACM, New York, 1969, 399-417,

-

Riesbeck, C, K. Computational understanding. Theoretical Issues i n

Natural Langu~ge Processinq: Proceedinqs of an Interdisciplinary

Workshop in C a n p u t a t i c m a l ~inguist&cs, Psychology, Linguistics and

Artificial Intelligence. Cambridge, Massachuastts, June 10-13, l975.

6, Waltz, D. L. On understanding poetry, Theoretical Issues i n Natural

Langtmgs Processing, Proceedings of an Interdisciplinary Workshop in

Camputational Linguistics, Psychology, Limuistics and ~rtificial

(14)

7 . Sdhank, Roger, and Tesler, L. G. A Conceptual Parser for N a t u r a l .

Language. Stanford

Artificial I n t e U i g e n c e

Project. Memo No. AI-76,

References

Related documents

The top-level findings of the LMI research team were: (1) Persons with experience in using prototype (Section 845) and research OTs (TIA’s) viewed them positively; (2) effective use

• Small Business Innovation Research (SBIR) and Small Business Technology Transfer (STTR) grants are a government funding program offered by eleven federal departments and awarded

MPI is set up as a programme of the Global Security Institute (GSI), one of its member organizations. It was founded in 1980 to do re- search, education, and advocacy relevant to

T-786c Polymorphism in nitric oxide synthase 3 gene and Nitrit Oxide Level of Diabetic Retinopathy in Javanese Population.. Putri Widelia Welkriana 1 , Sunarti 2* , and

Abdel-Azeez HA, El-Okely AM (2009) Association of the receptor for advanced glycation end products (RAGE) -374 T/A gene polymorphism and circulating soluble RAGE with nephropathy

The learnability (defined as “How easy is it for users to accomplish basic tasks the first time they encounter the design?” [10]) and memorability (when users return to the

• in case of delays in charge (of) (prep phr) in danger (prep phr) in debt (prep phr) in charge of (prep phr) in difficulties (prep phr) in fact (prep phr) in front of (prep phr)

Keywords: speaker authentication; text independent; mobile devices; LPCCs; naive Bayes; voice;