• No results found

Referential Nets With Attributes

N/A
N/A
Protected

Academic year: 2020

Share "Referential Nets With Attributes"

Copied!
6
0
0

Loading.... (view fulltext now)

Full text

(1)

North.Holland Publishing Company © Academi~ 1982

REFERENTIAL NETS WITH ATTRIBUTES

Christopher U. Habel *

ODe of the essential problems in natural language production

and understanding is the problem of processing referential

relations. In this paper I describe a model for representing

and processing referential relations: referential nets with

attributes. Both processes (analyzing and generating

referential expressions) are controlled by attributes. There

are two types of attributes, on one hand, the ones to the

internal substitutes of the objects spoken about, on the other hand, the ones to the descriptions of these objects.

I. BASIC NOTIONS: KNOWLEDGE AND SEMANTIC REPRESENTATION

It is a well-known fact that (computational) models of human language production

and understanding have to take the knowledge of the speaker and the listener into

consideration. Thus, one of the main problems is to describe and represent this

knowledge and to distinguish relevant subtypes. A suitable classification which I

described in detail in Habel [7] distinguishes three subtypes:

- knowledge of facts or assertions, e.g. of states of the world, events, etc.,

the "factual knowledge',

- knowledge of rules, e.g. of rule-like relations between objects or states of

the world (or classes of such entities), the "inferential knowledge',

- knowledge of objects, e.g. of persons in the world, the "referential

knowledge'.

(In this paper I give emphasis to the third type: referential knowledge

represented by "Referential Nets').

The common core for representing all these types of knowledge is the "semantic

representation language" SRL. From the formal point of v i e w SRL is a

(propositional) symbolic language as described by Kalish/Montague [8]. Well-formed

SRL-expressions are generated by normal (recursive) formation rules. I

distinguish - as usual - two kinds of meaningful expressions in SRL, terms and

formulas. Thus we define:

TER - the set of well-formed terms FOR - the set of well-formed formulas SRL = <FOR, TER>

For the purpose of knowledge representation SRL contains some specific operators,

e.g.

on the factual level: tenses, beliefs, ...

on the inferential level: entailment relation,...

on the referential level: description operators of various types.

Since SRL is a representation language, I now have to treat the central question

of "representation theory': What kinds of entities are represented by the

different entities of SRL? Here I follow - with minor changes, which I will not

discuss in this paper Miller [I0]: The expressions and entities of SRL

represent cognitive concepts. The operators correspond to different types of

"concepts" in the Miller-approach, e.g. predicative , nominal and modifying

concepts (this third class are the "operators" of Miller.). E.go by means of

description operators it is possible to construct nominal concepts from

(2)

102 CH. U. HABEL

p r e d i c a t i v e concepts or other n o m i n a l concepts.

I shall n o w illustrate SRL w i t h an example. In u n d e r s t a n d i n g and r e p r e s e n t i n g the sentence

(i) I w i l l meet B a r b a r a in the u n i v e r s i t y tomorrow.

the act of referring to "I", "Barbara", and "the u n i v e r s i t y " is to be processed, i.e. the ( o b J e c t - ) o r i e n t a t i o n to them is the i n t e r e s t i n g p r o b l e m from the referential point of view. The SRL r e p r e s e n t a t i o n of (i) is

(2) time (meeting (l,Barbara, ETA(x) :university(x) ),tomorrow)

The a r g u m e n t s of "time" and "meeting" p r e s e n t the most relevant types of

referential operators: "I" and "tomorrow" are deictic substitutes (personal or

temporal), B a r b a r a is a p r o p e r n a m e and the e x p r e s s i o n "ETA(x): u n i v e r s i t y ( x ) " is

built up from the n o m i n a l concept "'university" (representing a class of

i n s t i t u t i o n s for e d u c a t i o n and science) and the v a r i a b l e - b i n d i n g d e s c r i p t i o n operator "ETa'. "ETA(x): p(x)" has the m e a n i n g "'an element from the class {x/p(x)>". E T A is an i n d e f i n i t e a n a l o g y to the IOTA-operator of formal logics (Similar o p e r a t o r s are i n t r o d u c e d by Hilbert as E P S I L O N - or E T A - o p e r a t o r . ) . The s e m a n t i c s of the d e s c r i p t i o n operators is given by a set of i n f e r e n c e and

e v a l u a t i o n rules. E.g. there is an inference rule w h i c h relates the ETA-

e x p r e s s i o n s to e x p r e s s i o n s w i t h e x i s t e n t i a l quantification: ( 3 ) p ( E T A ( x ) : q ( x ) ) ~ EX x : p ( x ) & q ( x ) 1"

Some e v a l u a t i o n rules w i l l be d e s c r i b e d in the following chapters, e.g. "creation

of a referential object'. (Both types of rules are d e s c r i b e d in detail in Habel

[5], [6].) Thus, E T A is the formal r e p r e s e n t a t i v e of one of the m e a n i n g s of i n d e f i n i t e articles.

F u r t h e r d e s c r i p t i o n s of the objects m e n t i o n e d in (i) or in the SRL e x p r e s s i o n (2)

can be generated by (focussing) t r a n s f o r m a t i o n s similar to the solution of

a l g e b r a i c equations: The u n i v e r s i t y m e n t i o n e d above is also the one described by (4) ETA(x) :time ( m e e t i n g ( l , B a r b a r a , u n i v e r s i t y ( x ) ) , t o m o r r o w ) 2~

or the n a t u r a l language equivalent "a (the) u n i v e r s i t y where Barbara and I will

meet tomorrow". (The uniqueness of the E T A - d e s c r i p t i o n depends on the

s l t u a t l o n a l / t e x t u a l context.)

B e f o r e we pass on to a d e t a i l e d d e s c r i p t i o n of "referential nets', I will make a short r e m a r k on d i s c o u r s e and communication. As m e n t i o n e d above h u m a n language p r o d u c t i o n and u n d e r s t a n d i n g is based on the s p e a k e r - l i s t e n e r ' s k n o w l e d g e of the language, the w o r l d and p a r t i c u l a r l y the p a r t i c i p a n t s in the communication. Thus, I a s s u m e a d i s c o u r s e model 3* which is based Dn t r i p a r t i t e k n o w l e d g e (namely

factual, inferential, and referential), each of these types is r e p r e s e n t e d by

s t r u c t u r e d sets of S R L - e x p r e s s i o n s .

\

2. T H E R E F E R E N T I A L NET

Processing, e.g. s t o r i n g or retrieving, r e f e r e n t i a l relations is executed by a p a r t i c u l a r component of the d i s c o u r s e p r o c e s s i n g system, the "referential procedures" (ReP). T h e RePs w o r k on a m e m o r y s t r u c t u r e w h i c h is a d e q u a t e for the r e p r e s e n t a t i o n of k n o w l e d g e about objects, the "referential net" (RefN) 4*. A RefN consists of entities of three different types:

- referential objects (RefOs), which are the internal s u b s t i t u t e s for the objects s p o k e n about,

- descriptions, i.e. terms of SRL which describe the RefOs,

- attributes, i.e. p r o p e r t i e s of the d e s c r i p t i o n - r e l a t i o n b e t w e e n a R e f O and one of its d e s c r i p t i o n s or p r o p e r t i e s of the R e f O (see chap. 3).

A f t e r d e t e c t i n g a n e w indefinite d e s c r i p t i o n (as ETA(x) : unlversity(x)) R e P creates a n e w " r e f e r e n t i a l object'" (RefO). D u r i n g t h e d i s c o u r s 6 (after the i d e n t i f i c a t i o n process) further d e s c r i p t i o n s of the same R e f O will be linked to this RefO. (The r e l a t i o n a m o n g the d e s c r i p t i o n s of the same R e f O is the w e l l k n o w n

relation of eoreference,) ~ u s we have, for example, the f o l l o w i n g set of

d e s c r i p t i o n s of the RefO m e n t i o n e d above :

(5) r . l ~ . ~ d. II--ETA(x) : university(x)

(3)

For the d e s c r i p t i o n of the r e f e r e n t i a l net I use SRL and in a d d i t i o n a "finite but

extendable" set, REFO, of "referential objects'. The r e f e r e n t i a l objects are

seen as a kind of b a s i c term of SRL. The set R E F O is analogous to the set of

variables. The p h r a s e "finite but extendable" is used to emphasize the dynamic

aspect of the set REFO and the RefN. REFO consists of e x a c t l y those RefOs that are a c t u a l l y needed. It is n e c e s s a r y to d i s t i n g u i s h two types of terms, one type

w h i c h consists of the m e n t a l representatives, i.e. the RefOs, and the other which

contains those terms w h i c h f u n c t i o n as descriptions. N a m i n g this set b y D - T E R we

h a v e T E R - R E F O U D-TER. Thus, referential nets, RefNs, can he defined as

( 6 ) R e f N c D - T E R x REFO •

For <d-ter, r.i> 6 R e f N I also use "d-ter deser r.i'" , "d-ter is a d e s c r i p t i o n of r.i". In this w a y a d e s c r i p t i o n - r e l a t i o n is d e f i n e d on the basis of the RefN. B y m e a n s of referential nets it is possible, e.g. to a r r i v e at one object (i.e. RefO) v i a different aspects, i.e. different d e s c r i p t i o n s , of this object. It is

only by such aspects (cp. Schank's [II] w a y o f m e m o r y discrimination), that a

s y s t e m is able to choose "'a best d e s c r i p t i o n " of the objects w h i c h are the theme

of the d i s c o u r s e (see below). The referential net (5) is s i m p l i f i e d in two

crucial points, first, that the descriptions of r.l o p e r a t e on M S R L ~ p r e s s i o n s and not on RefOs, too. But this can be treated s i m i l a r l y to the focussing process of (4), w h i c h can be f o r m a l l z e d now:

( 7 ) p ( r . i ) 8--- r . i - - ETA(x) : p ( x )

U s i n g ( 7 ) , t h e d e s c r i p t i o n s c a n b e s o l v e d w i t h r e s p e c t t o a l l R e f O s , i . e . a l l a r g u m e n t s a r e f i l l e d b y RefOs o r o p e n t e r m s . Thus we c a n d e r i v e ( 8 ) f r o m ( 5 ) :

(8) r . l ~ - - - d . I I - E T A ( x ) : u n i v e r s i t y ( x )

d . 1 2 - E T A ( x ) : t i m e ( m e e t i n g ( r . 2 , r . 3 , u n l v e r s l t y ( x ) ) , r . 4 ) d . 1 3 - I O T A ( x ) : p r o f e s s o r - a t ( r . 3 , u n i v e r s l t y ( x ) )

r . 2 - - d . 2 1 - " I ' " r . 3 - - d . 3 1 - - B a r b a r a r . 4 - - d . 4 1 - " t o m o r r o w "

Secondly, R e P has tO e x p l i c a t e the r e f e r e n t i a l links of deictlc substitutes, e.g. "r" r e f e r s to the speaker, etc.. " " s t a n d s for the s p e c i f i c ReP w h i c h evaluates d e i c t l c expressions, i.e. w h i c h i n s ~ a n t i a t e s deictic ~xpressions w i t h RefOs, e.g. "I'" - SPEAKER(l) (cp. fn 6.). By the same m e t h o d s n e w RefOs for "we" c a n be created, e.g. in the p r o c e s s i n g of " T h e r e we w i l l discuss some papers." (We = Barbara + I)

3. A T T R I B U T E S IN R E F E R E N T I A L N E T S

I will n o w extend the basic t w o - p l a c e r e l a t i o n b e t w e e n RefOs and d e s c r i p t i o n s to the more adequate concept of a m a n y - p l a c e r e f e r e n t i a l relation. The a d d i t i o n a l p l a c e s w i l l be called "attributes'. In the first step of e x t e n t i o n I give emphasis to "attributes of descriptions'. T h e y r e p r e s e n t p r o p e r t i e s of the d e s c r i p t i o n relation, i.e. of pairs <d, r> E RefN. Thus, the e x t e n t i o n to an a t t r i b u t e d R e f N (ARefN) uses a set A T T of a t t r i b u t e s and changes (6) to

(9) A R e f N c A T T x D - T E R x R E F O .

For <att, d, r > C A R e f N I also say "'att is an a t t r i b u t e of d w i t h regard to r". Some p o s s i b l e a t t r i b u t e s are (Note t h e fact that this list is not c o m p l e t e and not fully adequate):

- syntactic and semantic features of the description, e.g. gender, number, sexus,

etc.

- n u m e r i c a l values, e.g. "grades of relevance of a description", "degrees of "being the TOPIC / b e i n g in the FOCUS" ", "recency". 5* A simple s t r a t e g y for d e - / c o d i n g is: '~se/try the d e s c r i p t i o n or RefO with h i g h e s t degrees !'" - names of persons, e.g. p a r t i c i p a n t s of earlier discourses. By this a t t r i b u t e it

is p o s s i b l e to find a d e s c r i p t i o n relevant to speaker and listener (Cp. Clark / M a r s h a l l ' s [I] co-presence triples).

- situations in which the d e s c r i p t i o n / R e f O Is relevant or was i n t r o d u c e d (cp. W e b b e r ' s [12] " e v o k e ° - p r e d i c a t e ) .

(4)

104 CH. U. HABEL

Before I explain the concept of "attributes to descriptions" by some examples, I will new sketch some aspects of function and statue of the attributes. Firstly, in

both processes, analysis as well as generation of referential expressions, it is

necessary to take into consideration a set of alternatives. All of them are

possible candidates to be the referent (in analyzing) or to be a good description (in producing) respectively. The goal of the ReP is to choose exactly one from the

alternatives, namely "the best'. It is obvious that trying to choose the best can

fail. Then the decision has to be revised afterwards. (But this is a total

different problem.) In other words, the function of the attributes is to point to

the most appropriate (or being more careful: to that which seems to be the most

appropriate) of the alternatives. Secondly, I think that, on one hand, knowing and processing attributes is part of human natural language processing, and therefore,

these attributes have to be part of the discourse processing model. On the other

hand, knowledge of attributes is different (cp. the list of possible attributes,

above) from the factual or referential knowledge. Thus, I propose a strict

separation between descriptions and their attributes. This strict distinction

contrasts to Webber's [12] solution: her use of the "evoke'-predieate mixes the

concepts of descriptions a n d their attributes in a cognitive inadequate manner.

Now I shall continue with examples of the use of attributes • Let us suppose a

situation in which a questioner Q asks:

(I0) Where can I study Computational Linguistics?

and the answerer A knows that the university which is (internally) represented by r.l is a good university to study CL. The selection of the best description of r.l

depends on Q and A (and the situational context). I assume the following

attributes to the descriptions of r.l (from (5), (8)):

(ii) descr, general relevant to: time of relevance:

-d.ll "unspecified"

-d.12 SPEAKER(I), LISTENER(1), Barbara NEIGHBOR(r.4) 6*

-d.13 SPEAKER(1), LISTENER(I), KNOW(Barbara) always

(II) describes a situation, in which the name / location of the university in

question is not represented, e.g. since the information about the possibility to

study C L w a s given by an earlier utterance of Barbara "We have some courses in CL" but A (whose RefN is represented by (ii)) does not know where Barbara teaches. The best description can be chosen by comparing the attributes of Q with those of the

descriptions of r.l. If Q knows Barbara, .i.e. <KNOW(Barbara)> ATT (Q) 7*, then

the best description will be:

(12) d.13 - "The university where Barbara teaches (as a professor)."

Ranking d.13 higher / better t h a n d.12 depends on the dimension "time of

relevance'. Furthermore, if Q knows more about r.l it is possible that Q will

react with,

(13) "Well, at Amherst.'"

and now A is able to extend hls/her RefN with a further description of r.l (14) r.l - d.14 - Amherst - - "general high relevance" "always"

(This attribute depends on the fact that proper names have high relevance in

general.)

Anagously we can use attributes in the decoding process. Look at the following

pair of sentences (similar to (I)).

(15) a. I will meet the head of the linguistic department tomorrow. b. She is a specialist for Montague grammar.

To solve this referential problem ( and not to be puzzled by the pronoun "she" )

we need further background knowledge, e.g. "Barbara is the head of the linguistic

department.". With this prior knowledge and the additional (theoretical) concept

of "attributes to RefOs" 8* as described in (16) "female" "local TOPIC'- r.3-- d.31- Barbara

d.32- IOTA(x) : head-of(x ,linguist. dept.)

the referential problem of (15) is easy to solve. A system (human listener or

machine) has to match the pattern corresponding to the pronoun "she" ~ I c h is

induced by the context (15.5) against the descriptions, their attributes and the

attributes of the RefO's. The crucial attribute is here "female" for both "she"

and "r.3".

(5)

between attributes of descriptions and those of the RefOs, is very important in

German. In spite of the syntaetlc ineongruence (neutr. vs. fem.) it is possible

(in colloquial German) to say:

(17) Ich traf gestern ein Maedchen. Sle hatte rote Haare.

I m e t yesterday a glrl . She had red hair .

neutr. ~ fem.

4. CONCLUDING REMARKS

By means of an extended referential net (w~th attributes both of descriptions and

RefOs) it is possible to describe how to store and process referential entities

(and expressions) in a cognitive adequate and powerful way.

Following the principles and concepts described above a system for anaphora

resolution has been implemented (Guenther [3]), which will be a component of the

next version of the BACON-system (cp. fn 3).

FOOTNOTES:

* Parts of the first draft of this paper were prepared during my work with the

project "Automatic Construction of Semantic Nets" at the Technical University

of Berlin. My current research on "referential nets" has been supported by the

DFG (German Research Foundation). Author's address: Blelbtreustr. 36 a, D-1000

Berlin 15; Fed. Rep. Germany.

I* From a formal point of v i e w "inference rules" are transformation rules on SRL.

Their cognitive and philosphlcal status, which is analogous to Miller's [I0]

"conceptual entailment', is described in }{abe], [7].

Here, i,e. in (3), and in the focussing rule (7) "p" stands for all types of

concepts which are represented by open formulas.

2* The focussing transformation mentioned above is based on the concept of

referential nets and therefore its description will be postponed to the

following chapter. In (4) a special "bar-convention" is used, which I do not

want to explain in detail here. In case a term is necessary as argument, e.g.

of "meeting", the "bar" will change the open formula "p(x)'" to an "open term"

3* Cp. }{abel [7] and Gust / Hahel / Rollinger [4]. This model is the theoretical

basis of the natural language QAS BACON (Berlin Automatic COnstruction of

semantic Nets), developed at the Technical University of Berlin, which was

supported by a grant from the Federal Government since 1978. It should not be

confused with the BACON-system of Carnegie-Mellon University (we chose the same name at the same time).

4* The concept of referential nets is similar to discourse referents (Karttunen

[9]), reference diaries (Clark/Marshall [ 1 ] ) and discourse models (Webber

[12]). The most relevant difference between referential nets and the other

approaches and the main advantage of referential nets consist in the further

attributes described below.

Note the fact that the acrony=s ReP, RefN, etc. can stand for the respective

concept, e.g. "referential net', as well as for a specific instantlation, e.g.

the RefN (5) or (8). Furthermore ReP concerns the system of referential

procedures as well as the individual procedures. In all these cases the meaning of the acronym is obvious through context.

5* In this paper TOPIC / FOCUS are used in an informal and intuitive way. (For a

detailed investigation of these concepts cp, Grosz. [ 2 ] ) Note that the

(6)

106 CH.U. HABEL

6* SPEAKER(l) stands for "the speaker of u t t e r a n c e ( I ) ' , i.e. this a t t r i b u t e gives more i n f o r m a t i o n than W e b b e r ' s [12] "evoke(l)'. "KNOW(_)" represents "people knowing _ ' . "NEIGHBOR(--)" stands for "points / periods of time near to the point of time b e i n g the argument'. I do not want to explain the u n d e r l y i n g "logic of time" here.

7* I w i l l not explain the concept of "attributes of a p e r s o n / participant" here. I use it in a s t r a i g h t f o r w a r d and intuitive manner.

8* The extention to a t t r i b u t e s of RefOs is a n a l o g o u s to that f r o m (6) to (9): A A R e f N c ATT. I x D - T E R x (REDO x ATT.2) •

ATT.I stands for the attributes of descriptions, A T T . 2 for the a t t r i b u t e s /

p r o p e r t i e s of RefOs. A A R e f N is a "double attributed RefN'. N o t e the fact that s a m e of the a t t r i b u t e s of the RefOs w i l l be c o m p u t e d f r o m those of descriptions~ e.g. "female" from a a t t r i b u t e of "Barbara'.

REFERENCES:

[ I ] C l a r k , H . H . / M a r s h a l l , C . R . ( 1 9 8 1 ) : " D e f i n i t e r e f e r e n c e a n d m u t u a l k n o w l e d g e ' " • i n : A . K . J o s h l / I . Sag / B. W e b b e r ( e d s ° ) : E l e m e n t s o f D i s c o u r s e U n d e r s t a n d i n g . C a m b r i d g e U n i v . P r e s s : C a m b r i d g e .

[2] G r o s z , B. ( 1 9 7 8 ) : " D i s c o u r s e k n o w l e d g e " i n : D. W a l k e r ( e d . ) : U n d e r s t a n d i n g Spoken L a n g u a g e • North-Holland: N e w York.

[3] Guenther, S. (1981) : V e r f a h r e n der R e f e r e n z a n a l y s e fuer das n a t u e r l i c h - s p r a c h l i c h e System BACON . Diplomarbeit (Thesis) T U B e r l i n , FB-Informatik.

[4] Gust, H. / Habel, Ch. / Rollinger, C.-R. (1982) : " L a n g u a g e - W o r l d Systems : A c o m m u n i c a t i o n m o d e l " in : Proceedings of the w o r k s h o p "Models of d i a l o g u e : theory and a p p l i c a t i o n " Informatlcs L a b o r a t o r y of L i n k o e p i n g University. Jan.1981. ed. by U. H e i n / S. Haegglund. to a p p e a r at H a n s e r / M o M i l l a n : M u n i c h / London.

[5] Mabel, Ch. (1980) : " R e f e r e n z e n Inferenzen". in : Rollinger,

C . - R . / S c h n e l d e r , H.-J. (Hrsg.) : I n f e r e n z e n in n a t u e r l i c h - s p r a c h l i c h e n S y s t e m e n • Einhorn: Berlin.

[6] Habel, Ch. (1981) : "Zur G e s c h i c h t e y o n R e f e r e n z o b J e k t e n - Tell I: D i s k u r s m o d e l l e " . ms. , Berlln.

[7] Habel, Ch. (1982) : "Inferences - the b a s e ot ~emantics?" to a p p e a r in the P r o c e e d i n g s of the K ~ n s t a n z Conference on "Maaning~ Use, and I n t e r p r e t a t i o n of L a n g u a g e " (Sept. 1981), de Gruyter: Berlin.

[8] Kallsh, D. / M o n t a g u e , R. (1964) : L o g i c - Techniques of formal reasoning. Harcourtt Brace &World : N e w Y o r k •

[9] Karttunen, L. (1976) : "'Discourse referents" in. J.D. M c C a w l e y (ed.) : Notes f r o m the linguistic underground. Syntax and Semantics 7.

[10] Miller, G. (1978) : "Semantic relatlons among words" , in : Halle, M. /

Bresnan~ J. / Miller, G. (eds.) : L i n g u i s t i c theory and p s y c h o l o g i c a l reality. Cambridge, M a s s . .

[11] Schank~ R.: (1979) : R e m i n d i n g and M e m o r y Organization: An I n t r o d u c t i o n to M O P s • Yale Univ. D e p t . C o m p . S c i e n c e . R e s e a r c h Report 170.

References

Related documents

After fixing some notation (Section 2.1), we briefly recall the main result by Hush and Scovel (2003) about SVO and rate certifying pairs (Section 2.2) and the main result by List

The ability of the National Early Warning Score (NEWS) to discriminate patients at risk of early cardiac arrest, unanticipated intensive care unit admission, and death. The trauma

Notice that an adverb clause, like an adjective clause, contains its own subject and predicate, but it must be subordinated to a main clause to make sense..

To analyze the tribological effectiveness and related lubrication mechanism of nanoparticles enriched palm TMP ester in actual engine environment, the corresponding

The backend systems and the legacy systems cover a variety of information systems such as the integrated customer content system, the data warehouse system, the management

Rossi (eds) Modern Monetary Macroeconomics. A New Paradigm for Economic Policy, Cheltenham, UK and Northampton, MA, USA: Edward Elgar. Citraro) ‘World monetary disorders: the

How might mobile interaction be situated around artefacts of personal places, and support users to access content from their smartphone while managing their physical presence..

We compare the targeting performance of the following household identification models: (1) the current Oportunidades model (current SUP) which is the model that officially selects