herican
JournaI of Computational Linguistics
M i c r o f i c h e 2 0AN APPROACH
TOV E R B A L I Z A T I O N
A N D
T R A W S L A T I O N
B Y
M A C H I N E
WALLACE
L ,
CHAFE
University of California
B e r k e l e y
C o p y r i g h t 1 9 7 5
Phis retport d o 3 c r i b t : n [I m o d d f o r m n c h i n o t r ~ n n l n t iori d e i ~ e l o p c d
at Berkeley d u r i n ~ l'972-7/tm 'Jhe ,nodel i~ brli1-t r~raurbd a c c t of
procedures c z l k d v o r b n l i z a t i m , i n t e n d c t l t o sir::ul:iLlt? t h e procn::r,us
e m d o y e d by a sb)ecaker o r writer
in
t u r n i n y s t o r m 1 k ~ ~ o w l e c l c ntow o r d s , v e r b n L i z ~ t i o n in z c s n tr> c o n n i n t of : ; ~ d - ~ c r ) t ~ c w t 1 1 ~ 1 j ~ n t i r ~ r ~
and l e x i c ? l i z n . t 8 o n prr>cannr?r: i ~ h i c h i rlvrlPve c , l b . a t ivr; r:hoicr:n on t h e
p a r t CI
:'
t h e ~ 1 ~ 5 , l l izr;*r, t o t 7 c t h c r w i t h : l l r ; o r i t h a l c s - r r ~ t a c t i c ~ j ~ ~ i e , c r , r , c r ;r ? e t e m i n e ( 3 b ; ~ 1 t i I , 3 i n 1 :; v i c ~ : . \ r l :1:3
1 )
the r \ c o n s t n u c t o f tblo venb;31 i x r 1 t i \ ~ l 1 )rr)c:f:r,zc:; l ~ , ! l i - ~ r : ~ l v ~ ~ c ~ ~ Li n ' t o t h e o r i ( < i n n l R ~ O : r C r 1 I , t ( : x t af1d ( 2 tA.;n l i 1 7 , l i c ~ t ion ~ f '
~ a r n l l e l e r l ' r a l l z a t i ~n .,lroct?E;sos i n t lit t : i r i r e t 1 -he t a r ; c t lm-IJR + e ~ ~ e r S , J i z a t i ~ n .\)o%r; T o r c m ? * 3 t i v e c:/xncea t I t h t :our:c
- + langu,l:l;s - v e r b : * l i z -it ion wid t r i e s to ;1:;1j117 c ~ r r e g : ~ o n d l n g ; C ~ ~ ~ C Cd b S ,
t h e silme t i m e t h a t it an l i e s s j r n t a c t l c D F ~ > G ~ : B S ~ S d ~ c t r q t c c : 5-7 t h e
gram=*
01 t i ~ c t:)rIyet L a n r r 1 ~ ~ 3 f r e . J e r b g l l i z ~ t i o n :i:ld t r a n s l a t i j np r o c e s s e s a r e i l l u s t r a t e d , w l t h e x a n p l l ; ~ t!jkcln fro-.,: c;ny;llsh i - h i
J a p a n e s e , . t i few of t:lesc 9rocctr;:c:s h:jvc b v c r l i,.,, 1sr;ierit;sd L Y L an i n t er:jc t ~ v c p-,s.op;r m i t P:ic:i1 1 t 1 t ; s of I L , a ~ r r e r ~ c ~ ~ L k ~ ~ 1 : r 2 1 : : , I T
L a b o r a t o r y , but t i n t ~ ~ ~ ! t of L C . r e i:; LC, ( I f ? r : ~ o r l s L r > : i t l : L :I&:
A b s t r a c t
~ c k n o w l e d m e n t s
I
Overview11, dubconceptualization
111, b Lx:imnle
1 1
IV.
~ e x i c a l i z a t i o n of a J JV.
L e x i c a l i z a t i )n o f aPI
VI. Jhe Lexicon
V I I . Discourse I n f o r m a t i o n and R e a d j u s t m e n t s
V I I I , Translation
This r e p o r t d o n l n w i t h work p e r f o r m e d by t h e C o n t r a s t i v e
Gemant ics P r o j e c t in t h e Department of L i n ( w i : : t i c s of t h e Univer- s i t y of C a l i f o r n i a at Berkeley. The p r o j e c t was s u p p o r t e d by
Air F o r c e C o n t r a c t No. F30602-72-C-0/cc)6, A s s o c i a t e d w i t h t h e
p r o j e c t durin~ i t s entire life, in addition to the author, were
Patrrlcia M. Clancy, Leonard PI. r ' a l t z , Christopher Murcano, and
H a s m i g S e r o p l a n . A l s o active d u r i n g more t h a n half of this period
were Masayoshi S h i b ~ t n n i aqd L i n d a oobek. Associated d u r i n g s h o r t e r
p e r i o d s of t i m e were T e r e s a
M.
Chen, Charles J. Fillmore, R o b e r t E. Gaskins, and Marie-Claude J o r l a t i d . Masayoshl I I i r o s e s e r v e d as a c o n s u l t ant on J a p a n e s e d u r i n g t h e f i n a l two months.Thls s ~ m e r e p o r t , in slightly different f o r m , was published
C e n t r a l t o t h e
vlew
o f
t ~ . ~ ~ r l s ~ u t l o n t h a t w 1 1 1 be preqentedh e r e 1 s t h e notlon o f ~ r b a h z a t l o n , Verb i l l z n t l n n 1 s t h e ~ p p l ~ c n - t l o n of p r o c e s s e s by whlch som?! h o l l s t x c conceptual c h u n k ,
r e c a l l e d
from
memory, 1 sconverted
intosentences
a n d words--lnco
a
phonetically o r g r a 3 h l c a l l g c o m u n l c a b l e llngulstlc r e p r e -sentatlon.
buch a n o t l o n a s s u m e s t h a t t h e underlying c o n t e n to f
what 1 s b e l n g communicated 1 s h o t , or need a3t b e , I n v e r b 3 1 form to begln w l t H .
At
t h e v e r y l e a s t it n a y be a complex sjrszern ofd l s c r e t e elements and r e l , ~ t l o n s , r e p r e s e n a b l e p e r h i p s as a n e t w o r k
of
nodes aria a r c s .It
may
a l s o l n v o l v e m i m 7 o r t a n tnondlscrete
o r a n a l o g component, r e p r e s e n t a b l e
only
m some other t e r m s .o or
excellent
s w a r l e s o f b o t h s l d e s o f t h l - particularlssue
seePylyshyn 1913 and P a l v l o
1977.)
a h a t e v e rmay turn
o u t t o b e thec a s e h e r e , lt seems c l e a r t h a t some s o r t s of p r o c e s s e s must be
a p p h e d m o r d e r t o t r a n s f o r m t h e o r l g l n a l f o m o f s t o r a g e I n t o a verbs1 o u t p u t : t h a t t b e s t o r e d m a t e r l
1
m u s t be verbqllzed.Xn
anypartlcuIar
i n s t a n c e o f t r s ~ s l a t l o n t h e r e are t d o I n -s t a n c e s of v e r b a l l z a t l o n . One Ts the o r l e l n 11 v e r b a h z a t l o n n p r -
formed by the c r r ~ t o r o f the s o u r c e language text. The v t h e r 1 s
t h e v e r b a l l z a t x o n r ~ r o d u c e d
in
t h e t z r ~ e t 1ani:uq:e by th? tl m s l a t q r .3 e s i d e s b e l n g
In
d l f f e r e n t l.ngune;es, t h e s e two v e r b ~ h z a t l o n sare
fundamentally different in one o t h e r r e s p e c t . 2he s o u r c elanguage w r b a l l z a t l o n i s ,
we
mlght say, autononous.It.
1sfreely
language vcrbnlizntlon,
on
t h c o t h e r hnnd, 1 s p a r n o l t l c on t h es o u r c e Imguage one. Not o n l y
must
t h et r a n s l a t o r
adheret o
t h e
rules o f hls own language, he must a l s o produce a verballzatlon
t h a t commun~cates, so f
lr
as posslble, t h e srme u n d e r l y r n g c o n t e n t orknowledge
t h n t was communicatedby
t h e s o u r c e language verbal-l z a t l o n .
?he v e r b ? l ~ z a t l o n
Ln t h e t a r g e t language isthus
subject
t o
thls
s p e c i a l k l n d of constra~nt, I t sp r o d u c e r
isnot
f r e e t o "say what he wants," b u t must insofaras
p o s s l b l e say t h e same t h l n g as t h e p r o d u c e r of t h e s o u r c e language t e x t . bde suggested I n an e a r l i e r r e p o r t t h a t t h e r e are two c h r n e n s l ~ n s of high q u a l l t yt r a n s l s t l q n , whnch
we
t e r m e d n a t u r a l n e s - s and fldelltg, N a t u r a l n e s s1 s achleved when t h e t u g e t language v e r b a l ~ z a t ~ o n adheres t o a l l
t h e
constralnts
of t h a t language;the
o u t p u tw ~ l l
then sound"natural". F l d e l ~ t y 1 s achleved t o t h e extent t h a t
the
t z r g e tlanguage v e r b a l ~ z a t ~ o n communicates the same content as the source
langusge one.
V e s b d l z a t ~ o n
In
g e n e r a l , as we s e e~ t ,
c o n s l s t s ofa
m i x t u r eo f t w o k l n d s o f orocesses: those w h ~ c h necessitate c r e a t l v e de-
clslons on the n a r t o f the v e r b C d l z e r and t h o s e which do n o t , s e m g governed by t h e constraints lrnlmsed by t h e l m f r u a g e . e
rnlght s n e a k o f c r e q t l v e n r o c e s s e c and a l ~ o r l t h m i c p r o c e s s e s . Srea-
t l v e p r o c e s s e s a r e
ultlnately
~ o v s r n e d by thecontent
w h l c h under- lies the v e r b a l l z a t l a n ; t h e verb l l z e r h a s t o declde how b e s t t o verbalize t h a t c o n t e n t . N o r m a l l ya
range o f c h o l c e swlll
be onen t o him, and hemust
d e c l d e what willmost
effectively conveywhat
he has I n rnlnd. A f t e r he has made s s c h cholces, t h e r e a r e
often
p n r t i c u l a r u l e s o f t h e 1:mp;ungc (hut which RTB themselves l i k e l y
t o l e i i d t o t h e n e c e s s i t y of f u r t h e r c r e n t i v c choices). 'de can Say,
t h e n , with r e s p e c t t o t h e two v c r b a l i z a t ~ o n s involved i n a t$ans- l a t i o n , that t h e p r o d u c e r o f t h e s o u r c e language, v e r l ~ a l i z a t ~ o n , h a s a p p l i e d b o t h c r e a t i v e and a1p;orithmic p r o c e s s e s ,
wherehs
i n t h etarget
l w g u o g e v e r b a l i z a t i o n o n l y a l g o r i t h m i c p r o c e s s e s areauton-
omously a p p l i e d , t h e
necessary
c r e a t i v e c h o i c e sbelng
d e t e r m i n e dby
the
c h o i c e s t h a t weremade
i nthe
s o u r c e lan(.;ufq;everbalization.
Thus t h e
naturalness
of t h e f l n a l t r a n s l a t i o n dependslargely
on adherence t o the algorithmic p r o c e s s e s of t h e target l a n g u a g e , w h i l e i t s f i d e l i t y depends on t h e e x t e n t t owhlch
t h e t r a n s l a t i o nhas
been a b l e t o i n c o r p o r a t e q r e a t l v e c l ~ o i c e s t h a t c o r r e s p o n d t ot h o s e o r i g i n a l l y a p p l i e d i n t h e s o u r c e language.
I n
a11 proba-b i l i t y t h e r e a r e cases where e x a c t c o r r e s p o n d e n c e
i n
these c h o i c e sIs n o t p o s s i b l e , and where a
ceqtain
m o u n t of autonomous c r e a -t i v i t y
has
t o be i n t r o d u c e d l n t o t h e t a r g e t v e r b l i z a t ~ o n s w e l l . These are t h e c a s e s where a u t o m a t i c t r a n s l a t i o n becomes nostp r o b l e m a t i c . One u s e f u l g o a l o f machine
translation
r e s e a r c h canbe t o d e t e r m i n e p r e c i s e l y t h e
nature
and e x t e n t o f such c a s e s .We a r e l e d , t h e n , t o the g e n e r a l p i c t u r e o f t r a n s l a t i o n which
is shown
in
F i g u r e1.
The t w o v e r t l c i l l columns r e p r e s e n t the twov e r b a l i z a t i o n s whlch a r e i n v o l v e d :
.In
t h e l e f t the s o u r c e languasgev e r b a l i z a t l o n and on the right t h e t a r g e t v e r b a l i z a t i o n . zhe lnpEzt
t o a t o a
translation
p r o c e d u r e , of c o u r s e , i s an a l r e a d y producedv e r b a l o u t p u t o r t e x t i n
the
s o u r c e l a n g u a g e . Thefirst
majorwhich
it
w:l,c, prorlllced, r)k
i l i d o f " c i ~ ~ c .bml i z ~ . t i o n " , h e 1 r o f c st o t h i s nn t h e p n r s i n P L
c o m ) ~ n ; : n b ,
nlthou(;h i t i n c J n o r l g d i f b f d r . c n tf r o a c ' ; ~ n v o n t i o n n l p n r n i n e . Lt a i m t o
rocanstruct,
n o t n s i n ( 7 l cd q e r ,
structure
u n d o r l y i n y : t h e s1ir'fnf:e t o x t , h a l t r n t hr
n i s ofprocesses
by
whichthnt
t e x twas
z r c ~ j t e r lfrom
t h e knf)tdl pdyrc--nota n l v r ~ o n v c r b
I
bllt r, s s i b l y ever: n o n d i s c r e t e - - w " l ~ h t h e c:3kr:r o rwriter
had
in
~ : l n d . The b u t - ~ ~ u t f ~ f t h e pnrni.n(; c o r r ~ n o n tis
i d o n l l ga c o ~ l e t e r e c ~ r ~ s t r u c t i o n of bot-h t h e c r e a t r v o nnrl
ti-IF
n l r ; r ) r ~ t h n i c q r o c c s s e s which t h e s o u r c e l a n r ; ~ l , y ~ e v e r b a l i z e r ap.111 e d .The o t h e r n n j r ~ r c ~ m : , q n e n t o f t h e t r c a n s l a t i r ) n proced!lre i : i t h e t r a n s l a t i o n componept. I t i s e q u i v a l e n t t o a vnrbql i m t i a n - 1 1 1 trlc
tarr;c?t l a n g u a g e . 'The p r o c e s s e s w k l i c h rn:~Be u~
t
i
v e r b a l - i z : l t ~ n qapem,
to the extent
t h a tthey are
a l c o r i t h m l c ,those
which cxnrrtsst c a r g c t 1nny;uaye c o n s t r a ~ n t s a n d , t o t h e e x t e n t t h a t t h e y
:.T
c r e a - t i v e , t h o s e whlch c o r r e s a o n d t o c h o i c e s a l r e a d g nladen
t h e re-, r
c o n s t r u c t e d s o u r c e l a n g u a g e v e r b a l i z a t i r ~ n . ,:he n e c l . s s i t y of'
r e f e r e n c e t o the sollrcc l m ~ j a 5 9 v e r b a l i z a t , i o n for c r e a t l v t $ c h o ~ c c s
a t
many
r m i n t s 1s suq6:estitd in F'iqure1
the z i r ; z a ~ a r r o w slrie b e l i e v e that- ' t h i r , r ~ i c t u r e p r o v i d e s il p1nu:;lblr. 1)a::ar; T o r
translation re!:(?arch, b u t n f : e r l l ~ ~ s s to m y ~t , ) r e s e n t s rnv!nv p r n b l c ~ n r ,
whose s o l u t i o n s a r e o n l y d i m l y Yoreseen at t h e p?c:;ent t l m c . Uur
p r o j e c t c o n c e n t r a t e d m o r o f i t s a t t q n t i o n an verbnlizntion i t s e l f
than
on parsing o r t p m s l a t i o n , s l n c e b o t h of t h e l a t t e r depend o na prior understanding o f verbalization. Any o t h e r o r d e r m i - o f
priorities would be putt in^ t h e c a r t b e f o r e t h e h o r s e . Any detailed
investigation of
t h e parsingcomnonent
wolild be f u t z l e lf we dld n o ttarget
language
t h e
proccnnes
t h n twent
i n t o
EIp?rt.iairlnr
verbalizatinn.
The k r - n n 8 -l a t i o n
comannent
-
is
Rv e ~ b ~ l i ~ a t i o n , ,
thpr1p;hone
o f
RsneciaL
s o r t ,
and
t h e r ea ~ a i n
a d e t a i l e dunderstmrlinp;
o f v e r b n l i z n t i o np r o -
c e s s e s
i s
necezsary.
This
r e p o r t ,t h e n ,
will
bemost cr)ncerned
wi.tht h e
n a t u r e
ofv e r b a l i z a t i o n .
We w i l l a l s o d e v o t e c o n s i d e r a 7 ~ l e spacet o t h e
n a t u r e
oft h a t s p e c i - a l
sort
o f v e r b a 1 i z : r t i o nwhich
i n
t r n n s -
l a t i o n . 'We
will
havet h e
l e a s t t o sayabout
p a r s i n c . Examples w i l lbe c i t e d f r o m
English
andJapanese.
F o r bout t h e l a s t nine months o f t h e p r o j e c t
we
were
concerned
w i t h
the
development
o f :m i n t x r n c t i v e computer pro,p;ram t h n twould
implement t h e
v e r b a l i z a t i o n
n r o c e s s e swe
h y - p o t h e s l z e d . f ~ l t h o u p - ht
i
prQy;ram remainedprimitive,
t h e intentionwas
t h a ti t
would
~ r a d u a l l ~ achieve i n c r e a s e d sophistication
i n
its
a b i l l t gto simu-
l a t e verb:
l i z a t i o n ,
t r a n s l a t i o n ,and garsing. As it presently
simulates
t h e D r o c e s s e s o fv e r b a l i z a t i o n ,
i t
b e e i n s
with
an itemt h a t r e p r e s e n t s t h e i n i t i a l
holistic idea
which
t h e sneakeror
writer
o f at e x t
wishes
o c ~ ) n m u n i c a t e . I tthen
asks t h euser,
s e a t e d a t a t e l e $ t y n e , t o
make
t h e s e r i e s o fcreative
c h o i c e s t h a ta r e h e c e s s n r y kn
the
production o f t h e f a n a l t e x t . lit t h e samet i m e
it
a t t e m p t s t o anilly onits
own t h e a l ~ o r l f h m i cp r o c e s s e s
w%ich a-e c a l l e d f o r .
I t
knows when c r e : l t l v e c h o i c e s are n e c e s s a r y ,b u t must a s k t h e u s e r what
c h o i c e s
t o make.Ideally
i t
s h o l ~ l d bea b l e
t o
anply t h e a l e o r i t h m i c p r o c e s s e s w l t h o u t h e l p .As
it
s i m u -l a t e s t r a n s l a t i o n
i t
s h o u l dlikewise
be a b l e t o a p p l ythe
algorithmicDrocesses
o f t h e targt:t l a n ~ u a g e a u t o m a t i c a l l y , andalso
to
a p p l yon
its
own
by lookinga t
t h e s o u r c e 1onp;uaf;ev n r b a l i z a t i t ~ n
t osee
wnat c r e a t i v e c h o i c e s
were
made t h e r e . hhenever j.t i s n o t a b l e t omake
ac r e a t i v e
c h o i c b , t h e prop;r:un a s k s t h e u s e r t o do s o . efind
t h a t thiskind
of machine-user i n t e r & i o nwovides
a
valuable
research
technique.Taking
as oui- u l t i m a t e g o d t h e e v e n t u a lelim-
i n n t i o n of t h e u s e r
from
t h e t r a n s l a t i o n Rrogramaltogether,
westart with
a
s i t u a t i o nin which
t h e u6t.rfntervenes
a t manypoints.
As we
learn
morewe
can
g r a a u a l l y g i v e t h e machlne mope t o do andt n e u s e r l e s s . This
technique
can
be f o l l o w e d n o t o n l y i n v e r b a l -i z a t i o n ,
but
a l s oi n
p a r s i n g 'ulhetner r;he u s e r w i l l e v e n t u a l l yd i s sppear from
t h e
~ i c t u r e a l t o g e t h e r i s u n c e r t a i n .However t h a t nay b e , t h e g o a l a1 a pro.;ram
in
which t h econti-i-
bution of
t h e u s e ris
significantlydiminished
in
r e l a t i o n t o t h a to f t h e nachine seems worsable. S h o r t of
the
f i n a l g o a lo f
e l i m i -nating
t n e
u s e r a l t o g e t h e r , an i n t e r m e d i a t e g o a l i d e n t i f i a b l e as'human-iided" machine t r a n s l a t i o n can more
easily
beforeseen.
Here
the machine will
do the manythings
f o rwhich it
is
s u i t e d ;b u t
a
humanbrain
will
beintroduced
=at
t h o s epoints
where t h emachine has
reachedi t s
l i m i t s . This intermediate goalhas,
w eb e l i e v e , s i g n i f i c a n t p - ~ a c t i c a l as well a s t h e o r e t i c a l v a l u e .
Funding
f o r t h i s p r o j e c t c e a s e din
June1974.
The r e p o r tmubt
be r e a d , t h e r e f o r e , as a s:mmary o f work t h n t was interrupted inmid-course,
and-asa
p a r t i a l blueprint T o r f u r t h e r w o r k s h o u l dt h e
necessary
funding e v e r m a t e r i a l i z e .At
t h i s p o i n t ,six
months a f t e r t h e termination o fthe
p r o j e c t ,the
need f o r v a r l b u s modlfl-cations
is
a l r e a d yevident.
I t seems b e s t , howeven,to document
trying t o i+ntroduce
now
andu n t e s t e d m a t e r i a l .
11, Subconcept u a l i z h t i o n
nle assume t h a t a s p e u e r
o r
writer
b e g i n s w i t h a s i n ~ l e ,u n i t a ~ y j h o l i s t i a
concentual
chunkt h a t
he has r e c a l l e dfrom
memory and has d e c i d e d , f o r some r e a s o n t o communicate.
Thus
he
may
nave
i r
mind some i n c i d e n t i n which he was i n v o l v e d , somethingo f i n t e r e s t
he
was
p r e v i o u s l y told a b o u t o r read a b o u t , someex-
periment he wishes t o
r e p o r t
on,or
whatever.
delabel such
a
c h U , as w e l l as t h e
smnllmer
chunks into
whichit
will be analyzed,with tlie p r e f i x
CC
( f o r "conceptual chunk") followed by a f o u r -d i g i f
u b e r .
h e f i r s t digit i n d i c a t e s t h e lanrruwe i nwhich
v e r b a l i z a t i o n i s t o t a k e p l a c e ("1" f o r E n g l i s h and "2" f o r
J a p a n e s e ) , and t h e remaiaing t h r e e d i g i t s c o n s t i t u t s an
arbitrary
index--for t h e p a r t i c u - l a r
chunk.
'fhl*s-e%1001 might
be t h e namegiven t o
some
p & r t i c u l a r chunk o f t h i ss o r t
that
i s a b o u t t o be v e r b a l i z e d i nxnglish.
We assume. futhermore,
that
w h i l e t h i s chunkis
f r o m onep o i n t of view a
w i t ,
f r o m ahother p o i n t o f view i t has a more o r less r i c h c o n t e n t n , <aIrd t h a t 1 . C i s t l - L ~ S c o n t e n t which t71es p s a k e r . w i s h e s t o convey t o
his audience.
Sometimes,though
n o ti n m o s t c a s e s , t h e i n i t i a l
chunk
i t s e l fmay have
a linguisticlabel.
If
i tis
a f o l k t a l e , f o r example,i t
mayhave
a namelike
" C i h d e r e l l a " o r 'lThe Three Bears".
But
someone who has decidedt o t e l l
a
s t o r y i s n o t l i k e l y t osay
ju'st " C i n d e r e l l a " and l e ti t
go at
that. (One
i s reminded o f t h e o l d s t o r y about a conventione l i c i t e d l a u g h t e r aach
t i m e
because everyone knew t h ej o k e s
t h e s enumbers
s t o o df o r . )
N o r m a l l y
i t
i s n e c e s s a r y j n s t e a df o r
t h e
s p e a k e r
t o
g e t i n s i d e the contentof
thisinitial
u n i t - - t o a n a l y e e it i n t o s m a l l e rchunks.
T h i s k i n d of p r o c e s s can be p i c t u r e d a sshown
i nFigure
2 , where t h einitial
chunkCC-1001
has Seen, as wesay,
subconceptualized
i n t o
chunks CZ-l002-&1nd Cd-1003.In
at e x t
o f any
s i z e
eachof
t h e s esmaller
chunks w i l l befurther
brokendown i n t o s t i l l s m a l l e r ones, and sp on, s o that a h i e r a r c h i c a l s t r u c t u r e o f s u c c e s s i v e l y s m a l l e r s u b c o n c e p t u a l i z a t i o n s emerges.
Subconce~tualization
belongs-
t o t h e class of v e r b a l i z a t i o nprocesses which are c r e a t i v e . N o r m a l l y
a
chunk
d o e s not auto- matically determine a particular s u b c o n c e p t u a l breakdown, but t h es p e a k e r
must
c r e a t i v e l ychoose
how
to
subconceptualize eachone.
It
is useful
t o t h i n k of t h e c o n t e n to f
each
chunk--each c i r c l e i n F i g u r e 2--asif it
werea
m o u n t a i ~ o u s l a n d s c a p e , w i t h t h emost
salient
a s p e c t s .-tanding o u t i n boldrelief
and t h e lesssalient
appearing as o n l y minor hills. kll o t h e r
t?ings
b e i n g equal, t h emore salient sople a s p e c t of t h e total c o n t e n t
is,
the more l i k e l ythe s p e a k e r i s t o e x p r e s s i t when he subcoaceptualizes. R e is not likely
t o
make e x a c t l y t h e same subconceptual breakdown each t i m e he communicates the saneinitial chunk,
partly b e c a u s e he m a yjudge different t h i n g s 50 be s a l i e n t
in
differentcontexts
andp w t l y because t h e landscape
itself
may
change over t i m e ,the
r e l a t i v e s a l i e n c e of
its
d i f f e r e n t ~ L S D ~ C ~ S b e i n g modified i nlong-term memory. IJe assume
that
any p a r t i c u l a r subconceptuali-zation n e c e s s a r i l y leaves out p a r t o f the c o n t e n t o f what i s
t h e
l * l r ~ e r
c i r c l e b u t o u t s i d e t h e two smallerc i r c l c s
i n P i e u r q 2 .S u b c o n c e p t u u l i z a t i o n , t h a t i s , is necessarily a s e l e c t ' i - v e [ ~ r o c e s s . No one e v e r says e v e r y t h i n g he
could
say aboutwhat
he
hasin
mind.
~~bconceptualization o f R p a r k i c u l a r chunk, say
GJ-1001,
p r o - duces two o r more h e wchunks.,
sayCC-1002
andCC-1003.
These
new Chunks, f u r t h e r m o r e , a r e conceivy.dof
as related t o each o t h e r i nI t
some way. F o r example,
3;-1002
m i g h t be t h e reason" f o r2C-1003.
Suppose t h e
e n t i r e
t e x t consisted o f -thesentences,
"I
bouqht abi-ke yesterday.
I
d e c i d e d I need more e x e r c i ~ e . " L e t u s s s y t h a tthe
f i r s t sentence isa
verbzlization of ZC-1003
and the secondsentence
o fCC-1002.
d e can say t h a t5s-1002
i s t h e r e a s o n f o rCC-1003. de w r i t e a s u b c o n ~ e p t u d i z a t i ~ f l p r o c e s s o f
this
k i n d in r;he f o l l o w i n g way:1) JC-1001 S> CJ-KE'ASON
(CC-1002,
32-1003)This
statement says t h a t theinitial
chunk,
CZ-1001, is
sub-c o n c e p t u a l i z e d ( S > ) i n t o
the chunks CC-1002
andCX-1003,
and thytth'ese two new chunks n r e r e l a t e d by the p r e d i c a t e labeled CJ-:1EAiLilN,
The p r e f i x
JJ
s t a n d s f o r " c o n j u n c t i o n f 1(derived
f r o n t h e ~ r m r n a t i c a l ,7 ( ' ( (
not the logical
use of thistern).
m y r e l ~ t i o nbetween
Y d sis
l a b e l e d w i t h this p r e f i x .
'VY'e use a different
notation
to r e p r e y e n t each of t h e variouss t a g e s
in
theverb-lization
process. - r ~ the o u t s e t , i n t h l s example the initialchunk JC-1001
was a l lthat
was p r e s e n t . Thisinitial
r e p r e s e n t a t i o n , before any v e r b n l i z a t i ~ n processes had beer] a ~ j p l i e d ,
was siaply:
2 ) ca-1001
Subconce~tu6lization
p r o c e s s e s
o r e +;bus ' r e w r i t e r u l e s , wh'ichr e p l a c e
one
s t a c ein
a v e r h : j l i a n t ~ o nwith
a subsequent stage, The f o r m n tw t e ,;
t o
r e p r e s e n t s d c h stn(;es, as i n3 ) ,
shoiJs p r e d i c a t e s w i t htheir
argumentsw r i t t e n
i n d e n t e d below thenl*In
simulating
verbalization
o u r
programppesently
aokls t h e u s mt o
s p e c i f yall
t h ec r e a t l v e
choices,
restricting
its own
contribution
to the application
of n l l ; o r i t h n i c nrocgsses d e t e r m i n e d bythe
crammar o f d i s c o u r s e , s e n t e n c e s , and w o r d si n
the lanq-uage involved. ?heprogram i s l a b e l e d VAI) ( f o r " v e r b s l i z c - L and t r m s l a t o $ ' j , :md we
can i l l u s t r a t e convcrsntionr; netween
VA?
: ~ r d t h . ~ u s e r identifyin@ themas
V
and L'respectively.
The p r o c r a m . b&[;ins by asking:r v - r ?
4)
V:
,V i
A 3u !ilT? ~ . i ~ l ' ~ ~ , L ~ i 2 d . t o wh;ch onepossible
m s w e r is:5)
U:
V i K B l i L I Z E 2:-1001S k i p p i n g s e v e r a l s t e n s
t o
illnstrat:
u n l ; ~ the ~ c r u ~ ; ~ ~ o u t l i n e s o fs u b c o n c e p t o ~ a l i z n t h Y ,
wt?
T C i n t o v b ? - ber{ j u s t no:.r i n t!:e ~ l e q t iqn:6 ) V: 110,~
1,;
X-1001 ,LXJ13,~\4!! XiJ;'[; Li 5.iU?t 3
l t ~ h l ~ h ~ O S S ib1.e m s w c r i s :7 )
U:
L A G(:?-1002,
22-1006)At
tSlis n ~ i n t 'JI.+T w i l l c o n : - t m c t the representation shown I n3 ) -
s p ~ o l , : r . a 1 s
In
g i v i n z jnnswer l i k e t h c tIn
7) the user of t h - 4a s s u ~ e d t o be :oKlng e ~ p i i c i t n d e c l a i a n w l i c h a m n l s e + e r
would
can a t
l e a s t
i n t r o d u c e
t-hedecision
i t s e l f
i n t o
t h ov e r b r ~ l i z n t i o n
model
a t
t h i s stage.VAT
will
now
apply txnal~orithmic
o r ,
anwe
s a y ,
syntactic
"l
process
t r i g g e r e d by t h e presence ofZJ-REASON
i n
3 ) .
l h e p r o c e s sapplied is o f a
type
t h a )i s
a l s onot
c l c a r i yunderstood,
b u t We mayview
what we d oat
p r e ~ e n t afirst
a p p r o x i m a t i o n . \Ji'i't t h emoment
VAT
s i m p l y t a k e s the t w b ZCs r e l a t e d by3'-REXSOIJ
and o r d e r sthem s o t h a t t h e second
will
be express'ed b e i g r ethe
first.
Thati s ,
f o r exam't~le,if
SC-1002i s
e v e n t u : ~ l l ygoing
t o h e verbalizes as" I d e c i d e d
I
need moreexercise"
-md2C-1003
as"I
bought a b ~ k e yesterday", we want t h etwo
s e n t e n c e s t o beexpressed,
w i t hC2-1003
preceding
CC-1002.
Thus VATwill
a u t o m a t i c a l l y change t h e r e p r e - s e n t a t i o f li n
3)
t ot h e
following:
'Phis kind of ~ e p r e s e n t a t i o n , i n which
no
p r e d i c a t e is shownaoove
the two CGs, i n d i c a t e s t h a t t h e y
( o r
t h e i r e v e n t u a l v e r b a l i m t i 1 , n s ) are t o occur i n t h efinal
t e x t i nthe
order. shown, w i t hdz-1803
p r e - -ceding
CC-1002.I n
J a p a e s e t h e c o r r e s p o n a l n gsyntactic
process w i l l t y o i c a l l yl e a d t o t h e attachnent o f CJ-"KAdk" a t
t h e
end of t h 6 secondsen-
tence.
phusif
arepresentation
l i k e t h a tin
3 )
were
producedi n a
Japanese v o r b : i l i z a t i o nVAT
would a u t o m a t i c a l l y cl~angeit
t o :The q u o t a t l n n
marks
aroundindicate
t h a tthis is
an
i t e ma r e
usedf o r
i t e r h o t h ~ t have EI ~ u r f ~ c e L e x i c a l r e p r e n t l n t a t i o n .The r e p r e f l e p t r i t i o n
in
9)
i . s d e f i c i e n t j.n t h a tit
f l a i l s t o showt h a t CJ-"KAiiAt'
will
be p a r t o f tht: same sentence asCC-1002,
whereas"v-1003
will ( o r islikely
t o ) f o r m ad i f f e r e n t
seotence.
Sle i n d i - c ~ a t esentence
b o u n d a r i e s w i t h t h e n o t a t i o nCJ-
".
",
'since t h e p e r i o dwill a.ipear
in
t h e f i n a l text. T h ~ sfulLer
v n r g i l ~ n s o f 8 ) and7 )
a r er e ~ ~ r e c t
i v e l y sThe c r e n t i o f l ~f t h e s e : ) e r i o d s
is
r* hi>usekeepinp; t a s k t h a t rleed n o tbe d e s c r i b e d
in
d e t a y l h e r e .Given
a
r e p r e s e n t a t i a r l i k e t h n tin
lo),
VAT
w l l l ::o on t oask
a b o u t t h e s u b c o n c e r ~ t u n l - i z n t i 1 , n of f h e f i r s t d z in the o r d e r i n g . ?he
g e n e r a l p r i n c i q l e f o l i o w e d h e r e
is
one o f " d e ; ~ t h f i r s t r ' ,in
t h e s e n s et h a t e g r l i e r i t l s m s . in t h e text a r e Tom letely vc?rS ~ l i z e d bi:fore tile v e r b a l i z a t l ~n o f l a t e r i t e m s i s b e l v n . h i s p r o c e d l ~ r e :,robably 11ns
some ~ s : ~ c h o l o g i c a l vnl i d i t y ; t h : l t is, a speCaker is l i ~ e l y t o t t n n k
o f l a t e r p a r t s o f what he
is
rroing t o s n y o n l y in t e r n s o f t h e m o s t g e n e r a l chunks, while he is elaborating t h e earlier n a r t s ln d e t n i l .Only
a f t e rne
has finished t h e v e r b : ~ l i z a t ' i o n o f t h e s e e a r l i e r p a r t s w i l l het u r n
h i sattention-to
afull
verbalization
o f t n el a t e r ones.
Thus,
ositting
v a r l o u s c o n s i d e r a t i o n s n o t b q g e t discussed,12) V: LJ1ii1T VAT TASK
DO
Y'dU J,,NT PERI;'Ol(MiSD?U:
VERBALIZE C C - 1 0 0 1(VAT
c r e a t e s
t h efollowing
r e p r e s e n t a b i o n :
)Vs
HOW
I S CC-1081 S U B Z O N C i ' l i ' T U f i IZED?
(VAT creates
first
the following r e p r e s e n t a t i o n : ) CJ-REASON
CC-1002
CC-1003
(and
immediately applies a s c o r e d syntactic a l f ~ o r i t h m t h a t changesit
t o : )V:
HOW
I S dC-1003SUBCONCEPTUALIZED?
e t c .
In
this
fashion a s u ~ c o n c e p t u a l hierarchy of any degree o f com-p l e x i t y can be
c o n s t n ~ c t e d
and
expressed.
The o r g a n i z a t i o n o f a t e x t
may
not,
beentirely
hierarchical.
however. Not only does a speaker b r e a k down l a r g e r chunks i n t o
I I
smaller chunks--larger concepts"
into
subconcepts; one chunk may also remind him o f a n o t h e r ,s o
t h a t t h e organization whichresults
may be
in
p a r t conc-atenative. d e have been viewing c o n c a t e n a t i o nin
tepmso f
excursions away f r o mthe
mainhierarchy,
a d hn-ve been c a l l i n g such excurshm 9 r e s s i o n s .In
some discourse, however,t h e r e
is no
necessary c o n s t r a i n t t h a t t h emain
hierarchy Se re-turned t o , and the result may be a r a m b l i n g
t e x t
in
whichdigression
is
added to digression.In
a more tightly organizedtext digressions
which
q u i c k l yr e t u r n
t o
t h e main hierarchy.We
uoc
t h et e r n
parentheeis
f o r
t h i s b r i e f andtransient
k i n d of d i g r e s s i o n .If
subconceptual.ization
be r c p r e s e p t e di n terms
ofa t r e e
diagram
(which
doesnot,
however, p r o v i d e a convenientmean$
o fshowing
therelations
between subcoqcepts, l i k e CJ-BEASON),t h e n
d i g r e s s i o n s can be p i c t u r e d
as
s u b t r o e s a t t a c h e d t o t h emain
t r e ea t one
p o i n t o r another,as
sur;gestodi n
F i g u r e. 3 .
One o t h e r
important
m o d i f i c a t i o n of t h es t r i c t l y
h i e r a r c h i c a lmodel
of. s u b c o ~ c e p t u a l i z a t i o n r e s u l t s from t h e common occurrenceo f
summarization.
Iti s
f r e q u e n t l y t h e case i n v e r b n l i z a t ~ o n t n n tan
i d i t i d chunk w i l l be s u b j e c t t o t y o ~ e p ~ 3 r a t e h i - e r a r c h i e s o fs u l ) c o n c e p t t d i ~ a t i o n , one of
which
can beidentified
3s asummary
of
t h e
o t h e r .It
i s
c h ' a r a c t e r i s t i c of :rsummary
t h . 4its
subcon-ceptllal i z a t i o n prclcesses n e v w proceed beyond some r e l a t i v e l y
large
chunks--cnunks
which packagea
relatively
l a r g e c o n t e n t . e canc o n t r a s t a s u b c o n c e p t u ~ l i z a t i o n h i e r a r c h y which i s a summary with
a h i e r a r c h y which c o n s t i t u t e s t h e body of t h e text and c o n s i s t s o f subconceptualization p r o c e s s e s thxt produce a lar.:cr number o f chunks o f s m a l l e r s i z e .
A surlrnary
is
t y - p i c a l l y expressed at the beginning o r end o fa
t e x t ;
thst
i s , preceding o r f o l l o w i n g t h e body. V a r i o u sconventions
f o r summaries a r e a s s o c i a t e d
w i t h
d i f ' o e n t genres o fwriting.
P o r example, a s c i e n t i f i carticle
may
b e g i n withthe
eel$-conscious k i n d ofsummary
that i s c a l l e d an a b s t r a c t ; a news r e p o r ttypically
con-tains
an
opening par3graphtelling
wllu,
whqt, where, and when; af a b l e i s l i k e l y t o end w i t h a m o r a l , and s o
on.
Our program a tsurnmnry
(one
cxprosr.ed
a t
t h e b e ~ i n n i n p j
o ft h c
t e x t ) .
I f
t h e
answer i~ y e s
i t
asks
f i r s t
f o r
s u b c o n c e ~ t u n l i z a t i o 1 1
o ftho
sum-
mary,
and
moves
on
t o
ask a b o u t t h e bodyof
t h e
t o x t
o n l y
a f t e r
t h e
summary
has
been
completely
verbalized.
n t t h eend o f
t h et e x t i t
asks
whether
t h e r e
is
af i n a l summary.
C w a t i v i t y
w i t h i n
adiscourse is
likely
t o bel i m i t e d by
t h egenre
t o
whlch t h ed i s c o u r s e
belongs.It
would
a.jpear
t h a tt h e r e
is
acontinuum
ranging
from
mnximally storeoty-ped
t o mcmimnllyc r e a t i v e
discourse.
Plost
stereotyped
are
those
forms
o fdiscourse,
such
asr i t u a l s ,
in which
t h e s p e a k e r has
very
l i t t l e
choice
as t owhat he
is
going
t o sayo r
how
he
i s
goinf:
t o sayit.
l l i t h w e i rdiscourse
t h e"grammar"
o f t h egenre
p r o v i d e s
many
o f
t h eanswers
t o t h e
q u e s t i o n s
VATwould
o t h e r w i s e have
t oask
t h e
use-r.
I n o t h e r
words,
VAT
s h o u l dbe able
t o produce
r i t u a l
t e x t s
with
mininurr
recourse
t o
c r e a t i v e
decisions.
K t
t h eo t h e r
extreme
xre
forms
o fdiscourse
such
osdescriptions
o funiauc
+p e r s o n a l
exncriences
whicn
have
n e v e r
b e e n d e s c r i b e d b e f o r e ,where
t h espeaker
1 s r e l a t i v e l yf r e e
t o l a k e a'reat v a r i e t y
o fc r e a t i v e
docislons.
\lie
b e l i e v e
it
would
beo f
considerablei n t e r e s t
t o i n c o r p o r a t e
i n t o
t h e verbalizationp r o c e s s
t h ec o n s t r n l n t s
i l n ; ~ o s e d bys e v e r a l
d i f f e r e n t g e n r e s , b u t
we
haven o t
as
y e t
donr: this.As
it nows t a n d s
o w
program does ask J I A T Is' '2B<
GZlu?i?? as soon asit
hase s t a b l i s h e d
that
a v e r b a l i z a t i o ni s
t o
be performed. P o s s i b l eanswers
t h a t we m l d l i k e t oimplement
in
t h ef u t u r e
are,
f o r
rn
exrunple
ofthese procedure8
ar:
a n p l i e d t o a r o a lt e x t
c a nbe based
on
t h efollowing United Press r e p o r t
trrken,
sl
ightly
con-densed,
f r o mt h e
em
Francisco
Chroniclo
o f
M a y 16, 1974313)
1.
An
11-ye:lr-old boyusing
anew
"super-glue"
2.
acciirenfally
glued h i s
e y eshut
3.
while
building
amodel
i r l a e ,
4. and a
d o c t o r
had t orenpen
t h e e y esurgically.
n i k e
Harris
said6. he rubbed his l e f t e y e
7.
a f t e rs e v e r a l
d r o p s o f t h e g l u e s q u i r t e di n t o
it
last
Sunday
8. and
found
his
e y e l i d w o u l dn o t move.
9.
An
eyesurgeon
debated b r i e f l yabout
10.
using
a s u p e r g l u esolvent
11.
b u t decided a g a i n s tit
12.
f o r
fear
i t
might damqe;e t h eboy's
eye.13. 'Phe surgeon, who asked n o t t o be i d e n t i f i e d ,
1 f i n a l l y p u t Plike
in
t h e o p e r a t i n g room, 15- tri:,inedMike
'
s e y e l a s h e s ,16. t h z n opened t h e e y e l i d s u r g i c a l l y .
1
Mike was
r e l e a s e dfrom
t h e h o s p i t a l Tuesday.It
is
a-:proximately t h e casethat
e a c h o f t h enunbered
l i n e sin
thist e x t
expresses a t e r m i n a lsubconcept
( s e e b e l o w ) . :ie assume t h a tt h e
t e x t
contains
a nllmbero f
i n t e r m e d i a t e
subconcepts
aswell,
Let
U B S U ~ P O B B th:& t h ew m b i n n t i o n
o f
VATand
the u n e r
a r e
attempting
t osimulate
t h ev e r b n l i z a t i o h proces::es
t h a twent i n t o
t h e
~ r o d u c t i o n
ofthis
t e x t .
For t h emoment
we
&reconcerned
o q l yw i t h
subconceptualizati~n
p r o c e s s e s
( a n d ,a s s o c i a t e d
syntactical-
gorithms).
limyo f
t h eu s e r ' s
answers
i n
t h e f o l l o w i n gconversation
w i t h VAT1
a r e
intuitively
based.'lfhe
successo f o u r
e v e n t u a l
parsingcomponent
will
depend
on
t h ee x t e n t
t owhich
t h e s eintuitive
an-swers
can
bepredicted
f r o m t h et e x t
t o g e t h e r
withwhatever
items
o f background
knowledge
w e
relevant.
ihe
example w i l l becarried
o n l y f a r enough t o sup;gest
t h e
n n t y r e o f t h eprocedure.
n
he
exchanre
b e 6 i n s
in the u s u a l way:
VAT
c r e a t e s
t h efollo~ing
representat
i o n ,
including
a text-f i n s 1p e r i o d :
VAT's
n e x t
q u e s t i o n s e e k s t o e s t a b l i s h whatgenre c o n s t r a i n t s
a p p l yin
t h i stext:
6
v:
A!l*lT Iii ?Hi2 GJ22TRd2U: E 3 V L L ~ C ~ ' O H T
VAT will
now assume t h a t the t e x t is a t y p i c a lncws
r e ~ o r t whichbegirls w i t h a sll:runar:r.
Its
f i r s t questions w i l L d e a l w l t h t h es u b c o n c e ~ t u a ~ i z a t i o n
o ft h e
s m a r y (expressed i nthe
text
i n
s e n t e n c e s
1L4)
:1 7 )
V:
HOUI:]
CC-1001
SWC,NGEPTGhLIi;EDIN
Ti12 dU;iMjiliY?the
u s e r
has answered t h a t ;he fi:*st breakdown o f t h e summaryi@
i n t o two subconcepts;
CC-1002
( t obe
expressed as"fh
1 1 - y e a r - U L ~ boy using a new I'm*z l u e "
Bccitientally c l u e dhis
e y e s h u t while building a model irplane")and
CC-1003 ( t o be e x p r e s s e d as "ad o c t o r had t o reoTen the eye surgically"). Furthermore the relation betireen t h e s e two J C s has Seen identified as one l a S z l e d YILLD, in
which the first Z C "leads t o I t or '\rsstGLts dn" t h e second.
YIELD
di f f e r s from a n o t h e r , similar r e l a t i o n which is l s b e l m d C A U 6 5
in
t h a t t h e event conceptualized by the s e c m d
CC
is n o t a necessary consequence o f t h e first. -It i s , however, something t h a t presumablp1 ,-'
would n o t have happened if the event c o n c e p t u a l i z e d by t h e f l r s t J~
had not t a e n p l h c e . (Zvidentls Y I 3 L 3 can be equated w i t h I ~ < I T I A I Z
as
this term is used by Humelhart1974,
the
r e l a t i o n s h i p between an e x t e r n a l event and the willful r e a c t i o n o f an ; m t h r o p o m o r o h i z e dbeing t o t h a t event. 3chank
1974
u s e s 1:JLTIAICE d i f f e r e n t l y . ) As a result of the user's answ-?r in17)
VAT f i r s t c r e a t e s the -.-epre-sent
at
ion:C J-YIELD
and immediately a p r . l i e s syntactic Q r o c e s s e s which Changes it t o :
' F h n t is, t h e two 32s are
to
he e x p r e s s e d with tte "pielderl' pre- ceding the " y i e l d e d " , and they are to be connected w i t h c o m afollowed by the word " L J D " . This is g a t t h e o n l y way which
c e p t u a l i ~ a t i o n
o f
the
e a r l i e s t
CCin
19):
20)
V:
WOWIS
CC-1002CUDGONCISlJTOALIZED
IN
TI143 C\JMMkd?Y?The
user
has answeredthat
CC-1002i s
brokendown
i n t o t,wo
CGs,CC-1004 ("buildir~g
a
model airplane") and C C - 1 0 0 5 ("An 11-year- old boy using a new "super-glue" a c c i d e n t l y g l u e d h i s e y e shut"). They are r e l a t e d byPKAMSU,
a
temporal r e l a t i o n i n w h i c h T i e f i r s t CC o c c u p i e ss
t i m e
p e r i o d l a r g e r t h a n rind i n c l u d i n g t h e tirne p e r i o dof
the second.In
o t h 9 r words t h etime
p e r i o d of3C-1004
includes t h a t ofCC-1005.
VAT
c r e a t e s ,
s e q u e n t i a l l y , t h : following twor e p r e s e n t a t i o n s :
Although there may be several possibilities forq the e ~ p r e s s i o p
~f F m , viLT h a s assumed h ~ t : t h a t t w o f a c t o r s ape involved: a n o r a e r i n g of t h e t w o C Z s s o that t h e " f r a m e r t t p r e c e d e s t h e "framed", and a prefixing of the word
"LfiIL..~"
t o thefirst
122.(In
this
[l / Y
example
the
o r d e r i n g Of t h e s e t w o A S will be r e v e r s e d in as ~ b -
l o w an a r t i c l e of f a i t h .
W
e
would
e x p e c t
VAT
t o asknext
a b o u tt h e
s u b c o n c e p t l l a l i z ~ t i o n
)f C C - 1 0 0 4 , but by a meand not y e t d i s c u s s e d
VAT
w i l l d i s c o v e r thati s is a
terminal CC
(onenot
f u r t h e r s u o c o n c e p t u a l i z e d ) . I fI1 I t
I' AND", VAT would proceed t o
:C-1004 were followed by
.
o r
by,
n
~ s k q u e s t i o n s d i r e c t e d a t t h e comalete v e r b a l i z a t i o n o f t h i s uC.
3ut since CC-10u4
i snot followed
by one q f these boundaries,2 t t e n t i o n i s -ne*t f o c u s e d on
CC-1005:
23)
V:
HOWIS
CC-1005 SUBCoNGEPTUALIZ5lJ I NTHE
SIlMMAriY?VAT
creates the following represcntation:2 4 ) S J - ' ~ ~ W S I L E ~
CC-1004 CJ-PHAME
CG-1006
cc-1007
C J-"
,
AND"cc-1003
I t I 4
CJ-
The user has said t h a t
CG-1006
( " a n11-year-old
boy u s i n g a new1 (1
-
"super-qlue"") o c c u p i e s a
time
p e r i o d which i n c l u d e s 1007 ( " h c -cidently glued
his eye shut").
So f ~ r we wuld
e x p e c t t V 1 i . s secondi n s t a n c e o f
PRfiE
t o be e x p r e s r x d by prefixing t h e w o r d "liilllL.<" t o 25-1006, as was done in 2 2 ) . L e t us s u p p o s e , h o w - t > ~ r e r 9 h:.lt I~':tAPiLa c t u a l l y triggers a more
complex
a l g o r i t h m which says i n e f f e c tt h a t one
"WHILE"
i n a sentence is enough, and that a s e c - ~ n d instancof PHJWIE w i l l l e a d to a different ex~ression. Here the second
1nsr;ance leads
to
the creationof
arelative
clause
whichwill
modify
one of the c o n e t i t u e n t s o f CC-1007. Furthemore, thea l r e ~ d y
created
"WHILE"
clause
w i l l bemoved
t oa
p o s i t i o n a f t v ':G-lOO7.would be
slightly
less
d e s i r a b l e ,
f o rexample,
t nproduce
"While hetl
W R ~ b u i l d i n g
a
model
a i r p l a n e an l l - , y e n r - o l dboy,
u s i n g
a
new
s u p e Iglue"
,
e y e s h u t . I 1C e r t a i n l y ,
the
d i f f e r e n c e s
i n
thia
a r e a
a r e very s u b t l e . ) We w i l li.ndiLate
t h er e l a t i v e
clause
s t a t u s o f CC-1006,t o
be embedded wiatkln t h eex-
pression w i t h slash n o t a t i o n :
The
r e p r e s e n t a t i o n i n
25) will
be discove-redto
be the f i n a l one i n t h e s u b c o n c e p t u a l i z a t i o no f
t h e s u m m m , which h ~ s beenfound
t o
co'nsist
of f o u r CCs ( u l t i m a t e l y f o u r c l a u s e s ) j o i n e dt o g e t h e r
i n
t h e manner i n d i c a t i l d .VAT
will
now proceed t o verbal-ize t h e summary comoletely, making u s e of o t h e n k i n d s o f p r o c e s s e s . Wnen
t h a t
has been done, i t w i l l s s y :SUB",NCEPTUALIZi3D?
U:
YIELD ( G C ~ O ~ P , GC-1003)T h i s is, o f c o u r s e , the same
answer
t h a t wag :;iven t o t h e c o r r e -s p o n d i n g questlon
i n
17).
aboveAS
GC-1.002 andJC-1003
a r e f u r t h e r e l a b o r a t e d , however, rnany d i f f e ~ e r i c e ~will
ertlerge.Ult
iniat e l yCC-1002, Phich was e x p r e s s e d i n
sentences
1-3
o ft h e
Summarv- w i l lbe expressed i n the bna;~ of
t h e
Bextin
sentences
5-8. C C - l O o j , expressedin
t h esummary
-3ssentence
i+, w i l l be e x p r e s s e din
t h e b o d yin
sentences 9-14.W b w i l l n o t r e p e a t here- the ~ p e r ~ t i o n s involved
in.
the sub-p e r t
similar
t o t h o s e ill:^^ t r a t e d above. V a r i q u s o t h e rr e l n t i o n ~
Setween
':Js
.we
i n;roduce(i: f o r
exam d e , t h a t'letween
CG-1015
( M eye s u r g e o n d e b a t e d
brlcfby
i ~ b o u t uqiny; as u p e r
g l u ecolvent
but decided : ~ ~ a l n a t
it
for
f e a ri t
rnicht dnmal:e t n e b o y ' s eye.#> and 52-1016 ("The surgeon. who asked n o t t o beidentified,
T i n a l l y put RZi3tn
operfttiw
f o m ,
trkmedMike's
e y e l r ? s h e s , t h e n opdfied/ Y 1
t h e
eyelid
s u r ~ i c a l l y . " ) The f i r s t o f th'ecei n v o l v e s
an
a l t e r -native
t h a t i s r e j e c t e din
f a v o r o f t h ea l t e r n a t i v e
c o n c e p t u a l i z e di n
t h e second; t h u s ,the
relation
r n q ,e l a b e l e d Ii:I:JE3::3-ILI-PAVOR-Ol' 'n'ithin
ZC-1015
t h e r e is a r e l a t i mof
3 ;;N? td;;L;IOI'I ( d e n i a l o f expeetation) becwsen 32-1017 ( " ~ n eye susKeon debnted ' b r i e flk
a b o u tusing
a s u n e r g l u e solvent"') and 22-1018 ( " d e c i d e d ag.ii.nst i tfor
II '
fear
it
night
dam?gethe boy's'eye.
-It
will
be o f ems d c r a b l e intereat t oi s o l a t e
relations
o f thissort
in
avariety of
texts,
a
n
:
!
t o d e t e r . z i n 8 t h e waysin
whic-h $hgy :ia;y- lje expres:;ed ~ n d e ru a r y i n ~ c i r c a ~ s t a n c e s
in
d i f f e r e n t 1a.ngunp;es.?he text does
cr~ritain
one
e x m * d eof
a parenthesis, exnresseoin
t h o c m r e s t r i c t i y erelative
c l a u s el n ,
l i f l e13
("Thesurgeon,
wboasked n o t t o be
identified,
" ) . The fact t h l t t h e s u r g e o n asked n o tt o
be identified
is a n v l o r li(s,rerision from t h e inalnstrearn o f thea c c ~ : l a t . it i s a t t a c h e d t o t h y node r e p r e s e n t m e .
;he
surf;eonwhich
~
~
becon-1
1
aconst:tuent
o r:;-1022
( " f i n r i l l y p u t !like in t h eo : e r a t i n & r o o a , t r i . m e d Mike
'
s eyelsshes,then
o o e n e a t h e eyelid'1 ,\
sur;icnllg.
,
IV.
L e x i c a l , i z a t ~ o n o f a 33C:omponent
o fv s r b n l i z a t i : ~ n z
s l ~ e . : i f i c a l l y t o nc l u ~ t e r
o f
procwmes
t h a t
a r e
involved in
t h e
choico o f
ap n r t i c u l a r
linguietic
expres-3 *
sion
f o r R vu. a S u b c o n c e p t u a l i z a t i o n b r e a k o downan
i n i t i a l
chunkinto
s m a l l e r chunks. Phese smallerchunks,
however,remain
oncell-t u a l
dn
n a t u r e ,
~ l n d o t h e r o o e r o t ijns
a r e nececsary t oc o n v e r t
t h e ninto
surgace l i n p p i s t i c r e p r c ~ s e n tl i t i o n s .
iiou6hl.v c ~ e a k i n g ,1 s x i c ~ 1 -
i z a t i o n
involves
t h e c h o i c e o f "words" t h i r twill
a o n r o p r i i i t a 1 . ycommupicnte
t h e
c o n t e n t o f 2%.L s x ~ c a l i z a t i o n o f a ZC t a k e s ? l a c e s ar; t h e
n o i n t
where t h eS D - ~ & R ~ d e c i d e s t h ~ t he hrrn s u b c o n c c ~ ~ t u n l i z e d
r
n o . Theair.v
of
s u b c o n c e . ~ : t u n l i z a t i o n is t o 3roduce chunks o f ;I s i z e a n u r o -? r i a t e
t oling
~ l s t i c expression, and n n r t i c u l a r l y t o l i n f - l ~ i s t l c e x : ~ r e s s i o n t h a t will convey n e i t h e r t o o l i t t l eo r
t o o nuchi n f o r -
m a n o n
t o t h e ;iddr::scee. T o o l i t t l e i n f o r m s t l o n is, f o r e x m n l e ,~ r o v l d e d 'by o sun::iaT;y, whx-e ;;ubr.nncel;tu4',izati
Jn
has r j r o c ~ e d e c i o n l y t o a p o i : & wht: e L e x i c a l i z a t i o nw-LS1
g i v e t h e a d r v r - s e e aI t
g e n e r ' i l i d e a " of t h e c o n t e n t o f the
whole.
At
t h e o t w r e n d o fthe
s c a l e , we area11
C - m i l i w w i t h e . : ~ o . S i t i o n zin
d h i c h t ~ f l ~ i ~ c h oinformat~on
is conveyed, vhnyue we :ire t o l l ] m r e h:ln NBw ~ n t
t oknqw. h e a s n e c t o f a ~ n e a k c z r ' s c r ~ v i t i v i t y , t h e n , ic? t b d e c i d e
e x a c t l y wilere
in
t h e p r o c r , m o f sub^ ~ n c e ; , t ~ ~ ~ l ~ z : ~ t i c m he sh ~ l d s t o ~ , t n k i n ~ into a c c c b l n t t h e rleeds a::d i n t e r e s t so f
the
s d d r e s s e e . I tis
at
this :~oi:iQ t h a t he t u r n s to l e x i c ~ i l i z a t i o n .The s ~ e a k e r mag 31.0 be i n f l u e n c e d
in
s u c h dscisionc hy t h eresources h l s l a g u a g e