
Organization and Programming of the Multistore Parser


[...] recognize and explain structural patterns in natural-language sentences (specifically English) and eventually yield an output in which the relations between the various items of the sentence are hierarchically displayed.

The recognition of these structural patterns is made by means of a system of rules which operate on a sequence of words, i.e. a sentence, whose individual characteristics are pre-established. By individual characteristics are meant the possibilities a word has to correlate (i.e. to form a syntactic combination) with another item; these possibilities are represented by 'correlators', that is, by syntactic elements which link two items in a correlation.

Each word is characterized by a set of pre-established data:

a) the S-code, which distinguishes between the various senses of a homograph. For instance, a word like "READ" will have four different S's to distinguish between:

READ = supine        e.g.  I CAN READ
READ = past tense          YESTERDAY I READ
READ = past part.          I HAVE READ
READ = noun                A LONG READ

These distinctions are essential, since whenever a homograph occurs, one and only one of its meanings can be taken into consideration to make the final pattern, un[...] one final pattern is to be recognized, as in:

i) present tense    I READ THE BOOK

ii) past tense

b) the sequence of correlational indices (Ic's), that is, the string of potential links that each word-sense has. Each Ic represents a possible syntactic connection between two items and is identified by:

1) the code number of the relation it establishes between two items;

2) the 'type' of correlation. There are six different types of correlation which split into two groups: 'explicit' correlators and 'implicit' correlators.

By 'explicit' correlator we mean a linking element which is represented by a linguistic item; prepositions and conjunctions are explicit correlators; by 'implicit' correlator we mean a relation between two items, which is not expressed by any linguistic item but is indicated by the relative position of the two items (which we call their correlational function).

IMPLICIT CORRELATOR

Type N    I AM                      N1 -- N2
Type M    AM I                      M2 -- M1
Type V    SERIOUSLY, HE LEFT        V2 -- [...]

EXPLICIT CORRELATOR

Type E    DUCKS IN ATHENS           E1 -- E3 -- E2
Type F    BY CAR THEY TRAVELLED     F3 -- F2 -- F1
Type H    DOLLS SHE PLAYS WITH      H1 -- H2 -- H3

For each type there are different correlational functions which determine the position a word has in a correlation. When two adjacent words have complementary functions of the same Ic - for instance, word A has 5050 N1 and word B has 5050 N2 - a 'product' is made and recorded in the form:

Word A   5050 N   Word B
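The conventional matching step described above can be sketched as follows. This is only an illustrative sketch, not the authors' program: the class names, the function `match_adjacent`, and the Ic codes 6010 and 7000 are hypothetical, and the check is restricted to type N, where the left-hand word carries function 1 and the right-hand word function 2. Only the code 5050 and the recorded form come from the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Ic:
    code: int      # code number of the relation, e.g. 5050
    ctype: str     # correlator type; only "N" is handled in this sketch
    function: int  # correlational function: 1 or 2


@dataclass(frozen=True)
class Product:
    first: str
    code: int
    ctype: str
    second: str

    def __str__(self):
        # Recorded form used in the text: "Word A 5050 N Word B"
        return f"{self.first} {self.code} {self.ctype} {self.second}"


def match_adjacent(left_name, left_ics, right_name, right_ics):
    """Compare every Ic of the left piece with every Ic of the right piece."""
    products = []
    for a in left_ics:
        for b in right_ics:
            same_ic = (a.code, a.ctype) == (b.code, b.ctype)
            # Type N only: left piece has function 1, right piece function 2.
            if same_ic and a.ctype == "N" and a.function == 1 and b.function == 2:
                products.append(Product(left_name, a.code, a.ctype, right_name))
    return products


# Hypothetical Ic strings; only 5050 N1 / 5050 N2 are taken from the text.
word_a = [Ic(5050, "N", 1), Ic(6010, "N", 2)]
word_b = [Ic(5050, "N", 2), Ic(7000, "M", 1)]
products = match_adjacent("Word A", word_a, "Word B", word_b)
print(products[0])
```

The nested loop makes the cost visible: every Ic of one piece is compared with every Ic of the other, so the number of comparisons grows with the product of the two string lengths.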

This product is considered as one piece and can become first or second correlatum in a wider correlation and is therefore treated as though it were a single word, i.e., it is assigned strings of Ic's which indicate its correlational possibilities both with adjacent words and with adjacent products already made. Single words, however, being vocabulary items, have their strings of Ic's assigned a priori; products, since they arise during the procedure, have to be assigned their Ic-strings dynamically. The assignation of specific Ic's to a product depends on: [...]


[...] makes up the first or the second correlatum.

The operational cycle that assigns Ic's to a product we call 'reclassification'.

The amount of data involved in an analysis of this kind is really enormous. Let us consider a sentence consisting of ten words, each of which has two different senses (S's). On an average 50 correlational indices are assigned to each sense of a word. Now, just to check the correlational compatibility of two adjacent words about 10,000 matching operations would be necessary; the matching procedure for all the words of the sentence would involve about 90,000 operations. On an average five products would result from the first 10,000 matching operations; each of them would be assigned about 50 correlational indices that represent the product's correlational possibilities to correlate with another adjacent piece - either a word or a product. The procedure to match these five products with another piece would involve about 637,000 operations. If to this figure we add the number of operations necessary starting from level 3 (see p. 7) with all the products made in the immediately preceding levels (200,000), the total number of operations involved would come to 927,000.

[...] correlator responsible for that correlation; the other half depends on the strings of indices which the two correlata of the product have. According to the presence or absence of specific indices in the strings of the first or second correlatum, pre-established sets of indices are assigned to the product; or sets of indices are assigned to the product only if they are present in the strings of its two correlata. The reclassification of each product would require about 2,000 operations, which means 100,000 for the average of 50 products in a sentence of 10 words, bringing the total of operations to over a million only for the matching procedure. This would imply - for this part of the program alone - processing times of the order of some seconds of machine time if the most modern computer is available, or of about an hour - at best - if the work is done with an older model.
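The operation counts quoted above can be retraced with a few lines of arithmetic. The sketch below uses only the averages given in the text; the figures 637,000 and 200,000 are taken as quoted and are not derived here, and all variable names are introduced for illustration.

```python
# Averages quoted in the text for a ten-word sentence.
WORDS = 10
SENSES_PER_WORD = 2
ICS_PER_SENSE = 50

# Each word carries 2 x 50 = 100 Ic's, so one adjacent pair of words
# requires 100 x 100 comparisons.
ics_per_word = SENSES_PER_WORD * ICS_PER_SENSE
ops_per_adjacent_pair = ics_per_word ** 2

# A ten-word sentence has nine adjacent pairs of words.
word_level_ops = (WORDS - 1) * ops_per_adjacent_pair

# Figures quoted directly in the text (not derived in this sketch).
PRODUCT_MATCH_OPS = 637_000
HIGHER_LEVEL_OPS = 200_000
matching_total = word_level_ops + PRODUCT_MATCH_OPS + HIGHER_LEVEL_OPS

# Reclassification: about 2,000 operations for each of about 50 products.
RECLASS_OPS_PER_PRODUCT = 2_000
PRODUCTS_PER_SENTENCE = 50
reclass_total = RECLASS_OPS_PER_PRODUCT * PRODUCTS_PER_SENTENCE

grand_total = matching_total + reclass_total

print(ops_per_adjacent_pair, word_level_ops, matching_total, grand_total)
```

Under these averages the matching figures add up to the 927,000 stated in the text, and reclassification brings the total to just over a million.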

The amount of work and money involved in a procedure of this kind made us try to find a quicker and more economical way of handling correlational indices: as a result of our efforts the Multistore system was developed. (Bibl. 1)


[...] several times and in different ways according to the diverse data it contains, but only as one single item which by its positional coordinates implies its various significations. Moreover, the Ic's do not have to be compared one by one with the Ic's of other adjacent words or products, but are simply addressed to one and only one pre-established position. Thus the mass of operations of comparison is avoided and also the necessity to ascertain, after every successful match, which items the matched Ic's represent is eliminated, because the very position of the matched Ic's immediately implies what they stand for. To establish whether two Ic's are complementary and represent a correlation thus becomes the simple task of checking already present information according to the rules of sequence, of correlational function, and of correlator type, all of which are implicit in the location of the markers which are being handled.

The Multistore can be represented as a rectangular area divided into lines and columns (see Fig. 1 below).

[Fig. 1: the Multistore area. Columns are grouped into explicit correlators (E, F and H) and implicit correlators (N, M and V); lines L1, L2, L3, ... mark the levels.]

Every column is dedicated to one Ic and subdivided into two subcolumns, if the Ic is of type N, M, or V (implicit); if the Ic is of type E, F, or H (explicit), the column is divided into three subcolumns.

The lines L1, L2, L3 etc. divide the area into levels. The levels are determined by the succession of words in input. Thus each level bears the number of the word it represents. Every input word causes for each Ic in its Ic string the insertion of a marker into the Multistore column corresponding to that Ic; and the level of that marker in the Ic column corresponds to the input number and the position of that word in the sentence. Thus all the markers inserted for one word represent the correlational possibilities of that word.
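The addressing idea can be illustrated with a small model in which a marker is located purely by its column (the Ic), its subcolumn (the correlational function) and its level (the position of the word). This is a sketch under stated assumptions, not the layout of the actual program: the class `Multistore`, its methods, and the Ic code 3010 are hypothetical.

```python
IMPLICIT = {"N", "M", "V"}   # two subcolumns per column
EXPLICIT = {"E", "F", "H"}   # three subcolumns per column


def subcolumns(ctype):
    """Number of subcolumns of a column, depending on correlator type."""
    if ctype in IMPLICIT:
        return 2
    if ctype in EXPLICIT:
        return 3
    raise ValueError(f"unknown correlator type: {ctype}")


class Multistore:
    def __init__(self, columns):
        # columns: mapping from Ic code number to correlator type
        self.columns = dict(columns)
        # store[code][level] is the set of subcolumns (functions) marked
        self.store = {code: {} for code in self.columns}

    def insert_word(self, level, ic_string):
        """Insert one marker per (code, function) pair of the word's Ic string."""
        for code, function in ic_string:
            if not 1 <= function <= subcolumns(self.columns[code]):
                raise ValueError(f"no subcolumn {function} in column {code}")
            self.store[code].setdefault(level, set()).add(function)

    def markers(self, code, level):
        """Functions marked in the given column at the given level."""
        return sorted(self.store[code].get(level, ()))


# 5050 is the N-type Ic used in the text; 3010 is a made-up E-type Ic.
ms = Multistore({5050: "N", 3010: "E"})
ms.insert_word(1, [(5050, 1)])              # first word of the sentence
ms.insert_word(2, [(5050, 2), (3010, 3)])   # second word
print(ms.markers(5050, 1), ms.markers(5050, 2))
```

No Ic is ever compared with another one here: whether a complementary marker exists at the preceding level is answered by looking at one known position.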


[...] into the Multistore on the second level; this means that it can enter into combinations only with those words that belong to the immediately preceding level, or with products which contain the words of the immediately preceding level. Such a correlation, whenever it is made, would still belong to the level of product x. In our specific case product x could correlate only with an item of level zero, which does not exist, because product x is on level two and already contains word No. 1. Hence we can formulate a restrictive rule to the effect that a product can be a potential second correlatum in an N correlation only if its lower level is larger than 1. The Multistore system lends itself to the introduction of many such restriction rules.

When on a given level all products that have sprung from the insertion of markers corresponding to the word of that level have been reclassified, and the products originating from that reclassification have, in turn, been reclassified and have inserted their markers, and there are no more products to be reclassified, then the procedure inserts the next word and thus begins the next level. This means that once a subsequent word of the sentence has been inserted, all preceding words and products become 'inactive' pieces, having exhausted every possible attempt of correlation with [...]

[...] sentence determines the end of the analysis. At this point the product (or products) that contains all words of the sentence is called 'complete' and represents the hierarchical structure of the sentence.

The first tentative program MP 1 (Bibl. 1) was written for use on a GE 425 computer and its main purpose was to show the applicability of the Multistore system to correlational grammar and to check the method of programming based on 'significant addresses'.

The present program, MP 2 (Bibl. 3), is a revised and enlarged version of MP 1 written for use on an IBM 360/67 computer. On the basis of our previous experience it can be considered an actual working tool.

Many solutions, as well as many restraints, depend on the fact that in many respects it is a machine-oriented program. The program is structured on a large area of the central core, divided into lines and columns, whose size is 528 x 330 bytes. Each line (330 altogether) consists of 528 bytes and is divided into two sections: A and B. Section A contains all the data necessary to define a line; section B consists of 496 bytes, that is, of as many bytes as there are correlators operative in the [...]


[...] is specified in the columns of section B.

[Fig. 2: layout of the Multistore lines (1 to 330). Each line consists of section A (32 bytes), which includes the fields for the 1st and 2nd correlatum and the reclassification rule, and section B (496 bytes).]

Each byte of section B is divided into 8 bits as illustrated below.

Bit 1    marker of CF3 (explicit correlator)
Bit 2    marker of CF2 (right-hand piece)
Bit 3    marker of CF1 (left-hand piece)
Bit 4    marker for special linguistic rules
Bit 5    reclassification rule marker
Bit 6    Ic assignation marker

Fig. 3

Bits 1 to 4 are therefore used in the matching procedure, whereas bits 5 and 6 are pre-established data to be used [...]

Procedure

Each S-value of a word occupies one line of the Multistore area and its specifications are recorded in section A of the same line. For each Ic contained in the string of that S-value of the word, a marker is inserted, according to the correlational function, in bit 1, 2 or 3 of the corresponding byte of the line, that is, in the byte which bears that Ic as label.
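The line layout and the marker bits can be modelled as a flat byte array. The sizes (330 lines of 528 bytes, 32 bytes for section A, 496 for section B) are those given in the text. Everything else in this sketch is an assumption made for illustration: the assignment of CF3, CF2 and CF1 to bits 1, 2 and 3 follows the order in which Fig. 3 lists them, bit 1 is taken to be the most significant bit of the byte, and the names `Marker`, `byte_offset`, `set_marker` and `has_marker` are hypothetical.

```python
from enum import IntFlag

LINES = 330
LINE_BYTES = 528
SECTION_A_BYTES = 32
SECTION_B_BYTES = 496   # one byte per correlator


class Marker(IntFlag):
    # Assumption: bit 1 is the most significant bit of the byte.
    CF3 = 0x80             # bit 1: explicit correlator
    CF2 = 0x40             # bit 2: right-hand piece
    CF1 = 0x20             # bit 3: left-hand piece
    SPECIAL_RULE = 0x10    # bit 4: special linguistic rules
    RECLASS_RULE = 0x08    # bit 5: reclassification rule marker
    IC_ASSIGNATION = 0x04  # bit 6: Ic assignation marker


# The whole Multistore area: 528 x 330 bytes.
area = bytearray(LINE_BYTES * LINES)


def byte_offset(line, column):
    """Offset of the section B byte for a zero-based line and column."""
    if not (0 <= line < LINES and 0 <= column < SECTION_B_BYTES):
        raise IndexError("line or column outside the Multistore area")
    return line * LINE_BYTES + SECTION_A_BYTES + column


def set_marker(line, column, marker):
    area[byte_offset(line, column)] |= int(marker)


def has_marker(line, column, marker):
    return bool(area[byte_offset(line, column)] & int(marker))


# Record a left-hand marker for the Ic labelled by column 5 on line 0.
set_marker(0, 5, Marker.CF1)
print(has_marker(0, 5, Marker.CF1), has_marker(0, 5, Marker.CF2))
```

Because the column index is itself the Ic label, the test for a marker is a single address computation followed by a bit test.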

According to its function, a marker can be a left-hand piece 'LH', and as such it is simply recorded, or a right-hand piece 'RH', in which case, immediately after it has been recorded, the column is searched for a complementary and contiguous LH piece. If this is found, an indication of product is recorded in the first free line of the Multistore; this address consists of three data: a) the address of the line where the LH piece was found, which is recorded in the area 'first correlatum' of the line of the product; b) the address of the line where the RH piece was recorded, which is recorded in the area 'second correlatum'; and c) the relative address of the column which characterizes both LH and RH pieces, which is recorded in the area 'correlator'.
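The recording of a product can be sketched as follows. This is an illustrative model rather than the MP 2 routine: the `Line` record, the helper names, and the reading of 'contiguous' as 'the LH piece ends on the level immediately before the one on which the RH piece begins' are assumptions. The product THE // LITTLE is borrowed from the text; the column number 6 is arbitrary.

```python
from dataclasses import dataclass, field
from typing import Optional

LH, RH = "LH", "RH"


@dataclass
class Line:
    label: str
    low: int                          # first level (word position) covered
    high: int                         # last level covered
    first: Optional[int] = None       # address of the LH line (products only)
    second: Optional[int] = None      # address of the RH line (products only)
    correlator: Optional[int] = None  # relative address of the column
    markers: dict = field(default_factory=dict)  # column -> {"LH", "RH"}


lines = []


def add_line(label, low, high, **fields):
    """Record a piece on the first free line and return its address."""
    lines.append(Line(label, low, high, **fields))
    return len(lines) - 1


def insert_marker(addr, column, function):
    """Record a marker; an RH marker triggers a search of its column."""
    piece = lines[addr]
    piece.markers.setdefault(column, set()).add(function)
    made = []
    if function == RH:
        for lh_addr, cand in list(enumerate(lines)):
            contiguous = cand.high + 1 == piece.low
            if LH in cand.markers.get(column, ()) and contiguous:
                made.append(add_line(
                    f"({cand.label} // {piece.label})",
                    cand.low, piece.high,
                    first=lh_addr, second=addr, correlator=column))
    return made


w1 = add_line("THE", 1, 1)
insert_marker(w1, 6, LH)           # LH pieces are simply recorded
w2 = add_line("LITTLE", 2, 2)
made = insert_marker(w2, 6, RH)    # RH piece: search column 6 for an LH piece
product = lines[made[0]]
print(product.label, product.first, product.second, product.correlator)
```

The product line stores nothing but three addresses, which is all that is needed later to reconstruct the hierarchy.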

[Figure: example of Multistore lines L1 and L2 and the line of product Px, showing section A (W/P, 1st corr., 2nd corr., Rules) and markers in the N sector.]

The position 6 in the Multistore area corresponds to correlator No. y in the same way as position 7 corresponds to correlator No. z, and so on.

[...] of the word causes a new product to be made, the procedure is repeated and the product is recorded on the next free line of the Multistore area. Only when all the Ic's of the piece which has caused the production have been inserted does the reclassification routine take place, starting from the first product newly recorded.

[Figure: reclassification of product Px. Legend (symbols illegible): conditioned rule; unconditioned rule; check on 1st correlatum; assign the string CF1 contained in the rule to the product; assign the string CF2 contained in the rule to the product.]

The analysis of the sentence is complete when the last marker of the last word-sense has been inserted and there are no further products to be reclassified or recycled. At this point the output routine starts. Three different kinds of output are produced:

a) a list of all the products made in the course of the analysis of the sentence;

b) a list of all Ic's assigned to each product during the reclassification routine;

c) a graphic representation of the hierarchical structure of all 'complete' products (that is, containing all words of the sentence). This structure is equivalent to a tree structure with words at the terminals and correlators at the nodes (see Appendix).
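Output type c) amounts to walking a binary tree whose leaves are words and whose inner nodes are correlators. The sketch below shows one possible rendering and the test for a 'complete' product. The nested-tuple representation, the function names, the correlator labels c1 to c3, and the particular bracketing of I READ THE BOOK are all hypothetical and chosen only for illustration.

```python
def words(node):
    """Words at the terminals of a piece, in sentence order."""
    if isinstance(node, str):
        return [node]
    first, _correlator, second = node
    return words(first) + words(second)


def is_complete(node, sentence):
    """A product is 'complete' if it contains all words of the sentence."""
    return words(node) == list(sentence)


def render(node, depth=0):
    """Indented listing: correlators at the nodes, words at the terminals."""
    pad = "  " * depth
    if isinstance(node, str):
        return [pad + node]
    first, correlator, second = node
    return ([pad + f"[{correlator}]"]
            + render(first, depth + 1)
            + render(second, depth + 1))


sentence = ["I", "READ", "THE", "BOOK"]
# A product is modelled as (first correlatum, correlator, second correlatum).
parse = ("I", "c1", ("READ", "c2", ("THE", "c3", "BOOK")))

print(is_complete(parse, sentence))
print("\n".join(render(parse)))
```

Each product line of the Multistore supplies exactly the three items of such a tuple (first correlatum, correlator, second correlatum), so the tree can be rebuilt by following the recorded addresses.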

This is a general outline of the procedure of combination, production, reclassification and output. In addition to that there are several routines which meet special requirements. A special rule, for instance, prevents specific RH pieces from becoming eligible LH pieces once a certain correlation - which contains them as RH pieces - has been made. A word like "LITTLE", for instance, in its function as a quantifier, once it has been correlated with the definite article and made the product "THE // LITTLE", cannot become LH piece in the correlation:

LITTLE // HE KNOWS

The indication 'discard' on print-out type 'a' - i.e. on the list of all the products made during the analysis - will show that "LITTLE" is no longer available as LH piece for any other correlation.

Another restraint concerns some 'complete' products which, though grammatically correct, cannot be accepted as interpretation of the sentence. For instance, in a sentence like:

THEY // WERE READY

the structure which takes "WERE" as subjunctive is not acceptable, since it would require something else - an "IF" or "I WISH" etc. - to precede. In cases like this the indication 'non-sentence' appears in print-out type 'a'.

A set of special routines serves the purpose of recognizing idiomatic expressions. When one of them is recognized, inserted in the Multistore and reclassified - like any other product - the indication 'idiom' is printed on print-out type 'a'.

[...] sentences of up to 16 words - a limit fixed in accordance with the average length of sentences in scientific texts (Bibl. 5) and ample enough to allow any type of syntactic structure. Processing times for 10-word sentences are about 1-1.5 seconds. Our present vocabulary is limited to 150 words for reasons of punched card maintenance. However, it could be enlarged without affecting the program.

Bibliography

1. 'Multistore': A Procedure for Correlational Analysis (E. v. Glasersfeld, P. P. Pisani, J. B. Burns), Informal Report T-10, Automazione e Automatismi, vol. IX, No. 2, Milan, Italy, 1965.

2. Automatic English Sentence Analysis (Glasersfeld, Pisani, Burns, Notarmarco, Dutton), Final Report T-14, Grant AF EOAR 65-76, IDAMI Language Research Section, Milan, Italy, 1966.

3. The Multistore System MP-2 (E. v. Glasersfeld and P. P. Pisani), Scientific Progress Report, Grant AFOSR 1319-67, Georgia Institute for Research, Athens, Georgia, 1968.

4. The Multistore Parser for Hierarchical Syntactic Structures (E. v. Glasersfeld and P. P. Pisani), Grant AFOSR 1319-67, Georgia Institute for Research, Athens, Georgia, 1969 (paper submitted to Communications of ACM).

5. Computational Analysis of Present-Day American English (Henry Kucera and W. Nelson Francis), Brown University Press, Providence, Rhode Island, 1967.

APPENDIX I

Complete Parsing (Print-out type c)

[Print-out not legible in the source.]
