• No results found

Syntax and prosody in a Danish Text to Speech System

N/A
N/A
Protected

Academic year: 2020

Share "Syntax and prosody in a Danish Text to Speech System"

Copied!
18
0
0

Loading.... (view fulltext now)

Full text

(1)

Syntax and prosody in a Danish Text-to-Speech

System

Peter Molbcek Hansen & Ebbe Spang-Hanssen

University o f Copenhagen

0, Introduction

T h is p a p e r is stru c tu re d as fo llo w s:

In se ctio n 1. w e p re s e n t th e b a s ic stru c tu re o f a D a n ish te x t-to -s p e e c h sy ste m (h e n c e fo rth T T S -sy s- tem ), an d illu strate so m e o f th e sp ecial n e e d s o f T T S -s y s te m s a s c o m p a re d w ith o th e r n a tu ra l la n g u a g e p ro c e ssin g sy ste m s (N L P -sy ste m s). In th is c o n n e c tio n w e e x e m p lify s o m e c a se s o f sy n ta c tic a lly d e te rm in e d stre ss lo ss in D an ish .

In se ctio n 2. w e p re se n t and d isc u ss s o m e p re lim in a ry re su lts o f an e m p iric a l in v e s tig a tio n o f th e rela tio n b e tw e en sy n ta c tic stru c tu re an d p a u se d istrib u tio n in re a d -a lo u d D a ru sh p ro se .

In se ctio n 3. w e d e sc rib e a sp e c ia lise d g ra m m a r fo rm a lism fo r e x p re s s in g s y n ta c tic a s w ell as m o rp h o lo g ic a l and p h o n o lo g ic a l s tru c tu re , an d w e o u tlin e th e p a rs in g s tra te g y u s e d to tra n sfo rm in p u t se n te n c e s to p h o n e tic a lly c o n siste n t an d p ro so d ic a lly a d e q u a te tra n s c rip tio n s w ith sp e cia l e m p h a sis o n h o w to a c h ie v e ro b u stn ess.

1, Syntactic Aspects of the TTS-System

A P P E N D IX 1 is a sc h e m a tic o v e rv ie w o f th e d a ta flo w th ro u g h th e se ria lly c o n n e c te d c o m p o n e n ts w h ic h c o n v e rts strin g s o f A S C II-sy m b o ls to so u n d w a v e s in a ste p w ise fash io n . T h e c o m p o n e n t lab e led S P E Q A L IZ E D P A R S E R (S S P S F O R M A L IS M IN T E R P R E T E R ) ta k e s c a re o f th e le x i- c o -m o rp h o lo g ic a l an d sy n ta c tic a n a ly sis n e e d e d in o r d e r fo r th e s y ste m to e x tra c t su ffic ie n t lin g u istic in fo rm a tio n fro m th e in p u t t e x t T h is c o m p o n e n t, h e n c e fo rth re fe rre d to a s S S P S (S u rfa c e S tru c tu re P a rsin g S y stem ), is th e o n e w h ic h w e h o p e to b e o f m o s t in te re s t to c o m p u ta tio n a l lin g u ists, a n d w e w ill n o t c o m m e n t o n th e o th e r c o m p o n e n ts a t a ll, a lth o u g h m o s t o f th e m a re o f c o u rse m o s t im p o rta n t fo r th e te x t-to -sp e e c h c o n v e rs io n p ro ce ss.

(2)
(3)
(4)
(5)
(6)
(7)
(8)

P e te r M o lb æ k H a n se n

In stitu t fo r A lm e n o g A n v e n d t S p ro g v id e n sk a b K ø b e n h a v n s U n iv e rsite t

N ja ls g a d e 8 0

D K -2 3 0 0 K ø b e n h a v n S E -m a il: p m @ q > h lin g .d k

E b b e S p a n g -H a n sse n

In stitu t fo r A lm e n o g A n v e n d t S p ro g v id e n sk a b K ø b e n h a v n s U n iv e rsite t

N ja ls g a d e 8 0

(9)

ASCII text: De spiste altid middag kl. 19.

TEXT NORMAUZER (A TRIVIAL PREPROCESSING PROGRAM)

Normalized: de spiste altid middag klokken nitten

SPECIAUZED PARSER (SSPS FORMAUSM INTERPRETER)

Morphophonemic: di,spis*t0,a11+^tidd midd-a klokk-0n,nltt-0n

PHONOLOGICAL TRANSFORMER (SPL FORMAUSM INTERPRETER)

Phonetic: disbi:sdO’alUiD?m’edak]Cg‘'nn'ed‘^n

PHONETIC TRANSFORMER (SPL FORMAUSM INTERPRETER)

parameterized:

PI PI PI PI PI PI PI PI PI -.. PI PI

P2 P2 P2 P2 P2 P2 P2 P2 P2 .... P2 P2

A P P E N D IX 1: T H E D A N IS H T E X T -T O -S P E E C H S Y S T E M :

Pn Pn Pn Pn Pn Pn Pn Pn Pn .... Pn Pn

SYNTHESIZER (SOFTWARE SIMULATION)

Wave image: Frl Fr2...JFr 1649 Frl650

D/A-CONVERTER

Electric oscillations:

LOUDSPEAKER

(10)

APPENDIX 2; TASKS IN TEXT-TO SPEECH S \’STEMS (ITS)

AS OPPOSED TO OTHER SYSTEMS (O)

1 . TTS: surfaa structure 1 -> surface structure 2

vs

O: surface structure -> some other structure

Example:

word inventory and order must be identical in input and output

in ITS. (TTS-systems involve both

analysis (identification) and generation.

2. ITS: Semantic structure is often irrelevant

vs

O: Semantic structure is usually essential

Example: semantic structure essential in translation systems

3. ITS: Prosodic features often essential

vs

O: Prosodic features often redundant

Example 1:

"Jens drak mælk" is distinguished from "Jens drak mælken"

both morphologically and prosodically, but for

0-parsers the prosodic difference is irrelevant.

Example 2:

In Danish many but not all stød susceptible

monosyllabic first parts of compounds

exhibit stød-loss. Ilius 'balgulv"

(11)

(æmma = > next word unstressed)

(single quote = > next word stressed)

Noun phrase internal constructions of various

semantic subtypes:

a) Determiner phrase ending in indefinite noun:

^t ’stort Jbundt Yriske Vadiser

to /ed lividiøg

b) Constructions with indefinite noun + proper name

j)rofessor Spang-Tlanssen

^lagter X)lsen

^nte ’Agate

/ætter ’Jens

c) Constructions with indefinite noun + numeral

gummer ’et

.indgang 7

Jrappe ’6

d) Constructions with indefinite noun + '^lace name"

/estaurant Den X^yldne Tortun

/Cap Det

Håb

e) Hypotactic proper name phrases:

Jens Teter

,Oie Svendsen

Jens / ’eter ’Jensen

(12)

( c o m m a = > n e x t H o r d u n s t r e s s e d ) ( s i n g l e q u o t e = > n e x t w o r d s t r e s s e d )

a ) V e r b w i t h i n d e f i n i t e d i r e c t o b j e c t , i r r e s p e c t i v e o f i n t e r v e n i n g m a t e r i a l :

J e n s i d r i k k e r a l d r i g Y a d ø l J e n s i l e d e r e f t e r ^ n g e

A P P E N D IX 4: S T R E S S L O S S IN \ E R B P H R A S E S

v s

V e r b m t h d e f i n i t e d i r e c t o b j e c t , i r r e s p e c t i v e o f i n t e r v e n i n g m a t e r i a l :

J e n s * d r ik k e r a l d r i g ,s in ^ æ l k J e n s l e d e r e f t e r ’Jp e n g en e

b ) V e r b + a d v e r b i a l c o m p l e m e n t , p o s s i b l y w t h i n t e r v e n i n g d i r e c t o b j e c t :

J e n s iS a tte s i n c y k e l I n d i s k u r e t

v s

V e r b p h r a s e s w t h p e r i p h e r a l lo c a t iv e a d v e r b i a l : J e n s ^ a t t e s i n c y k e l I n d e i s k u r e t ’

c ) A c o m b i n a t i o n o f a ) a n d b ) : J e n s ,s t0v e d e ’b o r d e t ’a f

v s

(13)

A . ( T h is l i t t l e i n d d e n t ) / / g iv e s ( a n e w z e s t ) ( t o o u r i n v e s t i g a t i o n )

B . ( C h i c k e n s ) w e r e e a t i n g / / ( t h e r e m a i n i n g g r e e n v e g e t a b l e s ) A P P E N D K 5: V ERB B A L A N C IN G R U L E

I gave th e bo ok to John fo r C h ristm as

APPENDIX 6: PAUSES IN WRITTEN DANISH READ ALOUD ( 16 PAGES )

YES

NO

OBLIGATORY SITES

1. A ft e r coordin ated S

65

5

2. A ft e r embedded S

40

2

POTENTIAL SITES

3. Around a p p o s itio n s , quotes

37

8

4. A ft e r coordin ated non-S

25

17

5. A ft e r complex i n i t i a l phrase

42

39

6. A ft e r complex in v e rte d s u b je c t 10

6

7. B efore c la u s e

89

75

8. B efore PP

41

252

TOTAL NUMBER OF PAUSES ‘

343

(14)

1 . S o m b a r n e l s k e d e h u n k i r s e b æ r , / / o g h u n t o g im o d p o s e n m e d e n s e lv f ø lg e , d e r f ø r s t b a g e f t e r h a r f o r u n d r e t

m i g / / o g g i k r u n d t b l a n d t d e ø \ r i g e p a s s a g e r e r / / o g b ø d a f s i n r i g d o m . ( D ,2 A )

2 . M e n j e g o p l e v e d e t r e t i n g d e n s i d s t e d a g , h u n d e r s t a d i g f } i d e r m i g m e d u n d r e n . ( D , 1 3 )

3 . V e d H i t l e r s m a g t o v e r t a g e l s e d e n 3 0 . J a n u a r 1 9 3 3 / / m a r c h e r e d e b r u n s l g o r t e m e / / S A - k o r p s e n e / / , i e t f l r e t i m e r s f a k k e l to g g e n n e m p o r t e n . (R ,3 ,5 )

4 . 1 m a n g e l a n d b r u g s e g n e / e r h e l e l a n d s b y e r p å f l u g t / / p å

g r u n d a f m a n g e l p å m a d o g v a n d / / o g a f f r y g t f o r e p i d e m i e r . ( R ,6 ,2 7 )

5 . M e n v e d r e t t e n i M i i n c h e n / / k ø r e r e n p r o c e s m o d e n a f W o lf f s t i d l i g e r e u n d e r o r d n e d e / / g e n e r a l m ^ 'o r S c h u t t . ( R , 5 4 )

6 . S å s k u l l e t o h u n d r e d e å r s d a g e n f o r å b n i n g e n a f B r a n d e n b u r g e r T o r i

d i g 11 v æ r e a n d e r l e d e s p r o b I e m I ø s .( R 4 > 1 3 )

7 a . E n n a b o , d e r ø n s k e d e a t h j æ l p e h e n d e / / b e k l a g e d e s i g o v e r / / a t n å r h a n k o m f o r b i p å v e je n / o g g e r n e v ille h i l s e p å h e n d e / / s å v e n d t e h u n r y g g e n t i l . ( D3 4 1)*

T b . D a h a n e r f a r e d e , a t j e g e n d n u i k k e h a v d e s o v e t / /

i n s i s t e r e d e h a n p å a t g iv e m i g e n i n d s p r ø j t n i n g a t s o v e p å . (D 3 > 2 8 ) D e r f o r s t o d M a r k u s W o l f f n a t u r l i g t n o k h ø j t p å ø n s k e s e d l e n h o s d e v e s tt y s k e m } 'n d ig h e d e r / / v e d g e n f o r e n i n g e n . ( R , 4 4 3 )

(15)

N P : : = D E T ? A P * N O U N N O U N > D E F C ' D E H ' ' D E T > D E F C A P > D E F D E T > A N C A P > A N D E T > A G C A P > A G A P > A N C N O U N > A N A P > A G C N O U N > A G D E T > A N C N O U N > A N D E T > A G C N O U N > A G N P > D E F ; T ) E F D ''

D E T : : = Q U A N T ? A R T ? E N U M ? Q U A N T > D E F C A R T > D E F Q U A N T > A N C A R T > A N Q U A N T > A G C A R T > A G A R T > D E F C E N U M > D E F A R T > A N C E N U M > A N A R T > A G C E N U M > A G

Q U A N T : ; = W R D W R D > W c " W K V A "

A R T ; : = W R D ? N P

W R D > W C " W P R O N " N P > W c "VVGEN" A R T > D E F : 'D E F D '' A R T > A : "A"

A R T : : = W R D

W R D > W C "V V D E F R "

E N U M : : = W R D W R D > W c "VVNUM "

(16)

F O R E A C H I N P U T S E N T E N C E B E G I N

I N T U T A S E N T E N C E .

S V M T C H T O M O R P H O L O G I C A L M O D E = T O P - D O W N , D E P T H - H R S T , F m S T - R U L E - n R S T , E A C H C H A R A C T E R S U R R O U N D E D B Y

T W O C O N S E C U T I V E V E R T I C E S .

F O R E A C H W O R D B E G I N

P A R S E T H E W O R D A C C O R D I N G T O M O R P H O L O G I C A L P A R T O F G R A M M A R , A C C E P T I N G A T L E A S T 1 A N D A T M O S T 3 I N T E R P R E T A T I O N S .

E N D

A R R R A N G E T H E R E S U L T S A S T E R M I N A L E D G E S I N A C H A R T F O R T H E S Y N T A C T I C M O D E .

S W I T C H T O S Y N T A C T I C M O D E = B O T T O M - U P , L E F T - C O R N E R , E A C H W O R D - R E S U L T S U R R O U N D E D B Y

T W O C O N S E C U T I V E V E R T I C E S .

P A R S E T H E S E N T E N C E A C C O R D I N G T O S Y N T A C T I C P A R T O F G R A M M A R , A C C E P T I N G E X A C T L Y 1 I N T E R P R E T A T I O N .

O U T P U T I N T E R P R E T A T I O N I N

U N E A R I Z E D M O R P H O P H O N E M I C F O R M . E N T )

(17)

A P P E N D I X 1 0 : E X A M P L E O F A N A L Y S IS

I n p u t : E n s u r t j e n e r t a b e r a l t i d a n s i g t .

M O R P H O L O G I C A L A N A L Y S IS :

0-1 e n < W N U M A N T JM S A G E N C D E H > 0 - 1 e n , < W D E T A N U M S A G E N C D E F l >

1 - 2 s u r r , < W V I M P S T R 0 > 1-2 s u r r < \ W I M P S T R l >

1- 2 s u r ! < W A D J A N U M S A G E N C D E F l >

2 - 3 t J E n = 0 r , < \V S U B A N U M S A G E N C D E O S T R 0 > 2 -3 t j E n = 0 r < W S U B A M J M S A G E N C D E F l S T R l > 2 -3 ^ E n + O r , O V W I N S T R 0 >

2 - 3 ^ jE n + O r < \ W F I N S T R l >

3 - 4 t a b = 0 r < W S U B A N U M S A G E N C D E n > 3 -4 t a b * 0 r , < \ W F I N S T R 0 >

3 - 4 t a b * 0 r < W V F I N S T R l > d - S a l l + J i ^ t i d d < W A D V >

4 - 5 a l t i d , < W S U B A N U M S A G E N D E H S T R 0 > 4 -5 a l t i d C W S U B A N U M S A G E N D E H S T R l > 4 - 5 a l t i d < W A D J A N U M S A G E N C D E H >

5- 6 a n h ^ I g t , O V V I M P S T R 0 > 5 -6 a n h % s l g t < ^ W I M P S T R l >

5 -6 a n h = s l g t , < W S U B A N U M S A G E N N D E H S T R 0 > 5 -6 a n h = s l g t < W S U B A N U M S A G E N N D E n S T R l >

S Y N T A C T I C A N A L Y S IS :

e n , s u r ^ E n = 0 r t a b * 0 r , a Il+ 4 ^ t i d d a n h = s l g t

(18)

U N S U C C E S F U L P A R S E S A R E A V O I D E D B Y 1 ) F I R S T R U L E H R S T S T R A T E G Y

2 ) D E F A U L T R U L E S :

M O R P H O L O G I C A L D E F A U L T R U L E :

S T E M : : = I e t r +

S T E M > M : 'M N S A C M C C O M N P X "

I £ . A N Y L E X I C A L L Y U N I D E N T I F I A B L E P A R T O F A N I N P U T W O R D W I L L D E F A U L T T O

1 ) A N O U N S T E M ,

2 ) O F T H E C O M M O N G E N D E R , 3 ) W I T H P L U R A L I N - E R ,

4 ) W I T H O U T - E - O R - S - A S 1 S T P A R T O F C O M P O U N D

5 ) W r r a S P E L L I N G = M O R P H O P H O ^ ’E ^ ^ C R E P R E S E N T A T I O N .

E .G . A N I N P U T W O R D L I K E 'S M U L P E R N E " W I L L B E O U T P U T A S :

•S m u Ip + O rn O " < > V S U B A N U M P D E F D >

S Y N T A C T IC D E F A U L T R U L E :

S : : = W R D +

l E . "A S E N T E N C E I S A S E Q U E N C E O F W O R D S "

T H I S P R I N C I P L E A L L O W S A N Y D E G R E E O F M O R P H O L O G I C A L A N D S Y N T A C T IC E X P L I C I T N E S S T O B E C O M B I N E D W I T H A G I V E N P R A C T I C A L I M P L E M E N T A T I O N .

References

Related documents

In short, the Prospero framework allows the public to both contribute functionality and content in the form of widgets, and control the widgets displayed on

Methods: The present study aimed to evaluate incidence and patient characteristics, and to assess current medical service use, usual care, and medical expenses of knee disorders

Consistent with the original report and the group mean analysis, substan- tially more patients receiving etoricoxib or naproxen achieved various thresholds of response for both

Our results suggest that multiple vertebral compression fractures affect physical performance (slower walking speed, lower chair stand time and shorter functional reach)

examination, 5) nerve conduction studies in all subjects, 6) ability to exclude pre-existing or prevalent cases at baseline, 7) blinding of Health Outcomes Assessment and Job

Shetty AA, Kumar VS, Morgan-Hough C, Georgeu GA, James KD, Nicholl JE: Comparing wound complication rates following closure of hip wounds with metallic skin staples or

From May 2010 to October 2015, 296 consecutive pa- tients with intertrochanteric femoral fractures were treated with PFNA-II (PFNA Asian version; Synthes, Oberdorf, Switzerland) via

Effect of bone marrow transplantation on soluble P-selectin (sP-sel) and soluble E-selectin (sE-sel) levels.. The endothelial-specific expression of E-sel indicates that the