• No results found

Test Suite Generation

N/A
N/A
Protected

Academic year: 2021

Share "Test Suite Generation"

Copied!
27
0
0

Loading.... (view fulltext now)

Full text

(1)

Test Suite Generation

Sylvain Schmitz

LORIA, INRIA Nancy - Grand Est, Nancy, France

(2)

Issues with Surface Generation

*Jean que cherches-tu est grand. Jean qui baille s’endort.

Le chat noir est grand. Il le faut.

Beaucoup de chats noirs se lavent. Jean qu’il arrive agit.

*Le chat est avec beaucoup de poils grand. Jean qui agit est grand.

*Le repas que attendez-vous arrive. *Jean est avec chat grand.

(3)

Overgeneration

Definition

Equivalently, if a grammar

I assigns incorrect structure to grammatical

sentences

I accepts agrammatical sentences I generates agrammatical sentences

(4)

Error Mining

van Noord [2004], Sagot and ´Eric de la Clergerie [2006] Applied to undergeneration:

1. parse a large corpus of correct sentences 2. failures indicate coverage issues

3. statistical analysis identifies a probable culprit for each failure

(5)

Error Mining

van Noord [2004], Sagot and ´Eric de la Clergerie [2006] Applied to undergeneration:

1. parse a large corpus of correct sentences 2. failures indicate coverage issues

3. statistical analysis identifies a probable culprit for each failure

(6)

For Overgeneration?

Which test suite for pass/failure?

I a TreeBank

I a corpus of incorrect sentences

I sentences generated from the grammar I which input?

(7)

For Overgeneration?

Which test suite for pass/failure? I a TreeBank

I a corpus of incorrect sentences

I sentences generated from the grammar I which input?

(8)

For Overgeneration?

Which test suite for pass/failure? I a TreeBank

I a corpus of incorrect sentences

I sentences generated from the grammar I which input?

(9)

For Overgeneration?

Which test suite for pass/failure? I a TreeBank

I a corpus of incorrect sentences

I sentences generated from the grammar I which input?

(10)

“Exhaustive” Generation

I not in terms of elementary trees (about 6,000) I in terms of linguistic phenomena

I grammar compiled from a meta grammar I compilation traces

(11)

Meta Grammar

XMG, Crabb´e [2005] and many others

S VP V⋄ N∗ C S que SNA N N∗ C S que SNA N

CanonicalSubject activeVerbMorphology CanonicalObject

S S = S N↓ VP ∧ ∧ N↓ V N↓ VP V⋄ N↓ VP S VP V⋄ ∧ ∧ = RelativeObject activeVerbMorphology InvertedNominalSubject S VP Cl↓ Cl↓ VP V⋄

(12)

Guided Generation

Input: bag of classes {InvertedNominalSubject, RelativeObject} Output: set of trees vous Cl C que que D Le attendez V VP VP V arrive N S N repas N . S S

(13)

Algorithm

repas .

Le que attendez vous arrive S

(14)

Algorithm

repas .

Le que attendez vous arrive

N↓

S

VP V⋄

(15)

Algorithm

repas .

Le que attendez vous arrive

N⋄ S VP V⋄ n0V noun

(16)

Algorithm

repas .

Le que attendez vous arrive N⋄ N D⋄ S VP V⋄ n0V noun stddeterminer

(17)

Algorithm

repas .

Le que attendez vous arrive

N⋄ S N N S S D⋄ C que V⋄ VP Cl↓ VP V⋄ n0V noun stddeterminer n0Vn1

(18)

Algorithm

repas .

Le que attendez vous arrive

N⋄ S N N S S D⋄ C que V⋄ VP Cl⋄ VP V⋄ n0V noun stddeterminer n0Vn1 CliticT

(19)

Algorithm

repas .

Le que attendez vous arrive

N⋄ S N N S S D⋄ C que V⋄ VP Cl⋄ VP V⋄ n0V noun stddeterminer n0Vn1 CliticT

(20)

Digression: 2-level Syntax

Shieber [2006] and many others

tree-adjoining grammar

regular tree grammar macro tree transducer

derivation trees derived trees

conversions

(21)

Digression: Regular Tree Grammar

Ss −→n0V(Ns) Ns −→noun(Na) Na −→stddeterminer(Na) Na −→n0Vn1(Na, Cls) Na −→ε() Cls −→CliticT() n0V noun stddeterminer n0Vn1 CliticT ε

I need to account for feature structures

(22)

Algorithm, again

I derivation-tree centric

I distances computed using the accessibility

relation in the regular tree grammar

I elementary tree selection uses distances I to the remaining target classes

I to the globally accumulated classes

(23)

Experiments with the Algorithm

I issue: non termination of the grammar

I experiments on small controlled subsets

I larger generation using GenI

(24)

Experiments with Error Mining

Issues: Ordering Suspects

Worst Form Number: 1 0.220409692766 complexAdvDeDeterminer s0Pv1post

d7 0.220780944532

Worst Form Number: 2 0.185227187873 AdjectivalPredicativeform s0Pv1post d7 0.185539179236

(25)

Experiments with Error Mining

Issues: Bigrams

Worst Form Number: 5 0.0836861365418 CanonicalSubject InvertedNominalSubject d10 0.0838670966961 d1 0.0838670966961 InvertedNominalSubject RelativeObject d10 0.0838670966961 d1 0.0838670966961

(26)

Random Concluding Remarks

I application of two-level syntax

I opens new issues with error mining

(27)

B. Crabb´e. Grammatical development with XMG. In P. Blache, E. Stabler, J. Busquets, and R. Moot, editors,LACL’05, volume 3492 ofLecture Notes in Computer Science, pages 84–100. Springer, 2005. ISBN

978-3-540-25783-7. doi: 10.1007/11422532 6.

C. Gardent and E. Kow. Spotting Overgeneration Suspects. InENLG’07. 2007. URLhttp://hal.inria.fr/inria-00149372/.

B. Sagot and ´Eric de la Clergerie. Error mining in parsing results. InACL’06, pages 329–336. ACL Press, 2006. doi: 10.3115/1220175.1220217. URL

http://www.aclweb.org/anthology/P06-1042.

S. Schmitz and J. Le Roux. Feature unification in TAG derivation trees. In C. Gardent and A. Sarkar, editors,TAG+9, 2008. URL

http://www.loria.fr/˜schmitsy/pub/tagunif.en.pdf.

S. M. Shieber. Unifying synchronous tree-adjoining grammars and tree transducers via bimorphisms. InEACL’06. ACL Press, 2006. ISBN 1-932432-59-0. URLhttp://www.aclweb.org/anthology/E06-1048. G. van Noord. Error mining for wide-coverage grammar engineering. In

ACL’04, pages 446–453. ACL Press, 2004. doi: 10.3115/1218955.1219012. URLhttp://www.aclweb.org/anthology/P04-1057.

References

Related documents

Urine and appropriately timed venous blood samples were collected during 1, 2 or 3 periods totaling 30 to 40 minutes for the determination of glomerular filtration rate (CIN),

In order to investigate the localisation pattern of the activated kinase immunohistochemically, under conditions of osmotic stress, frog hearts were perfused with hypertonic medium,

The differences in the oxygen affinity of larval and adult blood are related to differences in the chemical properties of their haemoglobins, a feature illustrated by the lack of

The increase in skin temperature at low ambient temperatures is believed to be a result of a decrease in heat flow through the breast feathers brought about by feather adjustments,

The aim of this study is to identify the critical success factors which are responsible for completion of construction projects and analyze them to identify success which can

Elbjaoui: Département de Mathématiques et Informatique, Faculté des Sciences de Rabat, Université Mohamed V, BP 1014, Rabat, Morocco. E-mail address

Through this initiative the various attributes identified in unorganized sectors are education level very low, professional and vocational skills are very low, lack of

Having found the solid angle Ω e at the edge point of the regular dodecahedron, let us now find the solid angle that Ins A subtends at the vertex a.. Note that this expression does