• No results found

Multi Answer Focused Multi Document Summarization Using a Question Answering Engine

N/A
N/A
Protected

Academic year: 2020

Share "Multi Answer Focused Multi Document Summarization Using a Question Answering Engine"

Copied!
7
0
0

Loading.... (view fulltext now)

Full text

(1)

!"#$%#&'(

!"# $%#&'

!

)

* +

,Æ +

- ** *

.)/ 0

.1-/ 2

+

3 (

4

* ) + )

(

( . +!##&5 6

+ !##!/+ * (

(

* * 0+ 3

78!##9 4*

( (

( 0.3

!##9/+ 4

2

**

, *

(*

+

: * *

44

* ( 0 ((

(

(* 1- (

(0+ 6*

4 +

((

* 4

) 0 (

* (+

- 0

0 *

( ( (

*

(((

+ (( ((

0 (

((2+ &/ 2

((

!/

9/;

4

+ < (

0+

./ 8 ( *

1-

( &/+ . 9+/

.*/ 8 ( *

) . )/

((***

( &/ 9/+ .

"+/

./ 8 2

* =2 = )

.==)/ ( !/ 9/+

. >+/

6 ./

(* .

%/+ < &

4'(

* (( +

(

7 0 (

?

+ ( 2

2

(2)

Smoothing of Sentence Importance

with a Hanning Window Function

Sentence Selection by MMI-MS

Arranging Selected Sentences in an order determined by the cluster

structure of documents and the chronological order of documents

A set of

Documents

A set of

Questions

Calculation of

TF*IDF*IGR as

words’ weight

Calculation of QA

score as words’

weight

Mixing two kinds

of Sentence

Importance

(3)

),-.../$/0)-,123/$///4/ ),-.../506)-,7)-$/// $/ ),-..(/$0$)-,78-$///6/ ),

-..(/60&)-,7)-$///5 / ),-..(/&/-)-,7*-$///5/ .(/--/-&5 .(/60&/0. .(/$0$-50 .(/0-5/-(

< !B ! "#$%"&' $%($

9:;< = :< =! :< = :< =

: :<=

28<=! :

<=>:

<= :<=?: <

< 9B )* #$%"&'$%($

1 % 28

?9:;

1 28 ! ?

: :

? :@9";

0& :

28:

282A 28?

: @

9"; 0&

28 : @

9"; 06 %

< "B +! *

' $

$ (

= +.= !##!/ ((

(

0 * )+ 2

*

(

(** *

+ 6 (

* ( (

+ E

* *

) (**

*

, +

(+

&+ 6 *

4

*

(

( < %

*

+

*

+

!+ 6*

+ 3

* (

+ 6

+ 6 (

)(( (

./

+

Cluster of all

documents

(Document DB)

Cluster of

unretrieved

documents

Cluster of all

retrieved (given)

documents

Cluster C

root

Cluster C

1

Cluster C

n

IGR(w,C

root

)

IGR(w,C

1

)

IGR(w,C

n-1

)

IGR(w,C

n

)

IGR_ave(w,D)

+

+

+

+

D

W

/ (n+1)

< %B 6 6 ./ *

)

* *

* 2

++ + 6 , (

.

/

(4)

.

/+ < (

.

/

,

*

.

/

.

/

< ."/ 2 +

(

'

) $% $

* +%

*

( (+ 6

*

* *

*

0 *

(+

( *

( + 6

(

+

(

4 ,+

(

( ( *

(

* + -

*

( +

( (

* (

+

, "# *

$$*

6

* =2= ) .==)/

(( * 8* + + .8*

&$/+ ==)

( *

*

0+ - <.>/ ((*

(

* (B

¾ Ò

¾

*

4

*

(

+ ((

===.=2= (

F=/+ 6 ((

<.>/ (

*

* * (

+ -

(

( * +

(

+

- ( $$*

(( 2

( (

2* 0

(+ ( (

D*

+

*

(

+ < (

+ *

+

,

+

. /0 $# /#

6 ((

8)"89+ 8)8

( 2 4

4 *

'(.<3!##&/+

( 8)"89 G

'!##". + !##"/+ (

( ((

( B &/ (

?2

( 2 !/

?

( *+ 2

* ((

* 4+

9#

(+ (

7'(((

* 4 ( (

.C ? CE ?/

0 . E /

( *

+ ( C ?

CE ? *%H &#H (+

(

, ,2((

(( * 4 *

+ 6 ((

*

(5)

CE ?+ 2

System using no qestions

System using qestions

*

System using no questions

System using questions

./

Answer Coverage (Edit Distance)

Answer Coverage (Exact Match)

Lead

System using no questions

System using questions

*

Answer Coverage (Edit Distance)

Answer Coverage (Exact Match)

Lead

System using no questions

System using questions

(6)

0.1

Answer Coverage (Exact Match)

Alpha

Answer Coverage (Edit Distance)

Alpha

Lead (Short)

Lead (Long)

Human (Short)

Human (Long)

IGR+MMR+QB (Short)

es

Short

Long

(7)

.8* &$/+

+. + !##&/ ( 4

2

4+

0 4

B &/

( 0 !/

( 1-

( 2 *

+ (

2

4 1- + (

8*(1-

78!##9 *

.+!##9/+ 4

+. 4 !##9/

(G C8*(?

0

4+

(* ((

* +

* 0

* 4

0.*

&$5: =!###5*

!##9/+ 3 +.3

= 4 !###/ 2

( 0 +

+. +!##&/(

0*

( 0(+ 6

+.6+!##!/2 *

(0

2 + 3 2

(*$

((

1-

( * (

0+

"#

4 4

* ( 0

(

1-+ 6 ((

4 +

(( * (

* *

((? 8)" 89

< )+

- * (

( *

+ 3

,

+ - (

( 1- +

(

2

* 4+ *

(0 K8

+ < 1-

*+ 6(4

((2 +

(( *(

0 *

(

((

*

(( ( +

@ B + 7 0///

'

!"! ###$

0.$C&/-) # ) 3 -..( 2

77 :@ !

:*%

% & '

(

( &&5C&&4

!'! 7+! 0//- D

%#D

)E%#F !

##%&) )

3! 0//& #

G==!==D

> ,! %! >! !

0//- DD

!

! ##% &)

4-C4(

> 7 +! !

'! > " 0//$ D

% # & H D

"# ! $ H

*+

$/6C$-- )

7 + 8! 'I

8 0//&

, , +

& 1(4:E.FG-45(C

-44( %%D*

7 0//0

H H

%-

! ".! ' #$

4((C4.$

"! @%J %! %

@3 @ %%

B > 8 78

0//& # : 2

#0//& /

##

#!" %%! 0//&

# A=",2 :2#0//& D

/ ##

7+!>I7! 0///

D

-4E$FG56(C5(5

* + ) , 0//&

:2#0//&G D

/ ##

7 % -..(

@%

%

& ' (

( 0C-/

> : ' 0//0

%

References

Related documents

When a category is selected, our method retrieves documents using each word in the selected category and the original query words, then checks the number of answer candidates in

Using a fixed document collection and question set, an initial set of answer locations is used as an evaluation set alongside the TREC answer keys. and

1222 pdf ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? # ? ? ? ? ? % & ( * , , 2 4 2 7 8 9 ; ; > ? @ A C E @ 9 ? H I K H K C N A O P C S 9 N C E 9 N V X Y Y Z [ ] ^ ` C 9 b

The remainder of this paper is organized as follows: Section 2 discusses the process of human summarization activity with some historical background, Section 3 discusses

The inverse sentence frequency is a measure of the general importance of the term (obtained by dividing the total number of sentences by the number of sentences containing the term,

Its approach is based on the context where candidate answers appear, by measuring distances between question and answer terms and by using similarity measures to compare

In Proceedings of the Twelfth Text REtreival Conference (TREC 2003), pages 426-435, Gaithersburg, Maryland. Maya: A fast question-answering system based on a predictive