• No results found

A Method of Measuring Term Representativeness Baseline Method Using Co occurrence Distribution

N/A
N/A
Protected

Academic year: 2020

Share "A Method of Measuring Term Representativeness Baseline Method Using Co occurrence Distribution"

Copied!
7
0
0

Loading.... (view fulltext now)

Full text

(1)

!" #$%&'"

()*+),

-./0,..123!" '&'"

! !

! " # $

%

& ' ! !

$ (

)

(*+) (&,) & &,

&, ( .//0) " "

-. "

%

( )

( ).//1

-% ( .//0)

45

2

( ) ()

()

%

3 & "

()

( ) $ % %

4 (.///) 55 %

(.///) !

"

(2)

% "

"

6 7 ! % !

& 8 3

!

! !

3

# ( ) ! 3 !

9 &, " *+ ( : .//1) & ! : 55 55; 5 5 5 ( ) 5: 8

, &, " % -"

%

(7 ./0< ./00) $

%

-

(7 ./0<)

% & !

( ./01) , " *+

%

-(4 .//=) (> .//<) " 2

(- ! .//1 .//?: .//@)

A" ;

(.)4

% %

(B)$

( )

! (<)$ *+

(@)

2 ! % (-./C0);

5 !

! 5

# ;

(3)

For any term T, if the term is

representative,

D(T),

the

set

of

all

documents containing T, should have

some characteristic property

compared to the "average".

55 5 5

;

Choose a measure M characterizing

a

document

set.

For

term

T,

calculate

M(D(T)), the value of the measure

for D(T). Then compare M(D(T)) with

B

M

(#D(T)), where #D(T) is the number

of words contained in #D(T), and B

M

estimates the value of M(D) when D

is a randomly chosen document set

of size #D(T).

" #

!

# "$ 2 (

%%&") "

%%&"

( .///) ( )

()

./// ''( - B (D"()) " %%&" ''(

!

.//1

- " (!( ")

!"(())

#(D()) "

# !

E ''( " " !

- < F(D()

''((())G ''(

()

( )

()

( ) () () ()

- < " ''(((

)) ''((( )) 55 55 ''(((

)) ''(((

))

''(((

)) ''(

''((()) D() # !

# !

& #(H)

" - < -

#(=)I

#(D) I = 3 D #(H)

" " (==) " <== (*

<== ) A (D

''(()) ((D) (''(())) ((D)(''(())) " - I F* J .==== * K .C===G (''(()) " . .=< L . =B<(D)(I= //1

- ''( (!( ''() !''((()) #(D()) ;

5 . 555 4 46 /4

555 4 4 /4 4 .

(4)

-

.//1 .==(''(())

6(# (D)) .)+ = ==@B<

= @1C A +@ //M 3< (0 )

+L@

7 "

%

() #

(!( ''() % &

()

()(I.C=) () (!( ''() .==(''((())) 6(# (D())) .)

" ' %

N (!( ''()

(!(

) I= C0<(!(

) I @ =? (!(

) I 1 ?=

!"#

(!(")

;

(.)&

(B)& % %

(<)

(@)&

$

&, ( - .) " (!(

")

( ) " .C?===

(" ) .//1

%

# B==== ?1=== % B B===

; ( )

()

( ) &

"

4

%

.

B=== # 5!5

$ 55 (

()

(+)) &

&'(%

( ) ( ) # B==== ;(!(H ''()(!(H %%&")

( %)

(.B===)

%()%

()

)

(5)
(6)

*+%

# %

(!(H%%&")(!(H''()

# 5

5B===%

(!

C1< @=/ (!(H ''() 2 (!(H

%%&") @C<

)*

# B====" @ . , 7 8 : 8 (3")B=== B==== B

(!(H''()

& (!(H''() (!(H %%&")

(!(H''()

(!(H %%&")

'

"

!"

! # $$$%&' $'$' $(&)

* $$+,+ $(+( $(-.

# " ' !

(!(H''() 7

! (!(H

''() " @ .

(!(H''() @ . ;

!"#!$! !%&'!# ( !"#! &&!"#! !"#!

)!"*!# #!+),-,!%! (

)+..*%''''*(+),-,!

7 <

"

4

#$%& #$%&' (!!! & '& &' '/ (! ! &/& &/& /& &''

#$%') #*') #*'++ (!!! '/&&/ '//& &/ (! ! & &'/ '

-0

55 (

@ .) 43** & - 0

!

#

7

# " @ B( @) .==M": 8 347&7

"

/*&+01

2"

#$%&')

#$%&'

#$%&'

#$%'

) #*') #*'++ , $&&% $&&% $&&+ $&&& $&(' $&$$

$&%$ $&-+ $&-( $&%& $%)& $%)$

(!(H

!!!! "!!!!!

(7)

References

Related documents

In this paper, to build a cost-effective and secure data sharing system in cloud computing, we proposed a notion called RS-IBE, which supports identity revocation and

When the foot radiographs (Chingford year 23) were scored using the LFA (Technique 1), the total (i.e. com- bined joints of left and right feet) prevalence of radio- graphic foot OA

Since several reports on miRNA profiling human cartilage [32], cancer [23] and general human tissues [21,36] have already been published, we chose to follow up on MMP-13 and IGFBP-

The risks for belonging to the worst scores (0 and 1 were combined due to small subject size) and the worst quartile were compared for standing balance test and for 6-m usual walk

DOE ~$34 million – Ames Lab – R&amp;D contracts – Project grants – Cooperative agreements USDA ~$48 million – ARS Research Facility at Iowa State University – Nat’l Animal

After first discussing patient-level barriers to accessing preventive care, we will consider some practices that care providers utilize to help patients overcome the

Initial acidic pH (or the alkaline pH) may not allow the efficient growth of the organism thereby affecting both acid production and phosphate solubilization Hence the given

O u r approach is a direct one; we do not, for instance, provide any evidence that Sterling movements are a good predictor of realignments (actually they do not appear to be) nor