• No results found

Application of Personalized Study Recommendation Service for Big Data Oriented Education

N/A
N/A
Protected

Academic year: 2020

Share "Application of Personalized Study Recommendation Service for Big Data Oriented Education"

Copied!
6
0
0

Loading.... (view fulltext now)

Full text

(1)

n o e c n e r e f n o C l a n o it a n r e t n I 8 1 0

2 Communicaiton ,NetworkandAritifcia lIntelilgence(CNA I2018) 8

7 9 : N B S

I -1-60595- 50 -5 6

e

c

i

v

r

e

S

n

o

it

a

d

n

e

m

m

o

c

e

R

y

d

u

t

S

d

e

z

il

a

n

o

s

r

e

P

f

o

n

o

it

a

c

il

p

p

A

f

o

r

B

i

g

D

a

t

a

O

ir

e

n

t

e

d

E

d

u

c

a

it

o

n

o

a

B -

x

i

a

n

J

I

A

a u

n

d

S -

z

h

e

n

B

A

O

9 5 0 2 5 2 , e c n i v o r P g n o d n a h S , y ti s r e v i n U g n e h c o a i L

: s d r o w y e

K Educaiton b i d ag a , t Personailzed recommendaiton, Collaboraitve iftlering, Semanitc simliartiycomputaiton, Ontology.

. t c a r t s b

A Witht hecomingoft heeraofbigdataandt hei ntroductionofpersonalizededucationcon -i f f e e h T . t o p s t o h e m o c e b s a h y l k c i u q s e c r u o s e r e l b a u l a v r i e h t e u l a v o t s t n e d u t s e d i v o r p o t w o h , t p e c

r e p f o y c n e i

c sonailzed recommendaiton serviceoflargeeducationa ldataismainly reflected in the l

a e

r -time of the smal lflie storage of educaitona lresources and the accuracy of the recommended it

l u m r o F . m h t i r o g l

a -sourcedata ,heterogeneouscharacteristicsoft heeducationdata ,wedesigneda m e l b o r p e h t e v l o s o t p o o d a H n o s e c r u o s e r n o it a c u d e s s a m f o e m e h c s n o i t a z i m i t p o e g a r o t s e li f l l a m s

l a e r e v o r p m i d n a m e t s y s g n i s s e c o r p a t a d n o it a d n e m m o c e r w o l s f

o -timepersonalized recommenda

-d n e m m o c e r g n i r e t li f e v i t a r o b a l l o C . n o i

t aitonalgorithmi sverysuccessfuli nmanyapplicationareas . e h t d e v o r p m i e w , e r o f e r e h T . h g i h t o n s i y t i l a u q s t l u s e r d e d n e m m o c e r e h t , a t a d n o it a c u d e l a i c e p s r o F

o y z z u f d e s u e W . a t a d g i b n o it a c u d e g n i t p a d a m h t i r o g l a n o it a d n e m m o c e r g n i r e t li f e v i t a r o b a l l o

c pti

-l a e r o c e h t f o s t h g i e w s r o t c a f t n e r e f f i d e n i m r e t e d o t t p e c n o c y g o l o t n o d n a t h g u o h t g n i t r o s n o i t a z i m

-m i e b o t l a it n e s s e s i n o i t a t u p m o c y t i r a l i m i s c i t n a m e S . m h t i r o g l a g n i r e t l i f e v i t a r o b a ll o c f o m h t i r o g

i w h c r a e s e r e h T . n o it a l u c l a c f o y c a r u c c a e h t r o f d e v o r

p l lprovideguidanceforbigdataineducation .

a e r a

d n u o r g k c a B

e l i h W . t n e s e r p t a d e t c e l l o c e r a a t a d l a r u t c u r t s d n a t i c i l p x e e r o m d n a e r o m , a t a d g i b l a n o it a c u d e r o F

t s o m e h t e t a r e n e g y l l a e r t a h t a t a d e h t d n A . d e r u t c u r t s n u e r o m s i g r e b e c i e h t r e d n u a t a d e h

t valuefor

s k n il e e r h t e h T . d e d n e m m o c e r d n a d e t a v a c x e e b o t d e e n a t a d g i b e h T . g r e b e c i e h t r e d n u s i n o i t a c u d e

e h t y l l a i c e p s E . n o i t a d n e m m o c e r s i s y l a n a a t a d d n a n o i t a t u p m o c , e g a r o t s n i t s i x e a t a d g i b n o i t a c u d e f o

a d n a e r u t c e t i h c r a l a c i n h c e t f o n o i t c u r t s n o c e

r lgorithmi st het echnologyi nnovationfocusi nt hef uture r e s u s t e e m t a h t n o i t a m r o f n i d n e m m o c e r d n a t c e l e s o t w o h , d a o l r e v o n o i t a m r o f n i f o e g a s ' y a d o t n I . ] 1 [

h w , r e v e w o H . g n i t u p m o c e c i v r e s f o d l e i f e h t n i s e u s s i y e k e h t f o e n o e m o c e b s a h y l e v it c e f f e s d e e

n en

y n a m e r a e r e h t , y l t c e r i d a t a d g i b l a n o i t a c u d e o t d e i l p p a e r a s m h t i r o g l a n o it a d n e m m o c e r g n i t s i x e

l a e r n i s k c a b w a r

d -timeandaccuracy .Therefore,i mprovingt hewayofstoringbigdataandi mproving o

p m i t s o m e h t o s l a e r a s m h t i r o g l a n o i t a d n e m m o c e r g n i t s i x

e rtan taspec tofeducationa lbig dataap -.

h c r a e s e r n o i t a c i l p

-m o c e r d e z i l a n o s r e p f o d l e i f e h t n i s e g a t n a v d a l a r u t a n s t i s a h m h t i r o g l a g n i r e t l i f e v i t a r o b a l l o c e h T

, s t c e j o r p w e n r o f t r a t s d l o c e h t , s r e s u w e n r o f t r a t s d l o c e h t n i e i l s g n i m o c t r o h s s t I . n o i t a d n e

m andt he

-e r g n i n r a e l , n o i t a c u d e f o d l e i f e h t n I . m e t s y s e h t f o e l c y c e f i l e l o h w e h t t u o h g u o r h t y ti s r a p s a t a d

t e n r e t n I e h t f o t s o M . e r u t a m y r e v t o n e r a e c i v r e s n o i t a d n e m m o c e r d e z i l a n o s r e p d n a y r o m e m s e c r u o s

l e v i t a l e r s e r o c s r a l i m i s d n e m m o c e r m r o f t a l p g n i n r a e

l y higher learning resource according to user t

I . y r o t s i h h c r a e

s cannotminepotenitali nterest ,withuserspersonalizedf eatures .Therefore,t heshort . s u o i v b o e r o m s i a t a d g i b n o i t a c u d e f o n o i t a c i l p p a n i m h t i r o g l a g n i r e t l i f e v it a r o b a l l o c f o s g n i m o c

a l i m i

S rity computaiton is oneofthe mos timportan taspects of building an efficien tcollaborative -r o f r e p d n a y c a r u c c a n o it a d n e m m o c e r e h t n o t c a p m i t a e r g a s a h t I . m e t s y s n o i t a d n e m m o c e r g n i r e t l i f

. m e t s y s n o i t a d n e m m o c e r e h t f o e c n a m

e r n i m e l b o r p n o i s i c e r p w o l e h

T a -ltimerecommendationandrecommendationqualityexistswhen e h t e v a g e w o S . e c r u o s e r a t a d n o i t a c u d e e h t o t m h t i r o g l a g n i r e t l i f e v i t a r o b a l l o c l a n o i t i d a r t g n i y l p p a

o d o h t e m n o i t a l u c l a c d e v o r p m i n a n o d e s a b e m e h c s e g a r o t s s e c r u o s e r n o i t a c u d e f o n o i t a z i m i t p

o f

(2)

e c n a c i f i n g i s l a c i t e r o e h t t n a t r o p m i s a h h c r a e s e r e h T . m h t i r o g l a g n i r e t l i f e v i t a r o b a l l o c n o i t a d n e m m o c e r

. g n i n r a e l d e z i l a n o s r e p e h t r o f e u l a v n o i t a c il p p a d n

a

s u t a t S h c r a e s e R

-r o f r e p e h t d e c n e u l f n i t a h t s r o t c a f 5 d e z y l a n a . l a t e D G N A I J , ] 2 [ t n e m u c o d n i , h c r a e s e r n g i e r o f n I

n i ) g n i l u d e h c s d n a g n i x e d n i , l e d o m g n i t u p m o c , g n i s r a p a t a d , l e d o m O / I ( m e t s y s p o o d a H f o e c n a m

m r o f r e p g n i d n o p s e r r o c d e s o p o r p d n a , e r u t a r e t i

l ancei mprovemen tmethodfort hese5factors .Indoc -e h t o t g n i d r o c c a n o i t r o p o r p n i a t a d g n i r o t s f o d o h t e m e h t d r a w r o f t u p . l a t e g n o i J e i X , ] 3 [ t n e m u

-o r e t e h e h t t n u o c c a o t n i s e k a t y g e t a r t s e g a r o t s a t a d s i h T . e r u t a r e ti l e h t n i s e d o n f o r e w o p g n i t u p m o c

y t i e n e

g ofnodes ,andi mprovest heserviceabiiltyofHadoopi nheterogeneousclusters .Indocumen t n o s e l i f l l a m s f o y c n e i c i f f e e g a r o t s e h t e v o r p m i o t d o h t e m S F D H E a d r a w r o f t u p S r a k e s a r d n a h C , ] 4 [

e h t e c u d e r o t s e l i f d e t a i c o s s a e h t s e n i b m o c e m e h c s s i h T . p o o d a

H amoun tofmetadatainformation

s t n e m u c o d d e t a l e r e h t , r e v e w o H . s e li f l l a m s f o y c n e i c i f f e s s e c c a e h t e v o r p m i o t m s i n a h c e m x e d n i d n a

. s e l i f d e t a l e r e h t f o s n o i t a r e p o d e t a l e r e h t e c u d o r t n i t o n d i d

n I f o n o i t a d n e m m o c e r d e z i l a n o s r e p n o s e h c r a e s e r e m o s e r a e r e h

T terne tlearning platformin the

. d e v l o v n i y l e r a r s i s e c r u o s e r g n i n r a e l f o n o it a d n e m m o c e r d e z il a n o s r e p n o h c r a e s e r e h t t u B . d l r o w

n o i t a m r o f n i , s e c r u o s e r g n i n r a e l r o f s r e n r a e l t e n r e t n I f o s t n e m e r i u q e r t n e g n i r t s y l g n i s a e r c n i e h t h t i W

e b s i n o n e m o n e h p d a o l r e v

o comingmoreand morecommon .Personalizedrecommendationservice t e n r e t n I f o l l a f d n a e s i r e h t e d i c e d o t e c r o f g n i d a e l e h t e m o c e b y l l a u t n e v e l l i w s e c r u o s e r g n i n r a e l f o

. m r o f t a l p g n i n r a e l

o n i m r e t f o d o h t e m n o i t a l u c l a c y t i r a li m i s c i t n a m e s e h t n o h c r a e s e r e h

T logybyforeignresearchers

s i s u p r o C . s u p r o c n o d e s a b n o i t a l u c l a c y ti r a l i m i s c i t n a m e s s i s u c o f e h T . tl u s e r h c i r a d e m r o f s a h

n o i t a m r o f n i l a u t u m d e t n i o p a e c l a h i M , s u p r o C l a n o i t a N h s it i r B , t e N d r o W e h t n i d e t a r t n e c n o c y l n i a m

d e s a b s u p r o c f o s d n i k o w t A S L d n

a methodsindocument[5] .GabrilovichwithWikipedia ,pu tfor -d n a d r a w r o f t u p s i + + T a u q S o c r a M . ] 5 [ t n e m u c o d n i d o h t e m ) s i s y l a n a c i t n a m e s y a l p s i d ( A S E d r a w

e h T . ] 7 [ t n e m u c o d n i k s a t h c r a e s b e W r o f y l n i a m , d o h t e m e e r t g n i n n a p s m u m i x a m e h t f o e c n a l a b e h t

s s e c c u

s fu lapplicaitonsystemofsemanticsimilartiyi ncludest hefollowing.

l a v e i r t e r f o s i s a b e h t n o M B I d n a e l p p A y b d e p o l e v e d m e t s y s l a v e i r t e r c i t n a m e s a s i k e e S o t n O

a e s a s i e l g o o w S . l a v e i r t e r y r o t c e r i d r o f d e s u y l n i a m s i m e t s y s s i h T . ] 8 [ t n e m u c o d n i t n e t n o

c rchengine

-r e t n I e h t n o a t a d d n a s m r e t d e t a l e r d n a s t n e m u c o d c i t n a m e s r o f g n i h c r a e s y l n i a m , e l g o o G o t r a l i m i s

c it n a m e s h g u o r h t y r o t i s o p e r y g o l o t n o w e n g n i t c u r t s n o c d n a , s e c r u o s e r y g o l o t n o d e t a l e r g n i r i u q c a , t e n

. ] 9 [ t n e m u c o d n i g n i n o s a e r

s r e p n o h c r a e s e r e h

T onailzedl earningrecommendationserviceofeducationbigdatai sahott opic h c r a e s e r c i t s e m o d e h t ,t n e s e r p t A . t n e s e r p t a e d a m n e e b e v a h s t n e m e v e i h c a w e f t u B . t n o r f e r o f e h t n i

n i a m , p o o d a H n o k r o w h c r a e s e r f o t o l a e n o d e v a h s e i t i s r e v i n u d n a s n o i t u t i t s n

i lyconcentraitngi nt he

-u c e s d n a t n e m e g a n a m e c r u o s e r m e t s y s , y ti l i b a l i a v a m e t s y s , n o it a z i m it p o e c n a m r o f r e p , e g a r o t s a t a d

s tl u s e r e h T . n o it a c u d e y t i n u m m o c e c r u o s n e p o p o o d a H e h t o t s e t u b i r t n o c h c r a e s e r e h t f o t s o M . y t i r

h c s n o i t a z i m i t p o e g a r o t s e li f l l a m s f

o emeresearch scattered ,farfromformingthesystematictheory e h t n i h c r a e s e r e h T . n o i t a c il p p a l a c i t c a r p d n a h c r a e s e r l a c i t e r o e h t n e e w t e b p a g g i b a s i e r e h T . m e t s y s

e h t n i d e t a r t n e c n o c y l n i a m s i t i t u b , e r u t a m e r o m s i n o i t a d n e m m o c e r d e z i l a n o s r e p f o d l e i

f business

n e e b t o n s a h s e c r u o s e r l a n o i t a c u d e r o f e l b a t i u s m h t i r o g l a n o i t a d n e m m o c e r f o y r o e h t e h T . d l e i f

. d e m r o f

s e c r u o s e R n o it a c u d E s s a M f o s e li f ll a m S f o n o it a z i m it p O e g a r o t S

e g a t n a v d a g n i k a t d n a , s e li f a t a d g i b n o it a c u d e f o s c i t s i r e t c a r a h c e h t g n i y d u t s y

B ofotherresearchers '

f o s t n e n o p m o c t n a v e l e r g n i s u y b s r e t s u l c p o o d a H d e z i m i t p o d n a d e v o r p m i e w , s e i g e t a r t s t n e l l e c x e

. m e t s y s o c e p o o d a H

. s m e l b o r p y n a m g n i c a f o s l a s i a t a d g i b n o it a c u d e f o e g a r o t s e l i f , a t a d g i b f o d n u o r g k c a b e h t r e d n U

e m e h c s e h t n o d e s a

B proposed byscholarsa thomeandabroad ,wetrytoimprovetherelatedcom -a o t n i s e l i f l l a m s g n i p p a m , e li f e g r a l a o t n i s e l i f l l a m s e h t d e t a r g e t n i e W . p o o d a H n o d e s a b s t n e n o p

n a r e v r e s t n e il C e h t n i y g e t a r t s e h t e h c a c e h t d e h s i l b a t s e e h t e w d n A . e l i f e g r a

(3)

h t i w s e li f l l a m s d e r u t c u r t s n u g n i c a f p o o d a H n e h w d e v o r p m i s i m e t s y s e h t f o y c n e i c i f f e t n e m t a e r t e h T

g n i w o l l o f e h t e n o d e v a h e w , e v o b a e s o p r u p e h t e v e i h c a o t r e d r o n I . e c n a m r o f r e p e h t n i s e p y t x e l p m o c

. s t c e p s a

h c e h t f o s i s y l a n a e d a m e

W aracteristicsoft hebigeducationdata .Theeducationbigdatai nvolves e v i t c e p s r e p e h t m o r F . e s r e v i d d n a x e l p m o c e r a s e p y t a t a d e h t d n a , s d l e i f y n a m d n a s t n e m t r a p e d y n a m

n e g n o i t a m r o f n i t c e r i d n i d n a n o it c e l l o c t c e r i d o t n i d e d i v i d e b n a c t i , n o i t c e l l o c f

o eration .Theacquired

a t a d d e r u t c u r t s n u f o t n u o m a e g r a l a s n i a t n o c o s l a t u b , a t a d d e r u t c u r t s o t d e n i f n o c r e g n o l o n s i a t a d

-o r p a t a d g i b f o e s u e h T . s c i h p a r g d n a n o it a m r o f n i n o i t a c o l c i h p a r g o e g , o e d i v , e c i o v , t x e t s a h c u s

l p a s a p o o d a H o t y g o l o n h c e t g n i s s e

c atformforstoring and processingmassivestatistica ldataisa s s e c c a e l i f w o l d n a e r u s s e r p y r o m e m h c u m o o t f o m e l b o r p e h t t a g n i m i A . a t a d g i b e t a c u d e o t y a w w e n

e f o s c i t s i r e t c a r a h c e h t h t i w d e n i b m o c e w , p o o d a H n i s e l i f l l a m s h t i w g n i l a e d n e h w y c n e i c i f f

e duca

-i t p o o t t l i u b s i k r o w e m a r f s s e c c a e l i f l l a m s A . r e t s u l c p o o d a H r e m r o f e h t e z i m i t p o o t a t a d g i b l a n o i t

e h t , r e g r e m e l i f l l a m s f o n g i s e d e h t n o s e s u c o f t I . s e l i f l l a m s f o s s e c o r p l a v e i r t e r d n a e g a r o t s e h t e z i m

h t d n a , s e l i f e g r a l o t s e l i f l l a m s f o g n i p p a

m edesign ofcachingandprefetchingmechanism .Thede -i t p o e h t n i d e v l o v n i s a w y g e t a r t s t n e m e c a l p e r e h c a c d n a y g o l o n h c e t e l i F p a M e h t f o s i s y l a n a d e l i a t

. y g e t a r t s n o i t a z i m

p o g n i d a e r d n a e g a r o t s e l i f m e t s y s S F D H d e v o r p m i e h t f o n o i t a u l a v e e v i t a t i t n a u q e v a g e

W eraiton

-e S d n a r e t s u l c p o o d a H e h t f o e c n a m r o f r e p e h t d n a , t li u b s i m r o f t a l p l a t n e m i r e p x e e h T . y c n e i c i f f e

d a e r a t a d d n a e g a s u y r o m e m e d o N e m a N f o s m r e t n i d e z y l a n a d n a d e r a p m o c s i d o h t e m e l i f e c n e u q

e l i f l l a m s f o e c n a m r o f r e p e h t s s e s s a o t r e d r o n I . y c n e i c i f f

e access ,wecomparedt herandomexperi

-. 1 e l b a t n i n w o h s s a , d e r i u q e r s e l i f l l a m s 0 3 , 0 2 , 0 1 n o i t a c o l e m a s e h t m o r f a t a d t n e m

s e l i f d a e r m o d n a r f o s i s y l a n a y t i l i b a p a C . 1 e l b a

T .

s e l i f f o r e b m u N t

n e m n o r i v n

E 1 0 2 0 3 0 4 0

S F D H l a n i g i r

o 3700ms 7000ms 9000ms 13000ms d

e v o r p m

I HDFS 1100ms 2200ms 3000ms 4400ms

l a n i g i r o e h t n a h t s s e l s i e li f g n i d a e r S F D H d e z i m it p o f o e m i t e h t , e l b a t e h t m o r f e e s n a c u o y s A

h c i h w , d e t p o d a e r a r e v r e s g n i h c a c d n a m s i n a h c e m g n i h c t e f e r p x e d n i e h t t a h t s i n o s a e r e h T . S F D H

e h t s e c u d e

r numberofNamenodeaccess ,whichgreatlyi mprovest heaccessefficiency.

n o it a l u c l a C y ti r a li m i S c it n a m e S f o d o h t e M d e v o r p m I

f o n o i t a l u c l a c e h t g n i r e d i s n o C . d e v o r p m i s i t h g i e w g n i k r a m f o d o h t e m n o i t a l u c l a c d e v o r p m i e h T

c o v e h t e n i b m o c e w , s t h g i e w n o i t a t o n n

a abulary frequency ,location of words ,and feedback from -e r f d r o w e h t e c u d o r t n i o T . n o it a t o n n a f o t h g i e w e h t e t a l u c l a c o t s r e t e m a r a p s a s r e s u l a n o i s s e f o r p

F T m h t i r o g l a d e t h g i e w l a n o i ti d a r t e h t s i y c n e u

q -IDF isaword(conceptsin ontology)appearinthe h

g i

h erfrequenciesmoreoftenandmoreabletorepresen tthetex ttheme ,introduced theposition of -n o c e h t , n o it c e s w e i v e r e h t n i e l c i t r a n a n i , e l t i t y r a l u b a c o v e h t o t s r e f e r r e t e m a r a p a s a y r a l u b a c o v

l u b a c o v e h t n i t x e t n a h t t n a t r o p m i e r o m s i n o it c e s n o i s u l

c ary ,and feedback isused to adjus tthe

c i t n a m e s e h t y b t h g u o r b s t c e f e d e l b i s s o p n o i t a m o t u a e h t p u e k a m , n o i t a t o n n a l a i c i f i t r a e h t f o s t h g i e w

. g n i d n a t s r e d n u

. s w o l l o f s a s i a l u m r o f n o i t a l u c l a c t h g i e w c i f i c e p s e h T

i δ + i c o l + i n e r f = i t h g i e

w ( 1) . n o i t a t o n n a f o e u l a v t h g i e w e h t s a d e s u s i I . y g o l o t n o f o t p e c n o c e h t o t s r e f e r i t h g i e w , a l u m r o f e h t n I

o l o t n o f o r e t e m a r a p y c n e u q e r f d r o w e h t s t n e s e r p e r i n e r

F gyconcep tI ,loc istandsfortheconcep tof . r o t c a f t n e m t s u j d a l a i c i f i t r a e h t s i I a t l e d , n o i t a t o n n a e h t f o r e t e m a r a p n o it a c o l e h t s a d e s u s i I , d r a d n a t s

F T e h t g n i s u s r e t e m a r a p y c n e u q e r f e h t e t a l u c l a c t s r i

(4)

e h t d i a s ) I W ( F D , t e s t n e m u c o d e h t n i s t n e m u c o d f o r e b m u

n documen tseti nt hecurren tw idocumen t .

r e b m u n

i n e r

f � Nn� logdfCw

i� (2)

, e c n e i r e p x e e h t o t g n i d r o c c a , s r e t e m a r a p n o it a c o l e h t e n i m r e t e d n e h

T mos tofthefirs tparagraph

-a r a p g n i d u l c n o c e h t s i n o i t c e s t s a l e h t d n a , n o it c e s n o it c u d o r t n i e h t n e t f o s i s e c r u o s e r l a n o i t a c u d e f o

-r a p p a , e z i l a r e n e g l l i w r o h t u a e h t e l c i t r a e h t n i n o i s u l c n o c d n a n o i t c u d o r t n i , e l t i t e h t n i , n e t f o h p a r g

a t p e c n o c a o s l a y l t n

e ppearedinthethreepostiionsthansomeoftherelativeimportanceoft het ext . s r a e p p a t p e c n o c a f i t a h t w o h s o t s r e t e m a r a p y c n e u q e r f d r o w g n i n u t e n i f f o m r o f e h t e s u l l i w e w e r e H

n a h t e r o m s r a e p p a d r o w e h t t a h t o t t n e l a v i u q e s i t i , s n o i ti s o p 3 e s e h t n

o m timesin thetext .After

e h t h g u o r h t d e t a l u c l a c s i i c o l o s , i c o l , 0 s e k a t i c o l e s i w r e h t o , r e t t e b s i 3 f o t c e f f e e h t , m l a c i t s i t a t s

. a l u m r o f g n i w o l l o f

i c o

l � nN+3� logdfCw

i�� freni ( 3)

, k c a b d e e f l e n n o s r e p y t i r o h t u a e h t t n u o c c a o t n i g n i k a t , r o t c a f n o i t a l u g e r l a i c i f i t r a e h t s i t s a l e h T

y p u c c o d l u o h s n o it a l u c l a c t h g i e w e h t , r e p a p e h t f o g n i n a e m e h t s s e r p x e y l t c e r r o c n a c , y t il i b i d e r c h g i h

h t f o n o i t r o p o r p r e g r a l

a eformula,t obeconsideredwithmaximumweigh tofi deas ,whichi st ofind F

T e h t h g u o r h t e u l a v m u m i x a m e h t t u

o -IDFobtainedbyt hemethodoft hearticle ,andt hencombined F

T l a n i g i r o e h t I a t l e d f o t p e c n o c e h t k c a b d e e f e h t h t i

w -IDFvalue,t wowereaddedasI .

δi � ∑1nMax�frenj�� freni ( 4)

F T e h t y b d e t a l u c l a c t h g i e w e h t s i j n e r f e h t d n a , t n e m u c o d e h t f o e l p i c i t r a p f o r e b m u n e h t s i n e h

T

. J d r o w e h t f o F D

I Finally,t hewordfrequencyparameters ,positionparametersandaritficia ladjust l e b a l e h t f o t h g i e w e h t d n a , 4 . 4 a l u m r o f e h t o t n i d e l li f e r a I t p e c n o c e h t o t g n i d n o p s e r r o c s r o t c a f t n e m

. d e n i a t b o e b n a c

m , n ( R e s u e w , y t i r a l i m i s c it n a m e s e c n a t s i d n o d e s a

B )t oexpresst hesemanitccorrelationbetween c it n a m e s e h T . y g o l o t n o e h t n i s t p e c n o c d e t c i r t s e r n u o w t e h t s a 2 d d n a 1 d t e s d n a , s t p e c n o c o w t e h t

-a r e t i l n i d e d i v o r p s i h c i h w y a w g n i w o l l o f e h t n i d e t a l u c l a c e b n a c 2 D d n a 1 D n e e w t e b n o i t a l e r r o c

. ] 9 [ e r u t

R�n,m� � L(d1,d )12+η ( 5) e m a s e h t n i s t p e c n o c o w t n e e w t e b h t a p t s e t r o h s e h t f o h t g n e l e h t s n a e m ) 2 d , 1 d ( L , m e h t g n o m A

s i η . n o y h c r a r e i h t p e c n o

c aregulator,t hispapert akesηas1.

-s n a r t n o s e s u c o f n o i t a z i m i t p o y z z u F . s 0 7 9 1 n i d e t a n i g i r o d o h t e m d n a y r o e h t n o i t a z i m i t p o y z z u F

-r e c n u r o F . n o i t a m r o f n i l a m i t p o c i t s i n i m r e t e d o t n i n o it a m r o f n i n o i t a z i m i t p o y z z u f n i a t r e c n u g n i m r o f

ti n i f e d a , n o i t a m r o f n i n i a

t eboundary informaiton canbeobtained ,andthenadefiniteresul tcan be e h t f o s t h g i e w e h t e n i m r e t e d o t y r o e h t g n i t r o s n o it a z i m i t p o y z z u f e s u e w , r e p a p s i h t n I . d e n i a t b o

v i t c e j b u s e h t g n i d i o v a , y ll a c i f i t n e i c s d o h t e m n o i t a l u c l a c y t i r a l i m i s c i t n a m e s d i r b y

h eerroroftheex

-. e t a r u c c a d n a e l b a i l e r e r o m s tl u s e r n o i t a l u c l a c e h t g n i k a m d n a , s t r e p

s i a l u m r o f e h t d n a , ) 2 C , 1 C ( m i s s i 2 C f o y t i r a li m i s c it n a m e s e h t , 1 C s e m u s s A . 1 n o i t i n i f e d e h T

. 1 = a m m a g + a t e b + a h p l a , e r e H . ) 4 . 2 ( a l u m r o f

t s r i f o t s i p e t s t s r i f e h

T determine theprioriitesof simA ,simB and simC .Here wesimplify the c it n a m e s e e r h t g n i t n e s e r p e r , y l e v i t c e p s e r , 3 X d n a 2 X , 1 x , ) 2 C , 1 C ( C g n i s u y b M I S f o n o it a t n e s e r p e r

. x i r t a m y t i r o i r p y z z u f e h t t e g e w n e h T . y l e v it c e p s e r s e i t i r a l i m i s

n o r c i m e h t n e h w o

S isreduced to 0.7 ,thefirs tsecond linesareal lequa lto1excep tthediagona l n a c x i r t a m w e n f o s w o r d n o c e s , s n m u l o c d n o c e s w a r d n e h t , y t i r o i r p t s r i f e h t s a 2 X e h t o S . s t n e m e l e

a p u m u s o t o s , 3 X n a h t r e t t e b s i 1 X , d o h t e m e m a s e h t g n i s u y b d e n i m r e t e d e

b conclusion can be

f o n o i t r o p o r p e h t , s i t a h t , y t i r a li m i s c it n a m e s f o y ti r o i r p 3 t e g e w , d o h t e m s i h t y B . 3 x > 1 x > 2 x : n w a r d

(5)

-c a d e t a c o l l a e b t s u m t h g i e w n o i t a c o ll a e h t , y t i r o i r p t e g o t p e t s t s r i f e h t h g u o r h t : p e t s d n o c e s e h T

e d r o s i h t o t g n i d r o

c r .Thepriorityislarge ,andtheweighti slarge .Wege tthefollowingrestricitons p

e t s t s r i f e h t h g u o r h t

c i t n a m e s f o e u l a v m u m i x a m e h t , a t a d t h g i e w f o s t e s e e r h t e v o b a e h t o t g n i d r o c c a : p e t s d r i h t e h T

a l u m r o f n o i t u ti t s b u s e h t y b d e t a l u c l a c s i y t i r a l i m i

s ,whichi st heresul tofreturn.Basedont hehybrid . -n o c y l t c e r r o c n a c h c i h w , t s r i f d e z i t i r o i r p e r a C m i s d n a , B m i s , A m i s , n o i t a l u c l a c y t i r a l i m i s c i t n a m e S

g n i d r o c c a m e h t g n i n g i s s a , m e h t o t s t h g i e w g n i n g i s s a n e h T . h g i h s i n o i t u b i r t n o c r o t c a f e h t t a h t e d u l c

o

t priority ,assigning thesemanticsimilarity ofeachgroup by assigning values ,andpreserving the t h g i e w e h t , y a w s i h t n I . e t a r y c a r u c c a e h t e v o r p m i n a c h c i h w , e u l a v n r u t e r e h t s a e u l a v m u m i x a m

v i t c e j b u s e h t d n a n o i t a l u c l a c c i f i t n e i c s y b d e n i a t b o s i e u l a

v ityi savoided.

. ] 0 1 [ t n e m u c o d n i s d o h t e m 6 e h t y b d e t a l u c l a c s t l u s e r e h t h t i w d e r a p m o c e r a s t l u s e r d e t a l u c l a c e h T

-c e p s e r , e n i b m o c m i S d n a t x e t m i S , t g m i S , s k n i l m i S , l p d m i S , l p m i S e r a ] 0 1 [ e r u t a r e t i l n i s d o h t e m e h T

w o h s e r a s t l u s e r n o i t a l u c l a c e h t d n a , y l e v i

t ni nt able2.

e l b a

T 2. Resultsofsemanticsimilaritycalculation.

d r o

W Simpl Sd piml- lSiinmks- Simgt t Simtex Smibmincoe TMheitshod n

o i t a c u d E m o d s i

W -

n o i t a c u d E n o a t a d g i

b 0.4308 0.2154 0.0380 0.6106 0.2992 0.4298 0.5070 d

r a o b k c a l

B - chalk 0.5000 0.2500 0.0007 0.5443 0.0834 0.1391 0.3976 d

a e r

B - butter 0.4644 0.5573 0.0142 0.0696 0.1670 0.2585 0.6143 m

r o f t a l

P - Architec -e

r u

t 0.5000 0.5000 0.0140 0.5897 0.2621 0.4368 0.5321 t

n e d u t

S - Professor 0.4644 0.3333 0.0123 0.5400 0.2235 0.3725 0.3146 s

t n a r

G - Loan 0.4308 0.1436 0.0515 0.5706 0.3740 0.5789 0.6435 y

t i s r e v i n

U - Univer -y

t i

s 0.5283 0.2133 0.1603 0.6616 0.4459 0.7852 0.5371 n

o i t a c u d

E da n educa -n

o i

t 0.4644 0.3175 0.0664 0.5840 0.5855 0.7827 0.3642 y

t i l a u q s t r o p

S 0.5000 0.3333 0.0486 0.6019 0.2399 0.3998 0.6892 e

c i t c a r p n o i t a v o n n

I 0.4011 0.1604 0.0257 0.5945 0.1264 0.1957 0.2234 t

n e m y o l p m

E - ounter -e

t a r t r a

p 0.5000 0.5000 0.0563 0.5878 0.2601 0.4336 0.4634 t

u o p o r

D - unemploy -t

n e

m 0.5283 0.4227 0.0235 0.5832 0.4642 0.6667 0.5143

n o i t a v r e s b o d n a s t n e m i r e p x e t s a r t n o c y

B table2 ,wecanfindt hatt hemethodi nt hispaperi sbetter . s d r o w e m o s f o y ti r a l i m i s c i t n a m e s e h t g n i t a l u c l a c n e h w ] 8 1 [ e r u t a r e t i l e h t n i d o h t e m e h t n a h t

s k r a m e R g n i d u l c n o C

e h t e c n e u l f n i t a h t s r o t c a f y e k o w t e h

T quailty of recommendation servicein educaitona lbig data l

a e r e h t e r a g n i n r a e l d e z i l a n o s r e

p -timequality andaccuracy ofrecommendation ,whichisreflected e m m o c e r g n i r e t l i f e v i t a r o b a ll o c f o y c a r u c c a e h t d n a s e l i f l l a m s l a n o it a c u d e f o l e v e l e g a r o t s e h t n

i n

. m h t i r o g l a n o i t a d

l a e r n i n o i t a d n e m m o c e R e c r u o s e R a t a D n o it a c u d E e h t o t g n i d r o c c

A -timerecommendation and

d n a e m e h c s n o i t a z i m i t p o e g a r o t s f o y d u t s e d a m e w , m e l b o r p n o i s i c e r p w o l y ti l a u q n o i t a d n e m m o c e r

i t a r o b a l l o c f o d o h t e m n o i t a l u c l a c y t i r a l i m i s d e v o r p m

i ve filtering recommendaiton system .There . m h t i r o g l a g n i r e t l i f e v i t a r o b a l l o c n o i t a d n e m m o c e r f o y t i l a u q d n a e c n a m r o f r e p e h t s e v o r p m i h c r a e s

. g n i n r a e l d e z i l a n o s r e p f o e u l a v n o i t a c i l p p a d n a e c n a c i f i n g i s l a c i t e r o e h t t n a t r o p m i s a h h c r a e s e r e h T

t n e m g d e l w o n k c A

h

T isresearchispartiallysupportedbyNationa lScienceFoundationofChina61603169 ,Basicedu -.

(6)

s e c n e r e f e R

S ] 1

[ .WililamsonB .Digita leducationgovernance :datavisualization ,predictiveanalytics ,and‘real -s

t n e m u r t s n i y c i l o p ’ e m i

t [J] .Journa lofEducationPolicy ,2016 ,31(2): 1- .1 9 J

] 2

[ iang D .TheperformanceofMapReduce :anin-depth study[J] .ProceedingsoftheVLDBEn -2

7 4 : ) 1 ( 3 , 0 1 0 2 , t n e m w o

d - 34 . 8

o a i X n a u R , u h S n i Y , g n o i J e i X ] 3

[ -jun .Improvingmapreduceperformancethroughdataplacemen t s

r e t s u l c p o o d a h s u o e n e g o r e t e h n

i [C] .IPDPSW ,2010IEEEInternationa lSymposiumn. Altanta ,19 -1

, 0 1 0 2 l i r p A 3

2 - 9

S r a k e s a r d n a h C ] 4

[ . ,DakshinamurthyR .ANove lIndexingSchemeforEfficien tHandlingofSmal l s

e l i

F inHadoopDistributedFileSystem[C] .ICCCI ,2013Internationa lConferenceon .Coimbatore , 4-6Jan .2013 ,1- 8

R a e c l a h i M ] 5

[ . ,Corley C. ,StrapparavaC .Corpus-based and Knowledge-based MeasuresofTex t y

ti r a l i m i S c i t n a m e

S [C]/ /Naitona lConferenceonArtificia lIntelligenceandtheEighteenthInnova -6

1 y l u J , e c n e r e f n o C e c n e g i l l e t n I l a i c i f i t r A f o s n o i t a c il p p A e v i

t -20 ,2006 ,Boston ,Massachusetts ,

: 6 0 0 2 , P L B D . a s

U 57 -7 780.

E h c i v o l i r b a G ] 6

[ . ,Markovitch S .Wikipedia-based SemanticInterpretaiton forNatura lLanguage g

n i s s e c o r

P [J] .Journa lofArtificia lIntelilgenceResearch ,2014 ,34(4): 34 -4 498. A

o c r a M ] 7

[ .D. ,Navigl iR .ClusteringandDiversifyingWebSearchResutlswithGraph-BasedWord n

o i t c u d n I e s n e

S [J] .Computationa lLinguistics ,2013 ,39(3): 97 -0 754. u

G ] 8

[ arinoN. ,MasoloC. ,VetereG .OntoSeek :Content-BasedAccesst ot heWeb[J] .IEEEIntelil -:

) 3 ( 4 1 , 9 9 9 1 , s m e t s y S t n e

g 07 - .8 0

T i e W ] 9

[ . ,JiaY. ,Zhang Z. ,e tal .Improved hybrid semanitcsimilartiy algorithmforterminology n

o it a c i l p p

a [C]/ / Internationa l Conference on Natura l Computation and , Fuzzy Systems and :

6 1 0 2 . y r e v o c s i D e g d e l w o n

K 1734-1738.

L g n i D ] 0 1

[ . ,FininT .Boosting semanitcwebdataaccessusingSwoogle[C]/ /Nationa lConference :

5 0 0 2 , s s e r P I A A A . e c n e g il l e t n I l a i c i f i t r A n

o 1604-1605.

t a e F . A y k s r e v T ] 1 1

References

Related documents

As the temperature increases from room temperature, electrical conductivity of the polymers was also found to increase uniformly and it attains the lowest value

A complete data centric approach has been adopted to solve the problem of prediction by building a predictive model using machine learning algorithms with traffic volumes

Visit Step Entry Selection Page Item fi lle d by fil ls of re prese nted i n i s app ears as comp ose d of co mpos ed of pa rt of pa rt of of b oug ht v ia p art of compos ed of mad

We use the Nationwide Inpatient Sample (NIS), a nationally-representative database of inpatient admissions, from 2007 to 2011 to evaluate the effect of the young adult ACA insurance

Surprisingly, those who attended a program focused on interpersonal violence were more likely to respond ‘‘harassment’’ and significantly less likely to respond ‘‘stalking.’’

 Fluoxetine improved Depression through Serotonin mechanisms and probably improved the effectiveness of his hippocampus by increasing its volume and. therefore

In order for your chat agents and other WhosOn users to make use of the WhosOn service, they need to connect to the WhosOn application Server.. There are several methods

The distributions on the left cause all of the test statistics to have greatly inflated type-I error rates, while the distributions towards the right result in type-I error