• No results found

Lecture 9: Scale + Evolution (bonus)

N/A
N/A
Protected

Academic year: 2021

Share "Lecture 9: Scale + Evolution (bonus)"

Copied!
33
0
0

Loading.... (view fulltext now)

Full text

(1)

Lecture  9:  Scale  +  Evolution  

(bonus)

 

 

COMS  4995-­‐1:  Introduction  to  Social  Networks  

Tuesday,  October  4

th

 

1  

 

How  does  time  modify  large  networks?  

(i.e.,  how  will  FB  look  in  10  years?)  

(2)

Practice:  Make  your  own  power  law  

What  would  it  take  to  modify  

today’s  apple  policy  

…  and  obtain  a  power  law?  

1.  A  lot  of  apple  

2.  A  reinforcement  rule  in  

apple’s  distribution

 

(3)

*

Assume  infinite  amount  of  students  

o

Each  one  asks  a  question  (random  uniform)  

o

A  student  with  a  question  receives  2  apples  

*

One  for  her  (red),  One  to  distribute  (green)  

*

Send  the  green  apple  to  a  “friend”  chosen  randomly  

*

With  proba  p,  this  friend  keeps  it  

*

With  proba  1-­‐p  this  friend  sends  to  the  person  who  

already  received  a  green  apple  from  her.  

3  

(4)

4  

(5)

*

The  ubiquitous  power-­‐law  

*

The  three  sources  of  power  law  

o

reinforcement  

o

optimization  

o

artefacts  of  random  stopping  

*

A  bit  more  on  graph  evolution  

5  

(6)

*

A  generalization  of  our  previous  construction  

o

States  that  the  increase/decrease  of  an  entity  should  

be  made  proportional  to  its  size.  

*

Discrete  version:    

o

Let  X(t)  be  a  variable  (e.g.  your  income)  

discretized  

 Y(t)  =  j  if  and  only  if  your  income  X(t)  is  in  ]m

e

j

;  m

e

j+1

]  

o

proportional  evolution  :  P[Y(t+1)=  j  |  Y(t)=i]  =  f(j-­‐i)  

o

E.g.:  f(1)=p;  f(-­‐1)=1-­‐p,  then  P[X(∞)>x]  =  (x/m)

ln(p/1-­‐p)

 

 

6  

Proportional  effect  

A  brief  history  of  generative  models  for  power  law  and  lognormal  

distributions.  M.  Mitzenmacher  Int.  Math.  (2004)

 

(7)

*

A  generalization  of  our  previous  construction  

o

States  that  the  increase/decrease  of  an  entity  should  

be  made  proportional  to  its  size.  

*

Continuous  version:    

o

Let  X(t)  be  a  variable  (e.g.  your  income)  

o

proportional  evolution  :  X(t+1)=  F(t)  X(t)  

o

For  large  t,  X(t)  approaches  

lognormal  

distribution  

(i.e.  log(X(t))  ~  normal  dist;  X(t)  heavy  tailed,  not  PL)  

7  

Proportional  effect  

A  brief  history  of  generative  models  for  power  law  and  lognormal  

distributions.  M.  Mitzenmacher  Int.  Math.  (2004)

 

(8)

*

Skewed  distribution    

o

Can  be  explained  by  reinforcement,  propor.  effect.  

o

This  also  applies  to  degree  distribution  in  graph,  that  

also  exhibit  small-­‐world  and  “hub”  structure  

*

Be  careful  

o

Lognormal  and  power  law  may  both  be  obtained  

o

Graphs  obtained  do  not  exhibit  clustering  coefficient  

o

There  are  other  sources  explaining  power-­‐laws!  

8  

(9)

*

The  ubiquitous  power-­‐law  

*

The  three  sources  of  power  law  

o

reinforcement  

o

optimization  

o

artefacts  of  random  stopping  

*

A  bit  more  on  graph  evolution  

9  

(10)

*

First,  show  that  power  law  are  optimal  

o

Uses  resource  in  the  most  efficient  manner  

o

Creates  the  optimal  topology  

*

Second,  show  that  system  are  lead  to  optimal  

o

Following  Darwinian  argument:  highest  fitness  

o

Greedy  behavior  of  peers  (Game-­‐theoretic  appr.)  

10  

(11)

*

To  convey  information,  you  can  use  

o

Short  words  frequently,  longer  words  rarely  

*

A  measure  of  information:    

o

Average  cost  (in  characters)  

o

Ideally,  p

j

 decreasing  and  c

j

 =  log

d

(j)  

*

Thm:  Choose  Max(H/C)  obtained  when    

o

For  any  j,  derivative  w.r.t.  p

j

 ,  H

j

’/C

j

’  =  H/C    

o

Implies  p

j

 follows  power  law  in  i  

11  

Example  1:  Word  Frequency  

=

j

p

j

p

j

H

log

2

=

j

p

j

c

j

C

An  informational  theory  of  the  statistical  structure  of  language.  

B.  Mandelbrot,  Comm.  Theory.  (1953)

 

(12)

*

Assume  that  nodes  join  sequentially  

o

at  a  location  chosen  uniformly  on  the  plane  

o

Choose  a  neighbor  j  to  connect  to  minimize  its  cost  

o

Cost  (i)  =    α  d(i,j)+  h

j

;  d(i,j)  eucl.  dist.,  h

j

 

netw.  dist.  

*

Thm:    

 

12  

Example  2:  Internet  topology  

Heuristically  Optimized  Trade-­‐Offs.  A  Fabrikant,  E  Koutsoupias,  

CH  Papadimitriou.  ICALP  (2002)

 

1/√2  

c√n  

0  

α  

Star    

Network  

Exponentially  dist.  

Degree  

4  

Degree  

Heavy  Tailed  

(13)

*

Assume  peers  search  for  items  using  random  walk  

o

Ask  a  node,  if  it  does  not  have  ask  its  neighbors,  etc.  

o

Let  q

i

 denote  the  query  rate  of  item  i  

*

Each  node  keeps  some  item  in  cache  

o

Let  p

i

 denote  the  fraction  of  cache  devoted  to  item  i  

*

How  to  minimize  the  search  time  (i.e.  ∑  q

i  

/  p

i

)?  

o

Caching  more  popular  items  (blockbusters)  

o

Preserving  some  diversity  of  content  

 

13  

(14)

*

Thm:  

Square  root  

allocation  of  cache  is  optimal  

o

It  is  defined  as  p

i

 =  √q

i

 /  ∑

j

 √q

j

 

o

If  q

i

 follows  a  power-­‐law,  then  p

i  

as  well  

*

Thm:  This  allocation  can  be  obtained  by  a  simple  

distributed  mechanism  (

path  replication

)  

o

After  finding  the  item,  give  a  copy  it  to  all  nodes  

previously  contacted,  discarding  a  random  items.  

o

Since  search  size  ~1/p

i  

new  copy  produced  at  rate  √q

i

 

o

Copy  erased  at  rate  p

i  

~  √q

i

 

14  

Example  3:  P2P  Topology  

Replication  strategies  in  unstructured  peer-­‐to-­‐peer  networks.    

E  Cohen,  S  Shenker.  SIGCOMM  (2002)

 

(15)

*

The  ubiquitous  power-­‐law  

*

The  three  sources  of  power  law  

o

reinforcement  

o

optimization  

o

artefacts  of  random  stopping  

*

A  bit  more  on  graph  evolution  

15  

(16)

*

Assume  that  you  produce  a  large  corpus  randomly  

o

The  space  bar  is  hit  with  probability  p  

o

Each  other  character  has  same  probability  q=(1-­‐p)/d  

*

What  is  the  frequency  of  words  obtained?  

o

All  words  of  length  j  has  same  probability  pq

j  

o

 

There  are  d

j

 such  words  

o

Hence  P[word  with  rank  i]  =  p

i

-­‐α

 with  α=(-­‐ln(q)/ln(d))  

*

Random  corpus  also  exhibits  power  law!  

16  

Reality  test  #1:  Word  frequency  

Some  Effects  of  Intermittent  Silence.    

G.  Miller.  Am.  J.  of  Psyc.  (1957)

 

(17)

Reality  test  #2:  Internet  AS  graph  

Internet  routers  degrees  

follow  power  law.  

 

How  was  this  measured?  

 

Traceroute:  pick  a  source  

compute  shortest  path  

to  many  destinations.  

17  

On  power-­‐law  relationships  of  the  internet  topology  

Faloutsos,  Faloutsos  &  Faloutsos.  SIGCOMM  (1999)

 

(18)

Reality  test  #2:  Internet  AS  graph  

What  about  running  

Traceroute  on  uniform  

random  graphs?  

 

After  sampling  degree  

resembles  power  law!  

18  

Sampling  biases  in  IP  topology  measurements.    

(19)

*

Traceroute  does  not  discover  all  edges  

o

As  it  needs  to  be  on  the  shortest  paths  from  a  

destination  to  the  source  

o

Moreover,  this  reinforces  the  presence  of  edges  near  

the  source  

*

Thm:  For  general  degree  dist.  random  graphs,    

o

the  degree  distribution  changes  radically  

o

It  may  exhibit  heavy  tail/power  law  artificially  

 

19  

Reality  test  #2:  Internet  AS  graph  

On  the  bias  of  traceroute  sampling.    

(20)

*

Frequency  of  items,  popularity  of  objects  and  

distributions  of  many  variables  are  highly  skewed  

o

This  proves  that  it  cannot  be  the  sum  of  independent  

effect  (Central  limit  theory  would  predict  Normal).  

o

It  could  be  reinforcement,  or  the  result  of  a  bias  

(typically  random  stopping),    

o

Is  some  cases  (caching,  topology)  it  can  relate  to  

optimal  efficiency  

20  

(21)

*

The  ubiquitous  power-­‐law  

*

The  three  sources  of  power  law  

o

reinforcement  

o

optimization  

o

artefacts  of  random  stopping  

*

A  bit  more  on  graph  evolution  

21  

(22)

*

Graphs  evolve  over  time  

o

Densification  and  shrinking  distance/diameter  

o

Some  theoretical  justifications  

*

What  about  a  direct  likelyhood  comparison?  

o

Empirical  data  vs.  predicted  events  at  this  time  

o

Advantages:  (1)  no  focus  on  specific  property,    

(2)  quantitative  comparison  between  models  

22  

(23)

Graph Evolution: Densification and

Shrinking Diameters

JURE LESKOVEC Carnegie Mellon University JON KLEINBERG Cornell University and

CHRISTOS FALOUTSOS Carnegie Mellon University

How do real graphs evolve over time? What are normal growth patterns in social, technological, and information networks? Many studies have discovered patterns in static graphs, identifying properties in a single snapshot of a large network or in a very small number of snapshots; these include heavy tails for in- and out-degree distributions, communities, small-world phenomena, and others. However, given the lack of information about network evolution over long periods, it has been hard to convert these findings into statements about trends over time.

Here we study a wide range of real graphs, and we observe some surprising phenomena. First, most of these graphs densify over time with the number of edges growing superlinearly in the number of nodes. Second, the average distance between nodes often shrinks over time in contrast to the conventional wisdom that such distance parameters should increase slowly as a function of the number of nodes (like O(log n) or O(log(log n)).

Existing graph generation models do not exhibit these types of behavior even at a qualitative level. We provide a new graph generator, based on a forest fire spreading process that has a simple, intuitive justification, requires very few parameters (like the flammability of nodes), and produces graphs exhibiting the full range of properties observed both in prior work and in the present study. This material is based on work supported by the National Science Foundation under Grants No. IIS-0209107 SENSOR-0329549 EF-0331657IIS-0326322 IIS- 0534205, CCF-0325453, IIS-0329064, CNS-0403340, CCR-0122581; a David and Lucile Packard Foundation Fellowship; and also by the Pennsylvania Infrastructure Technology Alliance (PITA), a partnership of Carnegie Mellon, Lehigh University and the Commonwealth of Pennsylvania’s Department of Community and Economic Development (DCED). Additional funding was provided by a generous gift from Hewlett-Packard. J. Leskovec was partially supported by the Microsoft Research Graduate Fellowship. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation, or other funding parties. Author’s address: J. Leskovec, Machine Learning Department, School of Computer Science, Carnegie Mellon University, 5000 Forbes Avenue, 15213 Pittsburgh PA, USA; email: jure@ cs.cmu.edu.

Permission to make digital or hard copies part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or direct commercial advantage and that copies show this notice on the first page or initial screen of a display along with the full citation. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, to republish, to post on servers, to redistribute to lists, or to use any component of this work in other works requires prior specific per-mission and/or a fee. Perper-missions may be requested from the Publications Dept., ACM, Inc., 2 Penn Plaza, Suite 701, New York, NY 10121-0701 USA, fax +1 (212) 869-0481, or [email protected]. C

!2007 ACM 1556-4681/2007/03-ART2 $5.00. DOI 10.1145/1217299.1217301 http://doi.acm.org/ 10.1145/1217299.1217301

ACM Transactions on Knowledge Discovery from Data, Vol. 1, No. 1, Article 2, Publication date: March 2007.

23  

Microscopic  Graph  evolution  

DATA  SET:    

Temporal  graphs  per  day  

Flickr  

Delicious  

Yahoo!  Answers  

LinkedIn  

EMPIRICAL  RESULT  

1.  How  destination  chosen  

p_e(d)  ~  d  ;  p_e(a)  ~  constant  

2.  Locality    

3.  Lifetime  exponential,  inter-­‐edge  

created  according  to  PL  

MODEL  

1)  Arrival  of  nodes:  N(t)    

With  a  first  edge  following  degree  

2)  lifetime  exponentially  distributed  (λ)  

Sequence  of  time  gap  following  dist  (PL)  

Second  edge  closes  a  random  triangle  

 

ANALYSIS  

Produces  heavy-­‐tail  degree  dist.  

 

Numerical  results  show  accurate  

Clustering,  degree  dist,  distance  dist  

Leskovec,  J.,  Backstrom,  L.,  Kumar,  R.,  &  Tomkins,  A.  (2008).  Microscopic  

evolution  of  social  networks.  Proceeding  of  ACM  SIGKDD

 

(24)

Impact  of  degree  

−  Normalized  proba  of  

edge  with  destination  

degree  

(25)

Impact  of  age  

−  Normalized  proba  of  

edge  with  age  of  source  

(26)

Impact  of  distance  

−  Distance  distribution    

computed  before  edge,  

by  convention  h=0  

denotes  disconnectd  

−  Normalized  Proba  of  

edges  with  distance  

Main  findings:  triangle  

closing  edges  are  many!  

65%  Flickr,  50%  LinkedIn  

(27)

What  about  time    

−  Node’s  active  life  

Time  where  it  adds  edges  

exponential  distribution  

−  Time  gap  between  

two  edges  

Follows  Hybrid  Power-­‐

Law+Exp.  tail  

(28)

1.

Arrival  of  nodes  follows  function  N(t)    

o

Each  one  connects  with  an  edge  following  degree  

o

A  la  copying  model/preferential  attachment  

2.

Nodes  adds  triangle  closing  edges  

o

According  a  sequence  following  time  gaps  (α,β)  

o

For  a  “lifetime”  distributed  exponentially  (λ)  

o

Each  triangle  closing  edge  chosen  according  to  

random  2-­‐step  neighbor-­‐of-­‐neighbor  choices.  

28  

(29)

Numerical  Validation  

Clustering  coef.  

Degree  distrib.  

Distance  distrib.  

All  seems  to  

match  well  

(30)

Additional  slides:    

Other  papers  on  evolution  

(31)

Graph Evolution: Densification and

Shrinking Diameters

JURE LESKOVEC Carnegie Mellon University JON KLEINBERG Cornell University and

CHRISTOS FALOUTSOS Carnegie Mellon University

How do real graphs evolve over time? What are normal growth patterns in social, technological, and information networks? Many studies have discovered patterns in static graphs, identifying properties in a single snapshot of a large network or in a very small number of snapshots; these include heavy tails for in- and out-degree distributions, communities, small-world phenomena, and others. However, given the lack of information about network evolution over long periods, it has been hard to convert these findings into statements about trends over time.

Here we study a wide range of real graphs, and we observe some surprising phenomena. First, most of these graphs densify over time with the number of edges growing superlinearly in the number of nodes. Second, the average distance between nodes often shrinks over time in contrast to the conventional wisdom that such distance parameters should increase slowly as a function of the number of nodes (like O(log n) or O(log(log n)).

Existing graph generation models do not exhibit these types of behavior even at a qualitative level. We provide a new graph generator, based on a forest fire spreading process that has a simple, intuitive justification, requires very few parameters (like the flammability of nodes), and produces graphs exhibiting the full range of properties observed both in prior work and in the present study. This material is based on work supported by the National Science Foundation under Grants No. IIS-0209107 SENSOR-0329549 EF-0331657IIS-0326322 IIS- 0534205, CCF-0325453, IIS-0329064, CNS-0403340, CCR-0122581; a David and Lucile Packard Foundation Fellowship; and also by the Pennsylvania Infrastructure Technology Alliance (PITA), a partnership of Carnegie Mellon, Lehigh University and the Commonwealth of Pennsylvania’s Department of Community and Economic Development (DCED). Additional funding was provided by a generous gift from Hewlett-Packard. J. Leskovec was partially supported by the Microsoft Research Graduate Fellowship. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation, or other funding parties. Author’s address: J. Leskovec, Machine Learning Department, School of Computer Science, Carnegie Mellon University, 5000 Forbes Avenue, 15213 Pittsburgh PA, USA; email: jure@ cs.cmu.edu.

Permission to make digital or hard copies part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or direct commercial advantage and that copies show this notice on the first page or initial screen of a display along with the full citation. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, to republish, to post on servers, to redistribute to lists, or to use any component of this work in other works requires prior specific per-mission and/or a fee. Perper-missions may be requested from the Publications Dept., ACM, Inc., 2 Penn Plaza, Suite 701, New York, NY 10121-0701 USA, fax +1 (212) 869-0481, or [email protected]. C

!2007 ACM 1556-4681/2007/03-ART2 $5.00. DOI 10.1145/1217299.1217301 http://doi.acm.org/ 10.1145/1217299.1217301

ACM Transactions on Knowledge Discovery from Data, Vol. 1, No. 1, Article 2, Publication date: March 2007.

31  

Graph  evolution  

DATA  SET:    

Temporal  graphs  from  

Flickr  

Yahoo!  360  

EMPIRICAL  RESULT  

1.  Reciprocation  is  fast  

2.  Densification  +  shrinking  diameter  

after  a  certain  phase  

3.  Outside  Giant  Connected  Comp  (GCC)    

isolated  nodes  +  star  which  grows  

and  merge  to  GCC  

MODEL  

3  types  Passive-­‐Inviters-­‐Linkers  with  fixed  

ratio  of  population  

Edge  originates  in  L  or  I  (chosen  by  deg)  

Edge  from  I  ends  in  new  node  P  

Edge  from  L  ends  in  L  or  I  with  a  

trumping  factor  within  L  

ANALYSIS  

Numerical  results  show  that  it  

reproduces  size  distribution  of  isolated  

communities  accurately.  

Kumar,  R.,  Novak,  J.,  &  Tomkins,  A.  (2010).  Structure  and  evolution  of  

online  social  networks.  Link  Mining,  337–357.

 

(32)

Graph Evolution: Densification and

Shrinking Diameters

JURE LESKOVEC Carnegie Mellon University JON KLEINBERG Cornell University and

CHRISTOS FALOUTSOS Carnegie Mellon University

How do real graphs evolve over time? What are normal growth patterns in social, technological, and information networks? Many studies have discovered patterns in static graphs, identifying properties in a single snapshot of a large network or in a very small number of snapshots; these include heavy tails for in- and out-degree distributions, communities, small-world phenomena, and others. However, given the lack of information about network evolution over long periods, it has been hard to convert these findings into statements about trends over time.

Here we study a wide range of real graphs, and we observe some surprising phenomena. First, most of these graphs densify over time with the number of edges growing superlinearly in the number of nodes. Second, the average distance between nodes often shrinks over time in contrast to the conventional wisdom that such distance parameters should increase slowly as a function of the number of nodes (like O(log n) or O(log(log n)).

Existing graph generation models do not exhibit these types of behavior even at a qualitative level. We provide a new graph generator, based on a forest fire spreading process that has a simple, intuitive justification, requires very few parameters (like the flammability of nodes), and produces graphs exhibiting the full range of properties observed both in prior work and in the present study. This material is based on work supported by the National Science Foundation under Grants No. IIS-0209107 SENSOR-0329549 EF-0331657IIS-0326322 IIS- 0534205, CCF-0325453, IIS-0329064, CNS-0403340, CCR-0122581; a David and Lucile Packard Foundation Fellowship; and also by the Pennsylvania Infrastructure Technology Alliance (PITA), a partnership of Carnegie Mellon, Lehigh University and the Commonwealth of Pennsylvania’s Department of Community and Economic Development (DCED). Additional funding was provided by a generous gift from Hewlett-Packard. J. Leskovec was partially supported by the Microsoft Research Graduate Fellowship. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation, or other funding parties. Author’s address: J. Leskovec, Machine Learning Department, School of Computer Science, Carnegie Mellon University, 5000 Forbes Avenue, 15213 Pittsburgh PA, USA; email: jure@ cs.cmu.edu.

Permission to make digital or hard copies part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or direct commercial advantage and that copies show this notice on the first page or initial screen of a display along with the full citation. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, to republish, to post on servers, to redistribute to lists, or to use any component of this work in other works requires prior specific per-mission and/or a fee. Perper-missions may be requested from the Publications Dept., ACM, Inc., 2 Penn Plaza, Suite 701, New York, NY 10121-0701 USA, fax +1 (212) 869-0481, or [email protected]. C

!2007 ACM 1556-4681/2007/03-ART2 $5.00. DOI 10.1145/1217299.1217301 http://doi.acm.org/ 10.1145/1217299.1217301

ACM Transactions on Knowledge Discovery from Data, Vol. 1, No. 1, Article 2, Publication date: March 2007.

32  

Microscopic  Graph  evolution  (2)  

DATA  SET:    

Flickr  

EMPIRICAL  RESULT  

1.  Reciprocation  is  fast  

2.  Links  created/received  according  to  

out-­‐degree/in-­‐degree  

3.  Proximity  bias  

MODEL  

None  

 

ANALYSIS  

None  

Mislove,  A.,  Koppula,  H.,  Gummadi,  K.,  Druschel,  P.,  &  Bhattacharjee,  B.  

(2008).  Growth  of  the  flickr  social  network.  ACM  WOSN.

 

(33)

Graph Evolution: Densification and

Shrinking Diameters

JURE LESKOVEC Carnegie Mellon University JON KLEINBERG Cornell University and

CHRISTOS FALOUTSOS Carnegie Mellon University

How do real graphs evolve over time? What are normal growth patterns in social, technological, and information networks? Many studies have discovered patterns in static graphs, identifying properties in a single snapshot of a large network or in a very small number of snapshots; these include heavy tails for in- and out-degree distributions, communities, small-world phenomena, and others. However, given the lack of information about network evolution over long periods, it has been hard to convert these findings into statements about trends over time.

Here we study a wide range of real graphs, and we observe some surprising phenomena. First, most of these graphs densify over time with the number of edges growing superlinearly in the number of nodes. Second, the average distance between nodes often shrinks over time in contrast to the conventional wisdom that such distance parameters should increase slowly as a function of the number of nodes (like O(log n) or O(log(log n)).

Existing graph generation models do not exhibit these types of behavior even at a qualitative level. We provide a new graph generator, based on a forest fire spreading process that has a simple, intuitive justification, requires very few parameters (like the flammability of nodes), and produces graphs exhibiting the full range of properties observed both in prior work and in the present study. This material is based on work supported by the National Science Foundation under Grants No. IIS-0209107 SENSOR-0329549 EF-0331657IIS-0326322 IIS- 0534205, CCF-0325453, IIS-0329064, CNS-0403340, CCR-0122581; a David and Lucile Packard Foundation Fellowship; and also by the Pennsylvania Infrastructure Technology Alliance (PITA), a partnership of Carnegie Mellon, Lehigh University and the Commonwealth of Pennsylvania’s Department of Community and Economic Development (DCED). Additional funding was provided by a generous gift from Hewlett-Packard. J. Leskovec was partially supported by the Microsoft Research Graduate Fellowship. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation, or other funding parties. Author’s address: J. Leskovec, Machine Learning Department, School of Computer Science, Carnegie Mellon University, 5000 Forbes Avenue, 15213 Pittsburgh PA, USA; email: jure@ cs.cmu.edu.

Permission to make digital or hard copies part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or direct commercial advantage and that copies show this notice on the first page or initial screen of a display along with the full citation. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, to republish, to post on servers, to redistribute to lists, or to use any component of this work in other works requires prior specific per-mission and/or a fee. Perper-missions may be requested from the Publications Dept., ACM, Inc., 2 Penn Plaza, Suite 701, New York, NY 10121-0701 USA, fax +1 (212) 869-0481, or [email protected]. C

!2007 ACM 1556-4681/2007/03-ART2 $5.00. DOI 10.1145/1217299.1217301 http://doi.acm.org/ 10.1145/1217299.1217301

ACM Transactions on Knowledge Discovery from Data, Vol. 1, No. 1, Article 2, Publication date: March 2007.

33  

Microscopic  Graph  evolution  (3)  

DATA  SET:    

FriendFeed  

EMPIRICAL  RESULT  

1.  Old  nodes  receives  new  edges  in  

proportion  of  their  degree  

2.  Proximity  bias  

MODEL  

Each  nodes  picks  a  degree  first  

Then  pick  the  closest  node  with  given  

degree  

 

ANALYSIS  

Numerical  results  shows  it  creates  

proximity  bias  

Garg,  S.,  Gupta,  T.,  Carlsson,  N.,  &  Mahanti,  A.  (2009).  Evolution  of  an  

online  social  aggregation  network:  An  empirical  study.  Proc.  IMC

 

References

Related documents

Wusu, Onipede, Isiugo-Abanihe, Uche C.: Understanding Sexual Negotiation between Marital Partners: A Study of Ogu Families in Southwestern Nigeria.. Gender Inequalities and

Note, that in case of natural signals that are not sparse in their native domain (say time, wave- length, position or functions of any other independent variable), a proper

These applications include micro- machined optical fibre-top cantilever sensor for high temperature and pH measurement, ferrule-top measurement to monitor real-time

Montage Healthcare Solutions developed Montage Search and Analytics™, a unique tool for helping radiology leaders succeed in the Imaging 3.0™ era. Montage Search & Analytics™ is

For the case of changes in affordable housing policy in Johor is made based on the requirements of current legislation the provision of housing and the current state of the middle

Working as a Developer, you will first create the data infrastructure for your basic Website. In Sitecore, creating the data infrastructure means creating data templates,

Table 2 shows the descriptive statistics of the independent and dependent variable. The mean for firms’ Performance is 3.84 which shows much favorable SME Clusters’

Small firm internationalisation and business strategy: an exploratory study of ‘knowledge-intensive’ and ‘traditional manufacturing firms in the UK, International Small