• No results found

Big Data and Large Scale Inference: II. Amr Ahmed Research at Google

N/A
N/A
Protected

Academic year: 2022

Share "Big Data and Large Scale Inference: II. Amr Ahmed Research at Google"

Copied!
142
0
0

Loading.... (view fulltext now)

Full text

(1)

Amr Ahmed

Research at Google

Big Data and

Large Scale Inference: II

(2)

Wrapping  up    

•  Distributed  inference  in  latent  variable  models  

–  Star  Synchroniza:on  

–  Delta  aggrega:on  

 

(3)

Wrapping  up  …  

φ θ

z  

w  

α   Prior  over  topic  

mixtures  

Topic-­‐word   distribu:on  

(4)

Wrapping  up  …  

φ θ

z  

w   α  

•  Global  variables  

–  Φ:  Topic  distribu:on  over   words  

•  Local  variables  

–  θ:  topic  mixing  vector   –  Z:  topic  indicator  

 

(5)

Wrapping  up  …  

z  

w  

•  Collapse  global  variables  

–  Φ  

•  Collapse  local  variables  

–  θ  

•  Couples  all  Zs  

•  Run  collapsed  sampler  

P (z  

di

= k |w

di

= w, z

di

) / (n

dk

+ ↵) n

kw

+ n

k

+ W

P (z di = k |w di = w, z di ) / (n dk + ↵) n kw +

n k + W

(6)

Wrapping  up  …  

z  

w  

P (z

di

= k |w

di

= w, z

di

) / (n

dk

+ ↵) n

kw

+ n

k

+ W

P (z di = k |w di = w, z di ) / (n dk + ↵) n kw + n k + W

Global  counts   (global  state)   Local  counts  

(local  state)   Global  State  

n kw

 

, n k

(7)

Distributed  Inference:  LDA  

θ

z  

w   α  

φ

(8)

Distributed  Inference:  LDA  

θ

z  

w   θ

z  

w  

α  

φ

(9)

Distributed  Inference:  LDA  

z  

w   z  

w  

Global  State  

n kw , n

 

k

(10)

Distributed  Inference:  LDA  

z  

w   z  

w  

Global  State  

n kw , n

 

k

Global  replica   Global  replica  

(11)

General  Architecture  

•  Star  synchroniza:on  

–  Works  when  variables  depend  on  each  other  via   aggregates  

•  Counts,  sums,  etc.  

–  When  state  objects  form  an  Abelian  group  

State  1   u1   u2   u3   State  2  

State  1   u2   u1   u3   State  2  

(12)

Mul:lingual  LDA  

•  Each  topic  has  a  distribu:on  over  words  

•  Fits  parallel  documents  

–  Example:  Wikipedia    

(13)

What    Is  next?  

•  Can  we  fit  any  model  only  with  those   asynchronous  primi:ves?  

–  No  

•  We  need  synchronous  opera:ons  

–  Parameter  op:miza:on  

•  EM  style  algorithm      

–  Non-­‐collapsed  global  variables  

(14)

The  Need  for  Synchronous  Processing  

φ θ

z  

w  

α   Prior  over  topic  

mixtures  

Topic-­‐word   distribu:on        

   What  if  we  need  to  op:mize  over    α?  

 

(15)

The  Need  for  Synchronous  Processing  

φ θ

z  

w   α  

•  E-­‐Step  

–  Run  asynchronous   collapsed  sampler  as   before  

•  M-­‐step  

–  Reach  a  barrier  

–  Collect  values  needed  to   op:mize  α  

–  One  machine  op:mizes  α   –   Broadcast  value  back  

 

(16)

Distributed  Sampling  Cycle  

Sample  Z  

  Sample  Z   Sample  Z   Sample  Z  

Barrier  

Write  counts   to     memcached  

Write  counts   to     memcached  

Write  counts   to     memcached  

Write  counts   to     memcached  

Collect  counts  

and  sample  Ω   Do  nothing   Do  nothing   Do  nothing  

Barrier  

Read  Ω from  

memcached Read  Ω from  

memcached   Read  Ω from  

memcached   Read  Ω from  

memcached  

Op:mize  α  

Requires  a  reduc:on  step  

Asynchronous  

Synchronous  

(17)

Distributed  Sampling  Cycle  

Sample  Z   Sample  Z   Sample  Z   Sample  Z  

Barrier  

Write  counts   Write  counts   Write  counts   Write  counts  

Collect  counts  

and  op:mize   Do  nothing   Do  nothing   Do  nothing  

Barrier  

Read  α Read  α   Read   α   Read  α  

(18)

Up  next  

•  Applica:on  

–  Mul:-­‐domain  user  personaliza:on  

–  Non-­‐parametric  models:  DPs,  temporal  DPs,  etc   –   Temporal  Modeling  of  user  interests  

–  Mul:-­‐task  learning  and  computa:onal  adver:sing  

 

(19)

Modeling  User  Interests  

User-­‐1   User-­‐2  

(20)

Mul:-­‐domain  Personaliza:on  

(21)

Computa:onal  Adver:sing:  

 Mul:task  learning  

(22)

Mul:-­‐Domain  

Personaliza:on    

(23)

Problem  

(24)

Mul:-­‐domain  Personaliza:on  

•  Intui:on  

–  We  observe  user  interac:on  with  news  and  movies   –  Can  we  predict  his  music  taste?  

 

•  Interac:on  defini:on  

–  A  bag  of  words  describing  objects  user  interacts  with  

in  a  given  domain  

(25)

Example  

User  Meta   Profile  

User  Music  

Profile   User  News  

Profile  

Personalized News

Personalized

Music

(26)

Example  

User  Meta   Profile  

User  Music  

Profile   User  News  

Profile   User  Movie  

Profile  

(27)

The  Model  

↵ ✓ z w

N   User  u’s  interac:on  with  domain  d  

'

d

A  user’s  interac6on  with  a  domain  is  a  bag  of  words.  

A  topic  is  a  mixture  of  words.  

x

u

Each  user  has  a  meta-­‐profile:   x

u

2 R k

p

Each  domain  has  a  latent  matrix:   d 2 R k ⇥t

d

User’s  prior  interest  in  a  domain  is  

= log(1 + exp(⇥ d x u ))

Slide  credit  Yucheng  Low  

(28)

The  Model  

User  Meta   Profile  

 

User  Music  

Profile   User  News  

Profile   User  Movie  

Profile  

x

u

2 R k

music

news

movie

lgt( music x u ) lgt( news x u ) lgt( movie x u )

lgt(x) = log(1 + exp(x))

Slide  credit  Yucheng  Low  

(29)

↵ ✓ w z

w 2 Wu,d

Music  

News  

Movies  

↵ ✓ w z

w 2 Wu,d

x 1 x 2 x 3

1

2

3

Topic-­‐>word  table  

Topic-­‐>word  table  

Topic-­‐>word  table   Slide  credit  Yucheng  Low  

(30)

Inference  and  Learning  

↵ ✓ z w

N   User  u’s  interac:on  with  domain  p  

p

x

u

E-­‐Step:  sample  local  variables    

'

d

Collapse  and   synchronize   M-­‐Step  

Op:mize  via  barrier  

(31)

Distributed  Sampling  Cycle  

Sample  Z,  x  

For  users   Sample  Z,x  

For  users   Sample  Z,x  

For  users   Sample  Z,x  

For  users  

Barrier  

Write  counts   to     memcached  

Write  counts   to     memcached  

Write  counts   to     memcached  

Write  counts   to     memcached  

Collect  counts  

and  sample  Ω   Do  nothing   Do  nothing   Do  nothing  

Barrier  

Read  Ω from  

memcached Read  Ω from  

memcached   Read  Ω from  

memcached   Read  Ω from  

memcached  

Op:mize    λ  

Requires  a  reduc:on  step  

(32)

Distributed  Sampling  Cycle  

Sample  Z,  x  

For  users   Sample  Z,  x  

For  users   Sample  Z,  x  

For  users   Sample  Z,  x   For  users  

Barrier  

Write    

sta:s:cs   Write  

sta:s:cs   Write  

sta:s:cs   Write  

sta:s:cs  

Collect  and      

op:mize   Do  nothing   Do  nothing   Do  nothing  

Barrier  

Read Read   Read   Read  

(33)

Results  

•  2  domain  dataset.    

     Frontpage  and  News  clicks  of  5.6  million  users.  

     Frontpage/News:  Ar:cle  text  for  each  click.  

•   Measure  gain  rela:ve  to  independent  models  on  

each  domain  

(34)

Results  

Front Page News

0 5 10 15 20 25 30

% Gain in Cosine Similarity

25 Features

50 Features

(35)

Analysis  

sandra,  oscar,  oscars,  red,  carpet,  bullock,   golden,  gown,  bullocks,  nominee,  bestactress,   sparkles,  stunning,    

       

vienna,  bachelor,  jake,  pavelka,  giraldi,  finale,   show,  stars,  dancing,  love,  season,  :me,  abc,    

bacteria,  fight,  super,  struggling,  developed,   doctors,  resistant,  lethal,  virtually,  drugs,   an:bio:c,  compe:tors,  chad,    

         

film,  movie,  movies,  films,  director,  story,  

avatar,  james,  :me,  hollywood,  big,  make,  hes,   star,    

Celebrity  

Entertainment  

Science  

Science  Fic6on  

(36)

Non-­‐parametric  Mdoels  

(37)

Non-­‐parametric  Models  

•  How  many  clusters?  

•  How  many  topics?  

•  Dimensionality  of  the  model  grows  with  the  data  

•  Perfect  for  big  data  sedng  where  data  keeps  

arriving  and  we  want  to  refine  model  structure  

(38)

Parametric  Mixture  Models  

c

i

w

i

N

π

φ

Κ

φ

k

  φ

1

 

-­‐ For  Document  wi    

-­‐   Sample  ci  ~  Mul:(π)   -­‐   Sample  wi  ~  Mult(φci)  

wi  

∝π1   ∝ πj   ∝ πk  

Genera:ve  Process  

(39)

Infinite  Mixture  Models  

•  Allows  the  number  of  mixtures  to  grow  with  the  data  

•  They  are  called  non-­‐parametric  models  

–  Means  the  number  of  effec:ve  parameters  grow  with  data   –  S:ll  have  hyper-­‐parameters  that  control  the  rate  of  growth  

•   α:  how  fast  a  new  cluster/mixture  is  born?  

•  G

0

:    Prior  over  mixture  component  parameters  

 

•  We  can  arrive  at  them  from  many  angle  

–  Chinese  Restaurant  Process  Mixtures   –  S:ck  Breaking  construc:on  

–  Random  Measure  View  

(40)

Chinese  Restaurant  Process  Mixture  

-­‐ For  data  point  xi    

-­‐   Choose  table  j  ∝  mj        and    Sample  xi  ~  f(φj)   -­‐   Choose  a  new  table    K+1  ∝  α    

-­‐   Sample  φK+1  ~  G0      and  Sample  xi  ~  f(φK+1)   Genera:ve  Process  

φ

2

 

φ

1

  φ

3

 

α 6+

2 6+α

3 6+α

1

α α 6+

[Blackwell,  D.  and  MacQueen  1973]  

(41)

S:ck  Breaking  Construc:on  

c

i

x

i N

π

φ

Κ

α

c

i

x

i N

π

φ

∞ G0  

Parametric  Mixture   Non-­‐  Parametric  Mixture  

X

k

k

k

Mixture  distribu:on  =    

?  

[Sethuraman,  1994]  

(42)

S:ck  Breaking  Construc:on  

α

c

i

x

i N

π

φ

∞ G0  

X 1 k=1

k

k

Mixture  distribu:on  =    

•  φ

k

   ~  G

0  

is  called  atom  loca:on  

•  π

k

 is  called  atom  weight  

–  π    ~  GEM(α)  

 

π1  

π2  

π3  

φ1   φ2   φ3  

(43)

S:ck  Breaking  Construc:on  

α

c

i

x

i N

π

φ

∞ G0  

X 1 k=1

k

k

Mixture  distribu:on  =    

•  φ

k

   ~  G

0  

is  called  atom  loca:on  

•  π

k

 is  called  atom  weight  

–  π    ~  GEM(α)  

 

v

k

⇠ Beta(1, ↵)

k

= v

k

k 1

Y

i=1

(1 v

i

)

v

1

= ⇡

1

(1 v

1

) v

1

(44)

S:ck  Breaking  Construc:on  

α

c

i

x

i N

π

φ

∞ G0  

X 1 k=1

k

k

Mixture  distribu:on  =    

•  φ

k

   ~  G

0  

is  called  atom  loca:on  

•  π

k

 is  called  atom  weight  

–  π    ~  GEM(α)  

 

v

k

⇠ Beta(1, ↵)

k

= v

k

k 1

Y

i=1

(1 v

i

)

v

1

= ⇡

1

(1 v

1

) v

1

(45)

S:ck  Breaking  Construc:on  

α

c

i

x

i N

π

φ

∞ G0  

X 1 k=1

k

k

Mixture  distribu:on  =    

•  φ

k

   ~  G

0  

is  called  atom  loca:on  

•  π

k

 is  called  atom  weight  

–  π    ~  GEM(α)  

 

v

k

⇠ Beta(1, ↵)

k

= v

k

k 1

Y

i=1

(1 v

i

)

v

1

= ⇡

1

(1 v

1

)

(1 v

1

)(1 v

2

)

v

1

v

2

(1 v

1

)

(46)

Dirichlet  Process  Mixture  View  

•  G  is  always  discreet  (we  have  just  constructed  it)  

•  G  =    

•  G  is  a  random  measure,  why?  

 

G ⇠ DP (↵, G

0

)

i

⇠ G x

i

⇠ f(✓

i

)

α

θ

i

x

i N

G   G0  

X1 k=1

k k

E[G] = G

0

V ar[G] = ↵

[Ferguson,  1973]  

(47)

Infinite  Mixture  Models  

•  Chinese  Restaurant  Process  

–  Useful  for  construc6ng  Gibbs  sampling  algorithms   [Neal  1998]  

 

•  S:ck-­‐breaking  construc:on  

–  Useful  for  deriving  Varia:onal  inference  algorithms.  

[Blei  and  Jordan  2004]  

 

•  DP  view  

–  Useful  for  conceptual  understanding  (esp.  later  in  the  

tutorial)    [Teh  et.  al.,  2006]    

(48)

Chinese  Restaurant  Process  Mixture  

-­‐ For  data  point  xi    

-­‐   Choose  table  j  ∝  mj        and    Sample  xi  ~  f(φj)   -­‐   Choose  a  new  table    K+1  ∝  α    

-­‐   Sample  φK+1  ~  G0      and  Sample  xi  ~  f(φK+1)   Genera:ve  Process  

φ

2

 

φ

1

  φ

3

 

α 6+

2 6+α

3 6+α

1

α α 6+

(49)

Inference  Via  Gibbs  Sampling  

•  Construct  a  Markov  chain  over  cluster  indicators   for  each  data  point  c i   and  cluster  parameters  φ

P (c

i

|c

i

, ↵, G

0

) /

⇢ m

k

p(x

i

|

k

) for existing k

↵ R

p(x

i

| )p( |G

0

)d for a new cluster

P ( k |c, G 0 ) / Y

c

i

=k

p(x i | )P ( |G 0 )

(50)

Inference  Via  Collapsed  Gibbs  Sampling  

•  If  G 0  is  conjugate  to  the  likelihood  func:on  p(.|φ)    then  we  can  integrate  it  out.  

P (c

i

|c

i

, ↵, G

0

) / 8 >

<

> :

m

k

p ⇣

x

i

|{x

j

}

cji=k

for existing k

↵ R

p(x

i

| )p( |G

0

)d for a new cluster

(51)

Dynamic  Models  

(52)

Dynamic  Non-­‐Parametric  Models  

•  Evolve  atom  weights  

•  Evolve  atom  loca:ons  

•  Many  models  

–  What  assump:ons  they  are  making    

–  More  expressive  the  harder  the  inference   –  Examples  

•  RCRP  (Ahmed  and  Xing,  2008)  

•  Time-­‐Varying  Polya-­‐Urn  (Caron  et.  Al.  2007)  

•  Dependent  DPs  and  Ordered-­‐DPs  (Griffin  et.  Al.  2004)  

•  Normalized  Gamma  Processes  (Rao  and  Teh  2009)  

 

(53)

Recurrent  CRP  (RCRP)   [Ahmed  and  Xing  2008]  

•  Adapts  the  number  of  mixture  components  over  :me  

–  Mixture  components  can  die  out  

–  New  mixture  components  are  born  at  any  :me  

–  Retained  mixture  components  parameters  evolve  according  

to  a  Markovian  dynamics  

(54)

The  Recurrent  Chinese  Restaurant  Process  

φ

2,1

 

φ

1,1

  φ

3,1

 

T=1  

Dish  eaten  at  table  3  at  :me  epoch  1  

OR  the  parameters  of  cluster  3  at  :me    epoch  1  

-­‐ Customers  at  :me  T=1  are  seated  as  before:  

-­‐   Choose  table  j  ∝  mj,1        and    Sample  xi  ~  f(φj,1)   -­‐   Choose  a  new  table    K+1  ∝  α    

-­‐   Sample  φK+1,1  ~  G0      and  Sample  xi  ~  f(φK+1,1)   Genera:ve  Process  

(55)

φ

2,1

 

φ

1,1

  φ

3,1

 

T=1  

T=2  

φ

2,1

 

φ

1,1

  φ

3,1

 

m'1,1=2   m'2,1=3   m'3,1=1  

The  Recurrent  Chinese  Restaurant  Process  

(56)

φ

2,1

 

φ

1,1

  φ

3,1

 

T=1  

T=2  

φ

2,1

 

φ

1,1

  φ

3,1

 

m'1,1=2   m'2,1=3   m'3,1=1  

(57)

φ

2,1

 

φ

1,1

  φ

3,1

 

T=1  

T=2  

φ

2,1

 

φ

1,1

  φ

3,1

 

α 6+

2 6+α

3 6+α

1

α α 6+

m'1,1=2   m'2,1=3   m'3,1=1  

(58)

φ

2,1

 

φ

1,1

  φ

3,1

 

T=1  

T=2  

φ

2,1

 

φ

1,1

  φ

3,1

 

α 6+

2

m'1,1=2   m'2,1=3   m'3,1=1  

(59)

φ

2,1

 

φ

1,1

  φ

3,1

 

T=1  

T=2  

φ

2,1

 

φ

1,2

  φ

3,1

 

Sample  φ1,2  ~  P(.| φ1,1)      

m'1,1=2   m'2,1=3   m'3,1=1  

(60)

φ

2,1

 

φ

1,1

  φ

3,1

 

T=1  

T=2  

φ

2,1

 

φ

1,2

  φ

3,1

 

α + +

+ 1 6

2

1 6+1+α

3 6+1+α

1

α α

+ 6+1

And  so  on  ……  

m'1,1=2   m'2,1=3   m'3,1=1  

(61)

φ

2,1

 

φ

1,1

  φ

3,1

 

T=1  

T=2  

φ

2,2

 

φ

1,2

  φ

3,1

 

At  the  end  of  epoch  2  

φ

4,2

 

Newly  born  cluster   Died  out  cluster  

m'1,1=2   m'2,1=3   m'3,1=1  

(62)

φ

2,1

 

φ

1,1

  φ

3,1

 

T=1  

T=2  

φ

2,2

 

φ

1,2

  φ

3,1

 

N1,1=2   N2,1=3   m'3,1=1  

φ

4,2

 

T=3  

φ

2,2

 

φ

1,2

  φ

4,2

 

m'4,2=1   m'2,2=2  

m'1,2=2  

(63)

Recurrent  Chinese  Restaurant  Process  

•  Can  be  extended  to  model  higher-­‐order   dependencies  

•  Can  decay  dependencies  over  :me  

–  Pseudo-­‐counts  for  table  k  at  :me  t  is  

History  size  

Decay  factory   Number  of  customers  sidng     at    table  K  at  :me  epoch  t-­‐h  

 

=   -   -  

⎟

 

⎠

 

⎞

 

⎜

 

⎝

 

⎛

 

H  

h   k  t   h   h  

m

 

e

 

1   ,   ρ  

(64)

φ2,1  

φ1,1   φ3,1  

T=1  

T=2  

φ

2,2

 

φ

1,2

  φ

3,1

 

m'1,1=2   m'2,1=3   m'3,1=1  

φ

4,2

 

T=3  

φ

2,2

 

φ

1,2

  φ

4,2

 

 

=  

-   -  

⎟

 

⎠

 

⎞

 

⎜

 

⎝

 

⎛

 

H  

h   k  t   h   h  

m

 

e

 

1   ,   ρ  

m'2,3  

m'2,3  =  

(65)

Tracking  Users  Interest  

(66)

Characterizing  User  Interests  

•  Short  term  vs  long-­‐term  

Jan   April   July   Oct  

Music  

Housing  

Furniture  

Buying  a  car  

Travel  plans  

(67)

Characterizing  User  Interests  

•  Short  term  vs  long-­‐term  

•  Latent  

Jan   April   July   Oct  

millage  

fast  

mortgage   Gaga  

seafood  

Barcelona  

used  

(68)

Problem  formula:on  

Input  

•  Queries  issued  by  the  user  or  tags  of  watched  content  

•  Snippet  of  page  examined  by  user  

•  Time  stamp  of  each  ac:on  (day  resolu:on)    

Output  

•     Users’  daily  distribu:on  over  interests  

•     Dynamic  interest  representa:on  

•     Online  and  scalable  inference  

•     Language  independent  

Flight   London   Hotel   weather    

School   Supplies   Loan   semester   classes  

registra:on   housing   rent  

(69)

Problem  formula:on  

Flight   London   Hotel   weather    

Travel  

Back   To  school  

finance   School  

Supplies   Loan   semester   classes  

registra:on   housing   rent   Input  

•  Queries  issued  by  the  user  or  tags  of  watched  content  

•  Snippet  of  page  examined  by  user  

•  Time  stamp  of  each  ac:on  (day  resolu:on)    

Output  

•     Users’  daily  distribu:on  over  interests  

•     Dynamic  interest  representa:on  

•     Online  and  scalable  inference  

•     Language  independent  

(70)

Problem  formula:on  

Flight   London   Hotel   weather    

Travel  

Back   To  school  

finance   School  

Supplies   Loan   semester   classes  

registra:on   housing   rent  

When  to  show  a  financing  ad?  

 

(71)

Problem  formula:on  

Flight   London   Hotel   weather    

Travel  

Back   To  school  

finance   School  

Supplies   Loan   semester   classes  

registra:on   housing   rent  

When  to  show  a  financing  ad?  

 

(72)

Problem  formula:on  

Flight   London   Hotel   weather    

Travel  

Back   To  school  

finance   School  

Supplies   Loan   semester   classes  

registra:on   housing   rent  

When  to  show  a  financing  ad?  

 

(73)

Problem  formula:on  

Flight   London   Hotel   weather    

Travel  

Back   To  school  

finance   School  

Supplies   Loan   semester   classes  

registra:on   housing   rent  

When  to  show  a  hotel  ad?  

 

(74)

Problem  formula:on  

Flight   London   Hotel   weather    

Travel  

Back   To  school  

finance   School  

Supplies   Loan   semester   classes  

registra:on   housing   rent  

When  to  show  a  hotel  ad?  

 

(75)

Problem  formula:on  

Flight   London   Hotel   weather    

Travel  

Back   To  school  

finance   School  

Supplies   Loan   semester   classes  

registra:on   housing   rent   Input  

•  Queries  issued  by  the  user  or  tags  of  watched  content  

•  Snippet  of  page  examined  by  user  

•  Time  stamp  of  each  ac:on  (day  resolu:on)    

Output  

•     Users’  daily  distribu:on  over  interests  

•     Dynamic  interest  representa:on  

•     Online  and  scalable  inference  

•     Language  independent  

(76)

Mixed-­‐Membership  Formula:on  

job     Career   Business   Assistant  

Hiring   Part-­‐:me   Recep:onist   Car    

Blue   Book   Kelley   Prices   Small   Speed  

large    

Bank   Online  

Credit   Card   debt   porzolio  

Finance   Chase      

Recipe   Chocolate  

Pizza   Food   Chicken  

Milk   Bu{er   Powder  

   

Diet   Cars   Job   Finance  

Objects  

Mixtures   Degree  of   membership  

Job  Hiring       speed  price    part-­‐6me  Camry  

Career  opening     bonus  package  

 

 

card    diet  calories   loan  recipe  milk  

Weight  lb  kg    

(77)

In  Graphical  Nota:on  

(78)

In  Polya-­‐Urn  Representa:on  

•  Collapse  mul:nomial  variables:  

•  Fixed-­‐dimensional  Hierarchal  Polya-­‐Urn   representa:on  

–  Chinese  restaurant  franchise    

 

x  

x  

(79)

   

   

Food  Chicken  

   

job     Career   Business   Assistant   Hiring   Part-­‐:me  

Recep:o nist   Car    

Blue   Book   Kelley   Prices   Small   Speed   large  

 

Bank   Online   Credit   Card   debt   porzolio  

Finance   Chase      

Recipe   Chocolate  

Pizza   Food   Chicken  

Milk   Bu{er   Powder  

   

       

Car  speed  offer   camry  accord  career  

 

Global  topics   trends

Topic    

word-­‐distribu:ons

User-­‐specific  topics  trends   (mixing-­‐vector)

User  interac:ons:  queries,  

keyword  from  pages  viewed

(80)

   

 Food  Chicken  

 

       

Car  speed  offer   camry  accord  career  

 

•   For  each  user  interac:on  

•     Choose  an  intent  from  local  distribu:on  

•  Sample  word  from  the  topic’s  word-­‐distribu:on    

• Choose  a  new  intent    ∝  λ    

•  Sample  a  new  intent  from  the  global  distribu:on  

•   Sample  word  from  the  new  topic  word-­‐

distribu:on     Genera:ve  Process  

………  

job     Career   Business   Assistant   Hiring   Part-­‐:me  

Recep:o nist   Car    

Blue   Book   Kelley   Prices   Small   Speed   large  

 

Bank   Online   Credit   Card   debt   porzolio  

Finance   Chase      

Recipe   Chocolate  

Pizza   Food   Chicken  

Milk   Bu{er   Powder  

   

(81)

   

 Food  Chicken  

pizza  

       

Car  speed  offer   camry  accord  career  

 

•   For  each  user  interac:on  

•     Choose  an  intent  from  local  distribu:on  

•  Sample  word  from  the  topic’s  word-­‐distribu:on    

• Choose  a  new  intent    ∝  λ    

•  Sample  a  new  intent  from  the  global  distribu:on  

•   Sample  word  from  the  new  topic  word-­‐

distribu:on     Genera:ve  Process  

………  

job     Career   Business   Assistant   Hiring   Part-­‐:me  

Recep:o nist   Car    

Blue   Book   Kelley   Prices   Small   Speed   large  

 

Bank   Online   Credit   Card   debt   porzolio  

Finance   Chase      

Recipe   Chocolate  

Pizza   Food   Chicken  

Milk   Bu{er   Powder  

   

(82)

   

 Food  Chicken  

pizza  

       

Car  speed  offer   camry  accord  career  

 

•   For  each  user  interac:on  

•     Choose  an  intent  from  local  distribu:on  

•  Sample  word  from  the  topic’s  word-­‐distribu:on    

• Choose  a  new  intent    ∝  λ    

•  Sample  a  new  intent  from  the  global  distribu:on  

•   Sample  word  from  the  new  topic  word-­‐

distribu:on     Genera:ve  Process  

………  

job     Career   Business   Assistant   Hiring   Part-­‐:me  

Recep:o nist   Car    

Blue   Book   Kelley   Prices   Small   Speed   large  

 

Bank   Online   Credit   Card   debt   porzolio  

Finance   Chase      

Recipe   Chocolate  

Pizza   Food   Chicken  

Milk   Bu{er   Powder  

   

(83)

   

 Food  Chicken  

pizza    hiring  

       

Car  speed  offer   camry  accord  career  

 

•   For  each  user  interac:on  

•   Choose  an  intent  from  local  distribu:on  

•  Sample  word  from  the  topic’s  word-­‐distribu:on    

• Choose  a  new  intent    ∝  λ    

•  Sample  a  new  intent  from  the  global  distribu:on  

•   Sample  word  from  the  new  topic  word-­‐

distribu:on     Genera:ve  Process  

………  

job     Career   Business   Assistant   Hiring   Part-­‐:me  

Recep:o nist   Car    

Blue   Book   Kelley   Prices   Small   Speed   large  

 

Bank   Online   Credit   Card   debt   porzolio  

Finance   Chase      

Recipe   Chocolate  

Pizza   Food   Chicken  

Milk   Bu{er   Powder  

   

(84)

   

 Food  Chicken  

pizza    millage  

       

Car  speed  offer   camry  accord  career  

 

•   For  each  user  interac:on  

•     Choose  an  intent  from  local  distribu:on  

•  Sample  word  from  topic’s  word-­‐distribu:on    

• Choose  a  new  intent    ∝  λ    

•  Sample  a  new  intent  from  the  global  distribu:on  

•   Sample  from  word  the  new  topic  word-­‐

distribu:on     Genera:ve  Process  

job     Career   Business   Assistant   Hiring   Part-­‐:me  

Recep:o nist   Car    

Blue   Book   Kelley   Prices   Small   Speed   large  

 

Bank   Online   Credit   Card   debt   porzolio  

Finance   Chase      

Recipe   Chocolate  

Pizza   Food   Chicken  

Milk   Bu{er   Powder  

   

(85)

   

 Food  Chicken  

pizza    millage  

       

Car  speed  offer   camry  accord  career  

 

job     Career   Business   Assistant   Hiring   Part-­‐:me  

Recep:o nist   Car    

Blue   Book   Kelley   Prices   Small   Speed   large  

 

Bank   Online   Credit   Card   debt   porzolio  

Finance   Chase      

Recipe   Chocolate  

Pizza   Food   Chicken  

Milk   Bu{er   Powder  

   

Problems

-­‐   Sta:c  Model  

-­‐   Does  not  evolve  user’s  interests  

-­‐   Does  not  evolve  the  global  trend  of  interests  

-­‐   Does  not  evolve  interest’s  distribu:on  over  terms    

(86)

   

 Food  Chicken  

pizza    millage  

       

Car  speed  offer   camry  accord  career  

 

job     Career   Business   Assistant   Hiring   Part-­‐:me  

Recep:o nist   Car    

Blue   Book   Kelley   Prices   Small   Speed   large  

 

Bank   Online   Credit   Card   debt   porzolio  

Finance   Chase      

Recipe   Chocolate  

Pizza   Food   Chicken  

Milk   Bu{er   Powder  

   

At  6me  t   At  6me  t+1  

Build  a  dynamic  model   Connect  each  level    

using  a  RCRP  

(87)

     

At  6me  t   At  6me  t+1  

     

At  6me  t+2   At  6me  t+3  

     

User  1   process  

User  2   process  

User  3   process  

Global  

process   m   m'  

n   n'  

     

     

     

Which  :me  kernel  to    

use  at  each  level?  

(88)

Pseudo  counts

                     =                  *      

Decay  factor

   

 Food  Chicken  

pizza    millage  

       

Car  speed  offer   camry  accord  career  

 

At  6me  t  

Observa:on  1

-­‐ Popular  topics  at    :me  t  are  likely  to  be  popular  at  :me  t+1  

-  φk,t+1  is  likely  to  smoothly  evolve  from     φk,t    

At  6me  t+1  

job     Career   Business   Assistant   Hiring   Part-­‐:me  

Recep:o nist   Car    

Blue   Book   Kelley   Prices   Small   Speed   large  

 

Bank   Online   Credit   Card   debt   porzolio  

Finance   Chase      

Recipe   Chocolate  

Pizza   Food   Chicken  

Milk   Bu{er   Powder  

   

(89)

   

 Food  Chicken  

pizza    millage  

       

Car  speed  offer   camry  accord  career  

 

At  6me  t   At  6me  t+1  

job     Career   Business   Assistant   Hiring   Part-­‐:me  

Recep:o nist   Car    

Blue   Book   Kelley   Prices   Small   Speed   large  

 

Bank   Online   Credit   Card   debt   porzolio  

Finance   Chase      

Recipe   Chocolate  

Pizza   Food   Chicken  

Milk   Bu{er   Powder  

   

Observa:on  1

-­‐ Popular  topics  at    :me  t  are  likely  to  be  popular  at  :me  t+1  

-  φk,t+1  is  likely  to  smoothly  evolve  from     φk,t    

    Car   Al6ma   Accord   Book   Kelley   Prices   Small   Speed  

   

φk,t+1      ∼ Dir(βk,t+1)   φk,t  

Car     Blue   Book   Kelley   Prices   Small   Speed   large  

 

Intui:on

Captures  current  trend  of   the  car  industry     (new  release  for  e.g.)  

~  

(90)

   

 Food  Chicken  

pizza    millage  

       

Car  speed  offer   camry  accord  career  

 

At  6me  t   At  6me  t+1  

     

Observa:on  2

-­‐   User  prior  at  :me  t+1  is  a  mixture  of  the  user  short  and   long  term  interest  

How  do  we  get  a  prior   that  captures  both  long  

and  short  term   interest?

job     Career   Business   Assistant   Hiring   Part-­‐:me  

Recep:o nist   Car    

Al:ma   Accord   Blue   Book   Kelley   Prices   Small   Speed  

   

Bank   Online   Credit   Card   debt   porzolio  

Finance   Chase      

Recipe   Chocolate  

Pizza   Food   Chicken  

Milk   Bu{er   Powder  

   

(91)

All  

week  

job     Career   Business   Assistant  

Hiring   Part-­‐:me   Recep:onist   Car  

Blue   Book   Kelley   Prices   Small   Speed  

large  

Bank   Online  

Credit   Card   debt   porzolio  

Finance   Chase      

Recipe   Chocolate  

Pizza   Food   Chicken  

Milk   Bu{er   Powder  

   

month  

Time                  t                                  t+1                  

Food   Chicken   pizza  

recipe   job   hiring  

Part-­‐:me   Opening   salary  

food   chicken   Pizza   millage   Kelly  

recipe   cuisine  

Diet   Cars   Job   Finance  

Prior  for  user   ac:ons  at  :me  t   μ  

μ2   μ3  

Long-­‐term

short-­‐term

References

Related documents

In order to de fi ne chronic pain, we follow the de fi nition of the International Association of Pain (IASP), which de- fi nes pain as “ an unpleasant sensory and emotional

The spatial utility consistent food bundles from 2005 are valued with their own 2009 survey prices. For each spatial domains, the food shares of the poor are also constant between

DOJ and CV’s 2012–13 successful funding submission for additional prison beds included forecasts that the male prisoner population will grow by 44.7 per cent between June 2011

In a sense, this objective analysis is a project management method, as it allows the organization and restatement of why the project is being proposed (first level

In this paper we model the importance that soci- ety ascribes to each user and each communication instance within an online community, depending on sentiment content and topic

• Getting insight in the choices which have to be made to test within a short time frame in the complex, dynamic and diverse mobile.. environment in which the app has to

The research undertaken studies the relationship between the kind of opinion issued by the auditor in his/her annual report and the change of auditor, in terms of his/her

Since treatment plants vary greatly as to the types, complexities and arrangement of treatment processes used, precise sampling locations must be selected on a case-by-case