• No results found

Quantifying human behaviour using online data. Suzy Moat Data Science Lab Behavioural Science, WBS

N/A
N/A
Protected

Academic year: 2021

Share "Quantifying human behaviour using online data. Suzy Moat Data Science Lab Behavioural Science, WBS"

Copied!
32
0
0

Loading.... (view fulltext now)

Full text

(1)

Suzy Moat

Data Science Lab

Behavioural Science, WBS [email protected]

Quantifying human behaviour

using online data

(2)

Data Science Lab

(3)

The advantage of looking forward

1

(4)

Future Orientation Index 2010

0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8

Future Orientation Index 2010

Suzy Moat & Tobias Preis Based on Preis, Moat, Stanley and Bishop (2012)

Ratio of Google searches for “2011” to searches for “2009” during 2010 for 45 countries

more Google searches for “2009” more Google searches for “2011”

0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8

Future Orientation Index 2012

Suzy Moat & Tobias Preis

Based on Preis, Moat, Stanley and Bishop (2012)

(5)

Richer countries look forward

Time with Weekly Granularity

Search Volume

0

5

10

2008

2009

2010

0

5

A

B

“2008”

“2007”

“2009”

“2010”

“2011”

“2009”

Future-Orientation Index

GDP / Capita [10

4

USD]

1

2

3

4

0.0

0.5

1.0

1.5

2.0

    Preis, Moat, Stanley & Bishop (2012) Featured by:

(6)

Photo: Perpetual Tourist

Anticipating market moves

(7)

Hypothetical strategy

week

t

Moat et al. (2013); Preis et al. (2013)

number of

(8)

week

t

t-1

t-2

t-3

Moat et al. (2013); Preis et al. (2013)

Hypothetical strategy

number of

(9)

Page views

decreased:

BUY

stock

in week t+1

week

t

t-1

t-2

t-3

Moat et al. (2013); Preis et al. (2013)

Hypothetical strategy

number of

(10)

Page views

decreased:

BUY

stock

in week t+1

Page views

increased:

SELL

stock

in week t+1

week

t

t-1

t-2

t-3

Moat et al. (2013); Preis et al. (2013)

Hypothetical strategy

number of

(11)

Wikipedia

: Dow Jones companies

Views data: signi

fi

cant di

erence

Moat, Curme, Avakian, Kenett, Stanley & Preis (2013) Featured by:

Return [Std. Dev. of Random Strategies]

Density 0.0 0.2 0.4 0.6 −2 0 2 Wikipedia Views DJIA Companies Wikipedia Edits DJIA Companies Random Strategy

(12)

Wikipedia

: Financial topics

Moat, Curme, Avakian, Kenett, Stanley & Preis (2013) Featured by:

Views data: signi

fi

cant di

erence

0.00 0.25 0.50 0.75 1.00 −2 0 2

Return [Std. Dev. of Random Strategies]

Density Wikipedia Views Financial Topics Wikipedia Edits Financial Topics Random Strategy

(13)

Wikipedia

: Actors and

fi

lmmakers?

Moat, Curme, Avakian, Kenett, Stanley & Preis (2013) Featured by: 0.0 0.1 0.2 0.3 0.4 −2 0 2

Return [Std. Dev. of Random Strategies]

Density

Wikipedia Views

Actors & Filmmakers

Random Strategy

(14)

Random strategy mean + 2 sds Random strategy mean + 1 sd return (random strategy sds)

0 1 2

-1

“debt”

“culture”

How

Google

keywords perform

(15)

Random strategy mean + 2 sds Random strategy mean + 1 sd return (random strategy sds)

0 1 2 -1

“debt”

“culture”

“stocks”

“credit”

“garden”

“train”

Preis, Moat & Stanley (2013)

(16)

# occurrences in FT

# hits on Google

Returns signi

fi

cantly

correlated with indicator

of

fi

nancial relevance

Financial relevance

Random strategy mean + 2 sds Random strategy mean + 1 sd return (random strategy sds)

0 1 2

-1

Preis, Moat & Stanley (2013)

(17)

debt money crisis internet technology money debt

Curme, Preis, Stanley & Moat (2014)

What is searched for

before falls?

(18)

55 groups of search

terms

Business and politics

most related

Curme, Preis,

Stanley & Moat (2014)

What is searched for before falls?

Cumulative Returns (%)

-100 0 100 200

Random Strategy Politics I Business

(19)

Photographers as sensors

(20)

Flickr

and tourist numbers

(21)

Seresinhe, Preis & Moat (under review)

Scenicness and wellbeing

(22)

Scenicness and wellbeing

A

D

Average percentage of greenspace 0 0.3 0.6 0.7 0.9 0.94 0.96 0.99

Average scenic rating

1 2.2 3.1 3.6 3.9 4.2 4.6 8 POOR HEALTH GREENSPACE SCENICNESS Average rates of poor health (SMR) 0 0.5 0.6 0.7 0.8 0.9 1.2 3.2 London Birmingham Manchester Newcastle Liverpool Sheffield

B

C

0.00 0.25 0.50 0.75 1.00

All areas Urban Suburban Rural

Proba

blity of the model

gi

ven the data (AICw)

Model

Scenicness only Greenspace only

Scenicness and Greenspace

(23)

Scenicness and wellbeing

Seresinhe, Preis & Moat (under review)

A

D

Average percentage of greenspace

0 0.3 0.6 0.7 0.9 0.94 0.96 0.99

Average scenic rating

1 2.2 3.1 3.6 3.9 4.2 4.6 8 POOR HEALTH GREENSPACE SCENICNESS Average rates of poor health (SMR) 0 0.5 0.6 0.7 0.8 0.9 1.2 3.2 London Birmingham Manchester Newcastle

Liverpool Sheffield

B

C

0.00 0.25 0.50 0.75 1.00

All areas Urban Suburban Rural

Proba

blity of the model

gi

ven the data (AICw)

Model

Scenicness only Greenspace only

(24)

Scenicness and wellbeing

Seresinhe, Preis & Moat (under review)

A

D

Average percentage of greenspace

0 0.3 0.6 0.7 0.9 0.94 0.96 0.99

Average scenic rating

1 2.2 3.1 3.6 3.9 4.2 4.6 8 POOR HEALTH GREENSPACE SCENICNESS Average rates of poor health (SMR) 0 0.5 0.6 0.7 0.8 0.9 1.2 3.2 London Birmingham Manchester Newcastle

Liverpool Sheffield

B

C

0.00 0.25 0.50 0.75 1.00

All areas Urban Suburban Rural

Proba

blity of the model

gi

ven the data (AICw)

Model

Scenicness only Greenspace only

Scenicness and Greenspace

People report

better health in

more scenic

locations

(25)

Scenicness and wellbeing

A

D

Average percentage of greenspace 0 0.3 0.6 0.7 0.9 0.94 0.96 0.99

Average scenic rating

1 2.2 3.1 3.6 3.9 4.2 4.6 8 POOR HEALTH GREENSPACE SCENICNESS Average rates of poor health (SMR) 0 0.5 0.6 0.7 0.8 0.9 1.2 3.2 London Birmingham Manchester Newcastle

Liverpool Sheffield

B

C

0.00 0.25 0.50 0.75 1.00

All areas Urban Suburban Rural

Proba

blity of the model

gi

ven the data (AICw)

Model

Scenicness only Greenspace only

Scenicness and Greenspace

(26)

Scenicness and wellbeing

A

D

Average percentage of greenspace 0 0.3 0.6 0.7 0.9 0.94 0.96 0.99

Average scenic rating

1 2.2 3.1 3.6 3.9 4.2 4.6 8 POOR HEALTH GREENSPACE SCENICNESS Average rates of poor health (SMR) 0 0.5 0.6 0.7 0.8 0.9 1.2 3.2 London Birmingham Manchester Newcastle

Liverpool Sheffield

B

C

0.00 0.25 0.50 0.75 1.00

All areas Urban Suburban Rural

Proba

blity of the model

gi

ven the data (AICw)

Model

Scenicness only Greenspace only

Scenicness and Greenspace

A

D

Average percentage of greenspace

0 0.3 0.6 0.7 0.9 0.94 0.96 0.99

Average scenic rating

1 2.2 3.1 3.6 3.9 4.2 4.6 8 POOR HEALTH GREENSPACE SCENICNESS Average rates of poor health (SMR) 0 0.5 0.6 0.7 0.8 0.9 1.2 3.2 London Birmingham Manchester Newcastle

Liverpool Sheffield

B

C

0.00 0.25 0.50 0.75 1.00

All areas Urban Suburban Rural

Proba

blity of the model

gi

ven the data (AICw)

Model

Scenicness only Greenspace only

Scenicness and Greenspace

(27)

Scenicness and wellbeing

A

D

Average percentage of greenspace 0 0.3 0.6 0.7 0.9 0.94 0.96 0.99

Average scenic rating

1 2.2 3.1 3.6 3.9 4.2 4.6 8

POOR HEALTH

GREENSPACE

SCENICNESS

Average rates of poor health (SMR) 0 0.5 0.6 0.7 0.8 0.9 1.2 3.2 London Birmingham Manchester Newcastle

Liverpool Sheffield

B

C

0.00 0.25 0.50 0.75 1.00

All areas Urban Suburban Rural

Proba

blity of the model

gi

ven the data (AICw)

Model

Scenicness only Greenspace only

Scenicness and Greenspace

(28)

Measuring

fl

u with

Google

(29)

Ginsberg et al., Nature 368, 1012 (2009)

Flu data

Google Trends estimate

 

(30)

Butler, Nature 494, 155 (2013)

“The press reports may have triggered many fl

(31)

● ● ●●●●●●●●● ●● ●●●●●●●●● ●●●●●●●●●●●●●●●●● ●●●● ●●● ● ●● ● ●● ●● ● ● ● ●● ● ● ● ● ● ● ● ●● ●●●●●●●●●●● ●●●●●●●●●●● ●●●●● ●●●●●●● ●● ●●● ●●●●●● ● ● ●● ●● ●●●●●●●●●●● ●●●●●●●●●●● ●●●●●● ●● ●● ● ● ● ● ● ●● ● ● ● ● ● ●●● ● ● ● ● ●●●●●●● ●●●●●●●●●●●●●●●● 2 4 6 2010 2011 2012 2013 Time [Weeks] In fl uenza − Lik e Illness [%] Predicted Value Observed Value 80% Prediction Interval 95% Prediction Interval ● Training Period Out-of-Sample Nowcast

Preis & Moat (2014)

Flu estimate errors signi

fi

cantly reduced

(32)

To what extent can Internet data

help us measure and even predict

human behaviour?

[email protected]

@suzymoat

computer science statistics physics mathematics

crime science

finance health

economics

References

Related documents

We study the sequence of empirical copula processes constructed from the triangular array of vectors of block maxima as the block size and the number of blocks tend to infinity. We

Degree Manager, Maynooth Campus Mr Christopher Reid, BA, Dip VEdT Degree Manager, Kilkenny Campus Studies (Honours – Level 8) degrees are designed specifically for such

In 1990, the NMAI framed its mission in the following adopted policy statement: The mission of the Smithsonian Institution’s National Museum of the American Indian is to affirm

The fact that the volatility is lower for the euro area should not be a surprise as this is an aggregate of national fiscal policies and it is difficult to think about coordinated

We see that in comparison with Cournot Stackelberg has a higher output level, lower market price and lower industry profit, although the Stackelberg leader is still

In order to address this limitation, the aim of the current study was to simultaneously examine the three key systems of the S-REF model, executive processes (emotional

reduction in the number of shoots almost equivalent to that imposed to P1 by thinnings. 4) Relatively to mortality, from the last thinning at 11 years of age, we observed in

• enhance Torque data structures with L&B specific information (server and job attributes). • add Torque-specific data messages (events), update