• No results found

WillettK ISTIBenchmarking EdinUni OCT2014

N/A
N/A
Protected

Academic year: 2020

Share "WillettK ISTIBenchmarking EdinUni OCT2014"

Copied!
51
0
0

Loading.... (view fulltext now)

Full text

(1)

The International Surface Temperature

The International Surface Temperature

Initiative and Benchmarking for

Initiative and Benchmarking for

Homogenisation Algorithms

Homogenisation Algorithms

Uni. Edinburgh, October 2014

Dr Kate Willett

(Met Office Hadley Centre, UK)

(2)

Talk Outline

Talk Outline

The ISTI – facilitating robust climate

The ISTI – facilitating robust climate

analysis

analysis

Our inhomogeneous observing system

Our inhomogeneous observing system

Benchmarking Basics

Benchmarking Basics

Creating a 'clean' synthetic world

Creating a 'clean' synthetic world

Creating a set of dirty/error filled worlds

Creating a set of dirty/error filled worlds

Assessing homogenisation algorithm skill

Assessing homogenisation algorithm skill

against the benchmarks

against the benchmarks

Where are we now...

Where are we now...

(3)

The ISTI

The ISTI

Facilitation of

Facilitation of

Robust Climate

Robust Climate

Analysis

(4)

The real world observing

system is not perfect …

(5)

Its more like these …

Many more examples on www.surfacestations.org

(6)

Inhomogeneities: annual mean minimum

temperature at Reno, Nevada, USA

(7)

Underlying these are two absolutely

fundamental issues …

A lack of adequate documentation

(

metadata

) of the ubiquitous changes

(station location, shelter, observing

time etc.)

A lack of

traceability

to known

standards (metrology) and to original

hard copy data sources for most

historical records

(8)

No doubt that it is warming – the rate and temporal

/ spatial details are the issue

(NCDC Graphics Team (above), John Kennedy, Met Office Hadley Centre (right))

(9)

The ISTI

The ISTI

A framework for creation of multiple robust

independent estimates of land surface temperature

to answer scientific questions and societal demands

of the 21

st

Century

DATABANK: open, transparent processing, traceability to

standards and source

BENCHMARKING: Consistent performance evaluation and

methodological uncertainty estimates

USER TOOLS: fit for purpose, visualisation, intercomparison

(10)

Step 1: Data rescue and

provision

www.surfacetemperatures.org/databank

Lawrimore et al., 2013: Responding to the Need for Better Global Temperature

Data, EOS, 94 (6), 61–62 DOI: 10.1002/2013EO060002

Rennie et al., 2014: The International Surface

Temperature Initiative Global Land Surface Databank:

Monthly Temperature Data Version 1 Release Description and Methods. Geosciences Data Journal, DOI:

(11)

www.iedro.org

(12)
(13)

Benchmarking cycle

Willett, et al., 2014: A

framework for benchmarking of homogenisation algorithm performance on the global scale, Geoscientific

(14)
(15)

Our

Our

Inhomogenous

Inhomogenous

Observing System

(16)

Effects of Changes that

Effects of Changes that

are not of Climate Origin

are not of Climate Origin

STATION MOVE: EXPOSURE AND MICROCLIMATE = abrupt change in mean and diurnal extremes - may affect seasonal cycle extremes

SHELTER CHANGE: EXPOSURE = abrupt change in diurnal extremes - may affect seasonal cycle extremes

OBSERVING PRACTICE CHANGE: SAMPLING = abrupt change possible in mean and

extremes

INSTRUMENT CHANGE: CALIBRATION = abrupt change in mean and possibly extremes

LANDUSE CHANGE: EXPOSURE AND

(17)

Adjustment

Adjustment

Adjust mean

Adjust variance

By month? By season? By full homogeneous

period?

Use candidate – neighbour fields?

Use SNHT, PHA, MISHMASH, SPLIDHOM etc?

(18)

Confidence in

Confidence in

Adjustments Made?

Adjustments Made?

Type I Error

Do not detect or adjust when there has been a

changepoint

Type II Error

Detect and adjust when no actual changepoint

occurred

Missed adjustments vs false alarms: which is

worse?

What about adjustments in the wrong direction?

(19)

Benchmarking

Benchmarking

Basics

(20)

'TRUTH' UNKNOWN

'TRUTH' UNKNOWN

XTRUTH

t,l

=

S

t,l

+

L

t,l

+

V

t,l

+

M

t,l

XTRUTH = a climate element at time t and location l

S = seasonal cycle L = long-term trends

V = variability (ENSO,

NAO, Volcanoes, Solar Cycles...)

M = microclimate (topography, proximity to coast, prevailing wind, local environment...)

XOB

t,l

= XTRUTH

t,l

+

ɛ

ɛ

t,lt,l

+

λ

t,l

XOB = observation at time t, location l and height h

ε

ε = random error at time/place/height = random error at time/place/height (recording error, instrument error etc.)

(recording error, instrument error etc.)

λ = systematic error at time/place/height *possibly correlated* (station move, exposure change, instrument change,

(21)

Benchmarking Cycle

Benchmarking Cycle

Example use of benchmark data for USHCN

Create c.10 analog-error-worlds

– Simulate 'clean' spatio-temporal characteristics of actual stations underpinned by low frequency variability from a climate model to maintain

plausible spatial correlation

– Add abrupt and gradual changepoints to approximate our best guess real world error structures

– Run homogenisation algorithms on the test data and assess ability to recover original 'clean' data

(22)

Creating a 'Clean'

Creating a 'Clean'

Synthetic World

(23)

Team Creation:

Team Creation:

How To...

How To...

Real Station climatology = seasonal cycle (S)

Real Station standard deviation = variability (V)

and microclimate (M)

GCM interpolated to station location = long-term

trend (L) and variability (V)

Vector Autoregressive (VAR) model = variability

(V) and microclimate (M)

IMPORTANT:

Retain serial correlation at at least lag 1

Retain cross-correlations with neighbours

(24)

Station Cross-Correlation

Station Cross-Correlation

(25)

The Neighbour Disconnect Method

The Neighbour Disconnect Method

Station 1 t-1

Station 2 t-1

Station 1 t=0

Station 2 t=0

(26)

Why use a GCM?

Why use a GCM?

Global, spatially correlated trend and variability fields

Ability to play with different background trends (no

warming vs burn everything)

Ability to use other variables from the GCM to apply

errors (e.g., humidity, wind speed, radiation etc.)

Consistency across other variables should we want to

(27)

Voila!

Voila!

Vector Autoregression to

simulate standardised anomaly time series for networks

Multiply by real station

standard deviations

Add interpolated

smoothed GCM trend

Add back real station

(28)

Inhomogeneities: annual mean minimum

temperature at Reno, Nevada, USA

(29)

Creating a set of

Creating a set of

Error-filled Worlds

(30)

Effects of Changes that

Effects of Changes that

are not of Climate Origin

are not of Climate Origin

STATION MOVE: EXPOSURE AND MICROCLIMATE = abrupt change in mean and diurnal extremes - may affect seasonal cycle extremes

SHELTER CHANGE: EXPOSURE = abrupt change in diurnal extremes - may affect seasonal cycle extremes

OBSERVING PRACTICE CHANGE: SAMPLING = abrupt change possible in mean and

extremes

INSTRUMENT CHANGE: CALIBRATION = abrupt change in mean and possibly extremes

LANDUSE CHANGE: EXPOSURE AND

(31)

Team Corruption

Team Corruption

XERRORWORLD

t,l

=

XTRUTH

t,l

+

λERRORWORLD

t,l

Example error models applied to stations SURFACE

TEMPERATURE DATABANK

World 1: no breaks

World 2: few large breaks

World 3: many small breaks

World 4: abrupt and gradual breaks

World 5: many small complex breaks

(32)

Adjustment

Adjustment

Adjust mean

Adjust variance

By month? By season? By other variables e.g,

humidity, wind, solar radiation?

Spatially clustered breaks

Seasonal cycles/features

(33)

Assessing

Assessing

Homogenisation

Homogenisation

Algorithm Skill

Algorithm Skill

Against the

Against the

Benchmarks

(34)

Levels of Assessment

Levels of Assessment

Level 1: Return of original climate features e.g., trends, climatology, variability (UNC)

Level 2: Ability to detect changepoints and characterise them (ALG)

Level 3: detailed assessment of strengths and weaknesses against specific types of inhomogeneities (ALG)

Level 4: Assess performance on benchmarks alongside results from real world homogenisation (how realistic are the

benchmarks?) (BEN) RETURN:

Homogenised stations

(35)

Team Validation

Team Validation

BETTER

False alarm rate

H it r a te WORSE 2 3 4 1 5

67 8

Trend recovery percentage

Contingency Table

(36)

Other Benchmarks

(37)

Daily Benchmarks for the USA using a GAM

(38)

Monthly T and Precip benchmarks for small

Monthly T and Precip benchmarks for small

networks

networks

COST HOME

(39)

Monthly USHCN Tmax, Tmin and Tmean

Monthly USHCN Tmax, Tmin and Tmean

benchmarks

(40)

Come and play, eventually

Come and play, eventually

www.surfacetemperatures.org

~30000 stations, 10 blind worlds, 4 open worlds, monthly mean T

[email protected]

(41)

Methods

Methods

At = Φ1 At−1 + Zt

A = matrix of 1 to J stations for all times 1 to T Φ1 = autoregressive parameter matrix

Zt = matrix of residuals (random shocks)

Φ1 = Γ(1)Γ(0)-1

Γ(0) = covariance matrix of all stations Γ(1) = covariance matrix of all stations at lag 1

Corr(x,y) = a * exp((-b*d_hor) + (-c*d_vert))

Cross-Correlation: a=0.989, b=0.0009, c=0.2505 Cross-Correlation at lag 1: a=0.3, b=0.0015, c=0.8579

Zt = At − Φ1At-1

ΣZ = Γ(0) – Φ1Γ(1)T

(42)

VAR Parameters

VAR Parameters

At = Φ1 At−1 + Zt

A = matrix of 1 to J stations for all times 1 to T Φ1 = autoregressive parameter matrix

Zt = matrix of error terms/shocks (residuals)

vec(Φ1) = (I ⊗ Γ(0))-1 vec(Γ(1))

Γ(0) = covariance matrix of all stations

Γ(1) = covariance matrix of all stations at lag 1

For simulation, use exponential decay function to proxy correlation by distance (get fit from real data)

e.g. f<-function(x,a,b){a*exp(-b*x)} Cross-Correlation: a=0.9162, b=0.0005

Cross-Correlation at lag 1: a=0.4162, b=0.0005

HELP: Fit for lag 1 correlation not strong. Is this sensible?

(43)

Obtaining realistic

Obtaining realistic

spatially correlated shocks

spatially correlated shocks

Zt = At − Φ1 At-1

Shock terms need to follow the observed distribution and be spatially correlated

Create global networks and neighbourhoods across the globe Create covariance for each network+neighbourhood using

distance as a proxy

Run a factorisation algorithm to create global fields of

spatially correlated shocks that are MVN

(44)

Obtaining realistic

Obtaining realistic

spatially correlated shocks

spatially correlated shocks

Zt = At − Φ1 At-1

Shock terms need to follow the observed distribution and be spatially correlated

Create global networks and neighbourhoods across the globe Create covariance for each network+neighbourhood using

distance as a proxy

Run a factorisation algorithm to create global fields of

spatially correlated shocks that are MVN

(45)

The Random Shock Part:

The Random Shock Part:

Spatially Correlated

Spatially Correlated

(46)

The Random Shock Part:

The Random Shock Part:

Spatially Correlated

Spatially Correlated

(47)

The VAR Part:

The VAR Part:

Spatio-temporally Correlated

Spatio-temporally Correlated

Station 1 t-1

Station 2 t-1

Station 1 t=0

(48)

Results: Station AR(1)

Results: Station AR(1)

correlation

correlation

Station

Autocorrelation at lag 1:

(49)

Results: Station to

Results: Station to

Neighbours cross-correlation

Neighbours cross-correlation

Station Cross correlation with 40 nearest neighbours:

CLEAN too high compared to REAL world

Real world contaminated by random error, inhomogeneity, missing data.

(50)

Results

Results

Mean SD and AR(1) Corr of

(51)

Remaining Issues

Remaining Issues

St Dev of Difference series too low: Compare with USCRN

which is clean but very short

AR(1) of station is too high: adjust distance function

Cross-correlations possibly too high: adjust distance

function (maybe vary with latitude?)

Have not used all 30000+ stations: figure out how

Errors

www.surfacetemperatures.org

References

Related documents

This model, which is most accurate during equinox and December solstice, describes in 15° latitudinal bins the diurnal, seasonal and solar cycle variations of equatorial vertical

vacuolar invertase does affect the accumulation of sucrose in source leaves of plants grown. under a

This paper first describes height and seasonal dependence of zonal and meridional diurnal tides observed in the MLT region over Syowa Station using the MF radar data covering 10

Subsequent columns show the correlation of the mean seasonal cycle of formaldehyde with the mean seasonal cycles of gross primary production (GPP) from the LPJ model, the fraction

Note that: (1) continental runo ff is for either a mean seasonal cycle or interannual varying data (Section 2.5.2); (2) precipitation fields are a climatological mean annual cycle

Metrics for model simulation of warm season precipitation processes developed in NAMAP are examined that pertain to the seasonal progression and diurnal cycle of precipitation,

When the diurnal amplitude peaks, over land it is so strong that the diurnal cycle almost entirely accounts for the daily mean precipitation; in regions where there is a strong

By comparing the mean diurnal and seasonal variations of tropospheric ozone at three different levels in the lower troposphere the question of its sources is