The Use of Emulation as Part of Decision Making Under
5.1. Structure of the Uncertainty Problem 111
5.2.1 Fitting an Emulator
Section 2.3 gave an overview for the existing literature on transmission expansion plan-ning. It was noted that a large amount of the literature gave little thought to uncer-tainty in the power system background when identifying optimal expansion decisions of the power system’s transmission system. Further, the literature which did consider uncertainty did so by considering a small number of possible states of the power sys-tem. This thesis will overcome this by treating the variables which define the power system as continuous variables, which would allow the variables to take any potential value over a given range (e.g. we could model the peak demand level to be anywhere between 62100 MW and 63600 MW) instead of a small number of possible values.
As mentioned, in the problem presented there is uncertainty in the values of the input variables, X. However, the function fT(X) (the simulator of total costs), is far too ex-pensive to evaluate at every set of input parameters desired. Therefore, it is necessary to evaluate fT a small number of times for particular values of X, and approximate fT everywhere else as carefully as possible by some alternative function, ˜fT. ˜fT should be much less computationally demanding to evaluate in order to allow for efficient estima-tion for input values not simulated. An estimate of the uncertainty that exists in the response when approximating fT by ˜fT is also required. The process of approximating fT by ˜fT is known as emulation [74, 45, 89].
Suppose there are Nv variables with uncertainty of interest, labelled v1, ..., vNv and Nd decision variables, d1, ..., dNd. The goal is to fit a model to the output of fT (the sum
5.2. Emulation 113
of mean constraint costs plus reinforcement costs) based on these Nv + Nd variables.
As this process assumes all other variables are considered fixed as if known, ˜fT can be treated as if it is a function of v1, ..., vNv and d1, ..., dNd only.
A small set of training runs of the simulator are used to construct the function ˜fT which approximates fT. These training runs vary the values of v1, ..., vNv, and d1, ..., dNd over a given range. Details on how training runs are selected to give a dense coverage of the space defined by v1, ..., vNv and d1, ..., dNd are given in Section 5.2.4.
Suppose yT represents the response of the simulator (fT) for the training data. For simplicity in the following notation, also suppose x = (v, d) = (v1, ..., vNv, d1, ..., dNd).
The emulator, ˜fT, is usually constructed such that ˜fT is the sum of a parametric model plus a non-parametric smoother applied to the residuals of the parametric model [45, 89, 84, 92], i.e.
f˜T(x) = g(x)Tβ + e(x) (5.2.1) where g(x)Tβ is the parametric portion of the emulator and e(x) is the non-parametric smoother applied to the residuals of g(x)Tβ. The emulator used in this thesis will use a polynomial regression model as the parametric portion (i.e. g(x)Tβ from Equa-tion 5.2.1) of the emulator model. This can be thought of as
yT(v1, ...vNv, d1, ..., dNd) = yT(x) = β0+
p
X
t=1 Nv+Nd
X
i=1
βt,ixti+
p
X
t=1 Nv+Nd
X
i=1
X
j6=i
βt,i,jxtixtj+...+ε(x) (5.2.2) i.e. the dot product of a coefficient vector with the polynomial form of the predictor variables with error term ε(v1, ..., vNv, d1, ..., dNd). This thesis only considers interac-tions between equal powers of variables (e.g. the polynomial may include an interaction between v1 and d2 but not between v1 and d22). Values of β are estimated via least squares regression, as will be detailed in Section 5.2.2.
Whilst this methodology applies very generally, the terms included in the polynomial regression model should be carefully selected based on the specifics of the example considered. This is considered in Sections 5.3.3 and 6.1.3, where both the R2 value of the polynomial regression model and a measure of the resulting emulator’s predictive power for a second set of training data which was not used to fit the model are used
to identify terms to be included in the model.
If the residual of the polynomial regression model, ε(x∗), is positive for a given set of input values, x∗, this indicates that the polynomial regression model under-estimates the response of the simulator, yT(x), at x∗. As yT(x) is modelled as a continuous variable, ε(x) is also modelled as a continuous variable. By the continuity of yT(x), the polynomial model will also under-estimate the response for values of x close to x∗, i.e. values of ε(x) are not independent but rather contain information about the response of the simulator near x.
This means that the fit of the model can be improved further by using a Gaussian process model to smooth the residuals, ε(v1, ..., vNv, d1, ..., dNd). This is important as the polynomial model will aim to give a good fit over the entire range of the variables considered, whereas the Gaussian process model can model local behaviour much more accurately. The idea of a Gaussian process is to smooth the residuals in order for the model to agree with the training data and provide an accurate estimate of response (and variance of the estimated response) for values where training data is unavailable.
In this sense, the Gaussian process model can be thought of as a smooth interpolator of the residuals [74]. In simple terms, the idea of the Gaussian process is if a response is to be estimated at point xp, training data closer to xp carries more relevant information and should be given more weight when making an estimation. Further details on Gaussian process models are given in Section 5.2.3.
In this thesis, the polynomial portion of the emulator model will be denoted by ˜fT1 and the Gaussian process model fitted to the resulting residuals of the polynomial model will be denoted by ˜fT2. Thus, the emulator model for given inputs is the sum of these two, denoted ˜fT, such that
f˜T(v1, ..., vNv, d1, ..., dNd) = ˜fT1(v1, ..., vNv, d1, ..., dNd) + ˜fT2(v1, ..., vNv, d1, ..., dNd) (5.2.3) That is to say ˜fT is our emulator which approximates the simulator, and the estimated response of the emulator is acquired by evaluating the sum of the estimated responses of the fitted polynomial regression model and the corresponding Gaussian process model for given input values.
After the emulator has been fitted to a set of training data, a second set of training
5.2. Emulation 115
data will then be simulated. The emulator model fitted to the first set of training data, f˜T, will then be used to estimate a response for the output of the simulator at the second set of training data. By comparing the estimated response of the emulator to the value simulated the predictive power of the emulator can be assessed, as will be demonstrated in Section 5.3.3.