CHAPTER 2 Literature Review
2.2 Approximate Dynamic Programming (ADP) Approach
Due to the uncertainty and stochastic nature of natural gas storage decisions, scholars such as Lai et al. (2008), gravitated toward stochastic dynamic programming (SDP) as the natural approach to solving the storage valuation problem. Solving dynamic optimization problems is rooted in the application of Bellmanβs equation which solves simpler sub-problems, computing the value of a decision at a certain point in time in terms of the payoff of the current decision and the value of the remaining decisions within the problem that result from the prior decisions. Bellmanβs equation for stochastic problems can be described as
ππ‘ π π‘, ππ‘ = max πΆ π₯, π + πΎπΌ ππ‘+1 π π‘+1, ππ‘+1 |π π‘, ππ‘ β π‘ β π,
where C(x, a) is the contribution made at time t from taking action a while in state x, and Ξ³ discounts the expected value of the next state (Puterman 1994).
Storage value is determined by the expected cash flow generated by the operational decisions. These decisions are dependent on the current operational characteristics of the storage facility in addition to the market conditions at the
CHAPTER 2: Literature Review
26
time. Given the decision to inject or withdraw x and the operational and market characteristics that make up the state at time t, the value of storage can be expressed by the following Bellmanβs equation
ππ‘ ππ‘, ππ‘ = arg max πΆπΉπ‘ π₯, ππ‘ + πβπππ‘πΌ ππ‘+1 ππ‘+1, ππ‘+1 |ππ‘ β π‘ β π, subject to βππ‘ β€ π₯ β€ πΌπ‘, ππ‘+1 = π₯ + ππ‘, 0 β€ π₯ β€ ππππ₯, πΆπΉπ‘ π, ππ‘ = π β ππ‘, where
ππ‘ = the inventory level at time t with ππππ₯being total capacity
π₯ = the amount of flow in any given period t
ππ‘ = the maximum withdrawal capacity at time t
πΌπ‘ = the maximum injection capacity at time t
π π‘ = the gas price at time t which is determined by a specific stochastic process
ππ‘ = the forward curve at time t
CHAPTER 2: Literature Review
27
The solution approach for solving a discrete SDP is defined by backward induction, which can be implemented by first considering the last time a decision is made and choosing what to do in any situation at that time. Using this information, one can then determine what to do at the second-to-last time of decision. This process continues backwards until one has determined the best action for every possible situation.
Though it appears an easily obtained solution can be reached, the presence of the forward curve vector, ππ‘, within system state that complicates the ability to
reach a solution. ππ‘ is a vector of forward prices at time t for all the futures contracts
that mature at various times in the future. Over 72 contracts are traded in the futures market at one time, yet utilizing even a fraction of them within the state variable subjects the problem to the βcurse of dimensionality.β This means that the high dimensionality of the state space and the manipulation of exponentially large volumes of information render the solution unfeasible by backwards induction.
To combat this problem Lai et al. (2008) develops a technique based on ADP methods to value the storage of natural gas and renders the high-dimensional model of the forward curve more pliable. This approach transforms the intractable stochastic dynamic program model of the storage problem into a manageable lower dimensional Markov decision process.
The approach to reducing the computationally intractable SDP model is to develop an approximate model using information reduction. Lai et al. (2008), removes price related state variables from the state definition of the exact model.
CHAPTER 2: Literature Review
28
Though the amount of variables removed may fluctuate based on the solverβs discretion, it is important to note that the more price-related variables are removed, the easier it will be to solve efficiently. The contraction within the ADP technique occurs by computing the optimal value function by conditioning on the possible values of the next monthβs futures price and the next months futures price at π‘ = 0, instead of conditioning on the whole forward curve. So now the vector ππ‘, is composed of two scalar variables, ππ‘,π‘+1 and π0,π‘+1. The state variable now includes the current inventory, spot price, and only two variables which are associated with the forward curve within each period t. Thus, the ADP model of the problem is vastly reduced from the previous model and is given by
ππ‘π΄π·π π
π‘, ππ‘ = πΌ π£π‘π΄π·π ππ‘, ππ‘,π‘+1 |π0,π‘+1 β π‘ β π,
where,
π£π‘π΄π·π π
π‘, ππ‘ = arg max πΆπΉπ‘ π₯, ππ‘ + πβπππ‘πΌ ππ‘+1π΄π·π ππ‘+1, π π‘+1 |ππ‘,π‘+1 .
The conditioning on the next monthβs futures price at time 0, π0,π‘+1, can be
seen in the first equation and yields the expected value of the value function displayed by the second equation, which is conditioned on the next monthβs futures price at time t, ππ‘,π‘+1.
It is reported that the implementation of this ADP model can generate values that on average are 97% of optimal storage value, while the LP model of Section 2.1 reports a larger underestimate of the value of storage with an average of 75% (Lai, Margot and Secomandi 2008). It appears that ADP dominates the LP model in terms
CHAPTER 2: Literature Review
29
of the actual valuation of the storage facility. However, looking at cpu run time as an alternative indicator, ADP can be considered as a suboptimal policy with extensively higher cpu requirements, averaging 250 cpu seconds compared to the .0038 cpu seconds required by the LP model. It is evident that despite the improvements in the actual valuation of natural gas storage, greater computational burdens are undertaken to produce such results. The fact that ADP solution methods are considered slower and less efficient, little impression is made on industry practitioners. Thus, it is important to find a method that will best balance valuation quality and computational efficiency.
Chapter 3 will discuss the stochastic optimization method known as optimal learning, focusing on the stochastic search method known as the Knowledge Gradient Policy. We will utilize this process in order to find a valuation method for natural gas storage that deploys stochastic optimization methods lacking in the LP model, yet will yield less computational burdens found in the ADP model.