2.3 Modifier Adaptation
2.3.1 Basic Modifier Adaptation
The functionsφpand gpare usually not known accurately, as only the modelsφ and g are available. Consequently, an approximate solution to the original problem (2.1.1) is obtained by solving the following model-based problem:
u∗(θ) := argminu φ(u,θ)
subject to g (u,θ) ≤ 0, (2.3.1)
whereθ is an nθ-dimensional vector of uncertain model parameters. If the model matches the plant perfectly, solving Problem (2.3.1) provides a solution to Problem (2.1.1).
Unfortunately, this is rarely the case, since the structure of the model functionsφ and g as well as the nominal values,θ0, for the uncertain model parameters are likely to be incorrect. This structural and parametric mismatch implies that the model-based optimal input u∗(θ) will probably not correspond to u∗p, not only forθ = θ0, but for any values of the model parametersθ.
MA collects process information to correct for the differences between the plant and the model optimization problems. This is done by applying successively different values of u to the plant, each time waiting for the plant to settle to steady state and observing its performance. The measured cost and constraints corresponding to the input ukat the kt hiteration are:
φ˜p(uk) = φp(uk) + dkφ (2.3.2)
˜
gp, j(uk) = gp, j(uk) + dkg , j, ∀ j ∈ [1, . . . , ng] (2.3.3) where dkφand dkg , j are realizations of a zero-mean random variable for the cost and the jt hconstraint, respectively, with the corresponding variancesσ2φandσ2g , j. This stochastic component represents high-frequency noise due to measurement noise and high-frequency disturbances affecting the plant. The process measurements are used to iteratively modify the model-based problem (2.3.1) in such a way that, upon conver-gence, the necessary conditions of optimality (NCO) for the modified problem match those for the plant-based problem (2.1.1). This is made possible by using modifiers that, at each iteration, are computed as the differences between the measured and predicted values of the constraints and the measured and predicted cost and constraint gradients.
This forces the cost and constraints in the model-based optimization problem to locally
match those of the plant. In its simplest form, the algorithm proceeds as follows:
Algorithm 2.3.1: Modifier Adaptation (Marchetti et al., 2009)
Initialize the modifier terms: the ng-dimensional vector of zeroth-order constraint modifiers²0= 0, the nu-dimensional vector of first-order cost modifiersλφ0 = 0, and the (nu× ng) matrix of first-order constraint modifiersλ0g= 0. Choose the modifier filter matrices K², Kφ, Kg, typically diagonal matrices with eigenvalues in the interval (0, 1].
Also, choose arbitrarily u0= 0.
for k = 1 → ∞
1. Solve the modified model-based optimization problem uk:= argmin
u φm,k−1(u)
subject to gm,k−1(u) ≤ 0, (2.3.4)
where the modified cost and constraints are given by
φm,k(u) := φ(u,θ0) + (λφk)T(u − uk), (2.3.5) gm,k(u) := g(u,θ0) + ²k+ (λgk)T(u − uk). (2.3.6)
The subscript (·)mindicates a quantity that has been modified.
2. Apply the input ukto the plant to obtain ˜φp(uk) and ˜gp(uk).
3. Obtain an estimate of the plant cost gradient, ∇φE,k, and the plant constraint gradient, ∇gE,k(these gradient estimates are for the current operating point uk).
The gradients must be estimated using measurements collected at no less than nudifferent operating points close to uk(to be further discussed in Section 2.3.2).
4. Update the modifier terms using the following first-order filter equations1:
²k:= (Ing− K²)²k−1+ K²³
˜gp(uk) − g(uk,θ0)´
, (2.3.7)
λφk := (Inu− Kφ)λφk−1+ Kφ¡∇φE,k− ∇φ(uk,θ0)¢T
. (2.3.8)
λgk:= (Inu− Kg)λgk−1+ Kg¡∇gE,k− ∇g(uk,θ0)¢T
, (2.3.9)
end
1There is also a variant of the MA algorithm which filters the inputs, rather than the modifier terms (Marchetti, 2009).
The MA algorithm’s most attractive property is that, if the scheme converges, then, under ideal circumstances, it will do so to a KKT point for the plant.
Theorem 2.3.1 (KKT matching). Let the gain matrices K², Kφ, Kgbe nonsingular. Assume no high-frequency noise and perfect gradient estimates, i.e. ∇φE,k= ∇φp(uk), ∇gE,k=
∇gp(uk). If Algorithm 2.3.1 converges, with u∞:= limk→∞uk being a KKT point for the modified problem (2.3.4), then u∞is also a KKT point for the plant optimization problem (2.1.1).
Proof. See Marchetti et al. (2009).
Note that, while the KKT-matching property is a very desirable property for a RTO algorithm, it remains a theoretical result. In a real application, due to high-frequency noise, the algorithm will converge to a neighborhood of the plant optimum.
The KKT matching theorem guarantees that, if MA converges, it will converge to a plant KKT point. But can the algorithm converge to the plant optimum? In the field of RTO, this is referred to as the ‘Model Adequacy’ question (Forbes and Marlin, 1996). In the case of MA, the process model {φ,g} is called adequate if values for the modifiers ²,λφ andλgcan be found such that a fixed point of the MA algorithm 2.3.1 coincides with the plant optimum u∗p. This results in a (fairly relaxed) requirement that the model must satisfy.
Theorem 2.3.2 (Model Adequacy). Let u∗pbe the unique plant optimum, which is as-sumed to be a regular point for the nagactive constraints. The process model is adequate for use in MA if the reduced Hessian of the cost functionφ is positive definite at u∗p:
ZT¡∇2φ¢Z > 0, (2.3.10)
where the columns of Z ∈ Rnu×(nu−nag)are a set of basis vectors for the null space of the Jacobian of the active constraints in the model-based optimization problem (2.3.1).
Proof. See Marchetti et al. (2009).
The Model Adequacy Condition is automatically satisfied if the model cost function is convex. Indeed, François and Bonvin (2013a) propose a method to enforce the Model
Adequacy Condition for a general non-linear model cost function by using convex approximations.
We now know that MA will only converge to a plant KKT point, and that, assuming the model-adequacy criterion is met, it can converge to the plant optimum. But will the MA scheme converge at all? This is a difficult question to answer. Both necessary and sufficient conditions have been proposed for the convergence of MA (Marchetti et al., 2009; Faulwasser and Bonvin, 2014; Bunin, 2014). Unfortunately, none of these conditions are very satisfactory from a practical point of view, as they are generally impossible to verify/enforce in practice. The exponential filter matrices K², Kφ, Kgare the tuning parameters that influence convergence. These matrices should be chosen with real, positive eigenvalues in the interval (0, 1]. Larger eigenvalues encourage more rapid convergence, but may also cause oscillating behavior, or a failure to converge at all. Smaller eigenvalues result in the MA algorithm taking more cautious steps, making convergence more likely, but slower. Currently, the only viable option is to tune these filter matrices through simulation or experimental trials.
Finally, note that an innovative method named‘Nested’ Modifier Adaptation has re-cently been proposed (Navia et al., 2014, 2013), which completely avoids the gradient estimation step. Rather, the gradient modifiersλφk andλkg are determined at each iteration by an unconstrained gradient-free optimization routine, such as the simplex method. This optimization routine adjusts the gradient modifiers at each RTO iteration in order to optimize the plant’s measured performance. While this conveniently avoids gradient estimation, the drawback is that the gradient-free optimization algorithm must optimize the plant using nu(ng+ 1) decision variables. Due to this large number of decision variables, it may take many RTO iterations to converge to the plant optimum.