Outline
Multi-objective least squares problem
Control
Estimation and inversion Regularized data fitting
Multi-objective least squares
I goal: choose
n-vector
x
so thatk
norm squared objectivesJ
1=
kA1
x b
1k
2, . . . ,
J
k=
kA
kx b
kk
2 are all smallI
A
i is anm
i⇥ n
matrix,b
i is anm
i-vector,i = 1, . . . ,k
I
J
i are the objectives in a multi-objective optimization problem(also called a multi-criterion problem)
I could choose
x
to minimize any oneJ
i, but we want onex
that makes themWeighted sum objective
I choose positive weights 1
, . . . ,
k and form weighted sum objectiveJ =
1J
1+
· · · +
kJ
k=
1kA1
x b
1k
2+
· · · +
kkA
kx b
kk
2I we’ll choose
x
to minimizeJ
I we can take 1
=
1, and call
J
1 the primary objectiveI interpretation of i: how much we care about
J
i being small, relative toprimary objective
I for a bi-criterion problem, we will minimize
Weighted sum minimization via stacking
I write weighted-sum objective as
J =
26
66
66
66
4
p
1(A
1x b
1)
..
.
p
k(Ak
x b
k)
37
77
77
77
5
2I so we have
J = k ˜Ax ˜bk
2, with˜A =
26
66
66
66
4
p
1A
1..
.
p
kA
k37
77
77
77
5
,
˜b =
26
66
66
66
4
p
1b
1..
.
p
kb
k37
77
77
77
5
Weighted sum solution
I assuming columns of
˜A
are independent,ˆx = ( ˜A
T˜A)
1˜A
T˜b
=
(
1A
T1A
1+
· · · +
kA
TkA
k)
1(
1A
T1b
1+
· · · +
kA
Tkb
k)
I can compute
ˆx
via QR factorization of˜A
Optimal trade-off curve
I bi-criterion problem with objectives
J
1,J
2 I letˆx( )
be minimizer ofJ
1+
J
2I called Pareto optimal: there is no point
z
that satisfiesJ
1(z) < J
1(ˆx( )), J
2(z) < J
2(ˆx( ))
i.e., no other point
x
beatsˆx
on both objectivesExample
A
1 andA
2 both10 ⇥ 5
10
410
210
010
210
40.2
0
0.2
0.4
0.6
ˆx
1( )
ˆx
2( )
ˆx
3( )
ˆx
4( )
ˆx
5( )
Objectives versus and optimal trade-off curve
10
410
210
010
210
45
10
15
J
1( )
J
2( )
6
8
10
12
14
16
5
10
=
0.1
=
1
=
10
J
( )
J
2(
)
Using multi-objective least squares
I identify the primary objective
– the basic quantity we want to minimize
I choose one or more secondary objectives
– quantities we’d also like to be small, if possible
– e.g., size of x, roughness of x, distance from some given point
I tweak/tune the weights until we like (or can tolerate)
ˆx( )
I for bi-criterion problem withJ = J
1+
J
2:– if J2 is too big, increase
Outline
Multi-objective least squares problem Control
Estimation and inversion Regularized data fitting
Control
I
n-vector
x
corresponds to actions or inputs Im-vector
y
corresponds to results or outputsI inputs and outputs are related by affine input-output model
y = Ax + b
I
A
andb
are known (from analytical models, data fitting . . . )I the goal is to choose
x
(which determinesy), to optimize multiple
Multi-objective control
I typical primary objective:
J
1=
ky y
desk
2, wherey
des is a given desired ortarget output
I typical secondary objectives:
– x is small: J2 = kxk2
Product demand shaping
I we will change prices of
n
products byn-vector
price I this induces change in demand dem=
E
d priceI
E
d is then ⇥ n
price elasticity of demand matrix I we wantJ
1=
k
dem tark
2 smallI and also, we want
J
2=
k
pricek
2 smallI so we minimize
J
1+
J
2, and adjust>
0
Robust control
I we have
K
different input-output models (a.k.a. scenarios)y
(k)=
A
(k)x + b
(k),
k = 1, . . . ,K
I these represent uncertainty in the system
I
y
(k) is the output with inputx, if system model
k
is correct I average cost across the models:1
K
KX
k=1ky
(k)y
desk
2 2Outline
Multi-objective least squares problem Control
Estimation and inversion
Estimation
I measurement model:
y = Ax + v
I
n-vector
x
contains parameters we want to estimate Im-vector
y
contains the measurementsI
m-vector
v
are (unknown) noises or measurement errors Im ⇥ n
matrixA
connects parameters to measurementsI basic least squares estimation: assuming
v
is small (andA
hasRegularized inversion
I can get far better results by incorporating prior information about
x
intoestimation, e.g.,
– x should be not too large
– x should be smooth
I express these as secondary objectives:
– J2 = kxk2 (‘Tikhonov regularization’)
– J2 = kDxk2
I we minimize
J
1+
J
2I adjust until you like the results
I curve of
ˆx( )
versus is called regularization pathImage de-blurring
I
x
is an imageI
A
is a blurring operatorI
y = Ax + v
is a blurred, noisy imageI least squares de-blurring: choose
x
to minimizekAx yk
2+
(kD
vxk
2+
kD
hxk
2)
D
v,D
h are vertical and horizontal differencing operationsExample
Regularization path
Regularization path
Tomography
I
x
represents values in region of interest ofn
voxels (pixels)I
y = Ax + v
are measurements of integrals along lines through regiony
i=
nX
i=1
A
ijx
j+
v
iI
A
ij is the length of the intersection of the line in measurementi
with voxelj
line in measurement i x1 x2
Least squares tomographic reconstruction
I primary objective is
kAx yk
2I regularization terms capture prior information about
x
I for example, if
x
varies smoothly over region, use Dirichlet energy for graphExample
I left: 4000 lines (100 points, 40 lines per point)
Regularized least squares reconstruction
=
10
2=
10
1=
1
Outline
Multi-objective least squares problem Control
Estimation and inversion Regularized data fitting
Motivation for regularization
I consider data fitting model (of relationship
y ⇡ f (x)
)ˆf(x) = ✓
1f
1(x) + · · · + ✓
pf
p(x)
withf
1(x) = 1
I
✓
i is the sensitivity ofˆf(x)
tof
i(x)
I so large
✓
i means the model is very sensitive tof
i(x)
I✓
1 is an exception, sincef
1(x) = 1
never variesRegularized data fitting
I suppose we have training data
x
(1), . . . ,
x
(N),y
(1), . . . ,
y
(N) I express fitting error on data set asA✓ y
I regularized data fitting: choose
✓
to minimizekA✓ yk
2+
k✓2:p
k
2I
>
0
is the regularization parameterI for regression model
ˆy = X
T+
v
1
, we minimizeExample
0 0.5 1 1 0 1 2 3 x Train TestI solid line is signal used to generate synthetic (simulated) data
I 10 blue points are used as training set; 20 red points are used as test set I we fit a model with five parameters
✓
1, . . . ,✓
5:4