Automatic Gain Tuning
based on
Gaussian Process Global Optimization
(= Bayesian Optimization)
https://is.tuebingen.mpg.de/publications/marco_icra_2016
A simple PD control example
What globally optimal gains θ achieve the minimum cost J?
A simple PD control example
Procedure of Bayesian Optimization
1. GP prior before observing any data
2. GP posterior, after five noisy evaluations
3. The next parameters θnext are chosen
at the maximum of the acquisition function
Repeat until a globally optimal θ is found
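The three steps above can be sketched as a minimal Bayesian-optimization loop. Everything below is illustrative: a hypothetical 1-D cost J(θ), a hand-picked SE length-scale, and a lower-confidence-bound acquisition standing in for the paper's Entropy Search.

```python
import numpy as np

def se_kernel(a, b, ell=0.3, sf2=1.0):
    # Squared-exponential covariance: k(t, t') = sf2 * exp(-(t - t')^2 / (2 ell^2))
    d = a[:, None] - b[None, :]
    return sf2 * np.exp(-0.5 * (d / ell) ** 2)

def gp_posterior(X, y, Xs, noise=1e-4):
    # Condition the GP on data (X, y); return posterior mean/std on the grid Xs.
    K = se_kernel(X, X) + noise * np.eye(len(X))
    Ks = se_kernel(X, Xs)
    mu = Ks.T @ np.linalg.solve(K, y)
    var = np.ones(len(Xs)) - np.einsum("ij,ij->j", Ks, np.linalg.solve(K, Ks))
    return mu, np.sqrt(np.maximum(var, 1e-12))

def cost(theta):
    # Hypothetical 1-D cost J(theta) standing in for a real experiment.
    return np.sin(3.0 * theta) + 0.5 * theta

rng = np.random.default_rng(0)
grid = np.linspace(0.0, 2.0, 200)
X = rng.uniform(0.0, 2.0, 3)             # 1. a few initial evaluations
y = cost(X)
for _ in range(10):
    mu, sd = gp_posterior(X, y, grid)    # 2. GP posterior given the data
    acq = -(mu - 2.0 * sd)               # 3. acquisition (here: lower confidence bound)
    theta_next = grid[np.argmax(acq)]
    X, y = np.append(X, theta_next), np.append(y, cost(theta_next))

theta_star = X[np.argmin(y)]             # current best estimate of the minimizer
```

On this toy cost the loop tends to concentrate samples near the global minimum around θ ≈ 1.5, exactly the repeat-until-found behavior the slide describes.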
Acquisition function
Information gain: the mutual information between a new observation and the location θ∗ of the minimum
 Reduction of uncertainty in the location θ∗ by selecting points
that are expected to cause the largest reduction in entropy of
the distribution pmin(θ∗)
Acquisition function and Entropy Search
Control design problem
The authors propose the use of Entropy Search (ES), a recent algorithm for global
Bayesian optimization, as the minimizer for the LQR tuning problem.
ES employs a Gaussian process (GP) as a non-parametric model
capturing the knowledge about the unknown cost function.
Consider a system that follows a discrete-time nonlinear dynamic
model xk+1 = f(xk, uk, wk), with
system states xk, control input uk, and zero-mean process noise wk at
time instant k
A common way to measure the performance of a control system is
through a quadratic cost function such as J = (1/K) Σk (xkᵀ Q xk + ukᵀ R uk).
The cost captures a trade-off between control performance (keeping xk
small) and control effort (keeping uk small)
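For concreteness, a finite-horizon version of this cost can be evaluated from a simulated rollout. The double-integrator model, horizon, and gain values below are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def quadratic_cost(A, B, F, Q, R, x0, K=200):
    # Roll out x_{k+1} = A x_k + B u_k with state feedback u_k = -F x_k
    # and accumulate the average quadratic cost (1/K) * sum(x'Qx + u'Ru).
    x, J = x0, 0.0
    for _ in range(K):
        u = -F @ x
        J += float(x @ Q @ x + u @ R @ u)
        x = A @ x + B @ u
    return J / K

dt = 0.1
A = np.array([[1.0, dt], [0.0, 1.0]])   # hypothetical double-integrator model
B = np.array([[0.0], [dt]])
Q, R = np.eye(2), np.eye(1)             # trade-off weights: state vs. effort
F = np.array([[0.9, 1.4]])              # some candidate gains to be tuned
J = quadratic_cost(A, B, F, Q, R, np.array([1.0, 0.0]))
```

Increasing R penalizes control effort more heavily, pushing the tuned gains toward gentler inputs; increasing Q does the opposite.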
The nonlinear control design problem is intractable in general,
but a linear model is often sufficient for control design
Control design problem
Linear Quadratic Regulator (LQR)
With a linear state-feedback law uk = −F xk, the static gain matrix F
can readily be computed by solving the discrete-time infinite-horizon
LQR problem for the nominal model (An, Bn) and the weights (Q, R).
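One standard way to obtain F is to iterate the discrete-time Riccati recursion to its fixed point. The sketch below assumes a hypothetical double-integrator nominal model (An, Bn); it is not code from the paper.

```python
import numpy as np

def dlqr(A, B, Q, R, iters=500):
    # Iterate the discrete-time Riccati recursion
    #   P <- Q + A'PA - A'PB (R + B'PB)^-1 B'PA
    # to a fixed point, then read off the static gain F (u_k = -F x_k).
    P = Q.copy()
    for _ in range(iters):
        F = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
        P = Q + A.T @ P @ A - A.T @ P @ B @ F
    return F, P

dt = 0.1
An = np.array([[1.0, dt], [0.0, 1.0]])  # nominal model: a double integrator
Bn = np.array([[0.0], [dt]])
Q, R = np.eye(2), np.eye(1)
F, P = dlqr(An, Bn, Q, R)

# The closed loop An - Bn F should be stable (spectral radius < 1).
rho = max(abs(np.linalg.eigvals(An - Bn @ F)))
```

In practice a library routine (e.g. a discrete algebraic Riccati equation solver) would replace the hand-rolled iteration, but the fixed-point form makes the dependence on (An, Bn, Q, R) explicit.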
LQR tuning problem
The goal of automatic LQR tuning is to vary the parameters θ
so as to minimize the cost
Optimization problem
In practice, the quadratic cost is approximated by a finite-horizon estimate computed from experimental data
The uncertainty over the objective function J is represented by a
probability measure p(J), typically a Gaussian process (GP)
LQR TUNING WITH ENTROPY SEARCH
Prior knowledge about the cost function J is expressed as a GP
with a mean function μ(θ) and a covariance function k(θ, θ′), e.g. the
squared exponential (SE) covariance function
Noisy evaluations of the cost function can be modeled as ŷ = J(θ) + ε
with Gaussian noise ε. Conditioning the GP on the data then yields another GP with
a posterior mean μ(θ) and a posterior variance σ²(θ).
 The shape of the mean is adjusted to fit the data points, and the
uncertainty (standard deviation) is reduced around the evaluation
points.
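A minimal numpy illustration of this conditioning step (hypothetical 1-D parameter, five made-up noisy evaluations): the posterior variance collapses near the data points and stays close to the prior away from them.

```python
import numpy as np

def se(a, b, ell=0.5, sf2=1.0):
    # squared exponential (SE) covariance function
    d = a[:, None] - b[None, :]
    return sf2 * np.exp(-0.5 * (d / ell) ** 2)

rng = np.random.default_rng(1)
X = np.array([-1.5, -0.5, 0.0, 0.8, 1.6])     # five evaluated parameters
y = np.sin(X) + 0.05 * rng.normal(size=5)     # noisy cost evaluations
sn2 = 0.05 ** 2                               # evaluation-noise variance

Xs = np.linspace(-2.0, 2.0, 101)              # prediction grid
K = se(X, X) + sn2 * np.eye(len(X))           # condition on the noisy data
Ks = se(X, Xs)
mu = Ks.T @ np.linalg.solve(K, y)             # posterior mean
# posterior variance; the prior variance is sf2 = 1 everywhere
var = 1.0 - np.einsum("ij,ij->j", Ks, np.linalg.solve(K, Ks))
```

Plotting mu with a ±2·sqrt(var) band reproduces the familiar GP picture from the slides: a tube that pinches at each evaluation and widens in between.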
LQR TUNING WITH ENTROPY SEARCH
ES is one of several popular formulations of Bayesian optimization
It aims to reduce the uncertainty in the location θ∗ of the minimum
by selecting evaluations expected to cause the largest reduction in
entropy (uncertainty) of the distribution pmin(θ) over the minimizer of J(θ)
Bayesian Optimization
Bayesian Optimization and How to Scale It Up
Zi Wang
CS Colloquium, US
Information gain: the mutual information between a new observation and the location θ∗ of the minimum
 Reduction of uncertainty in the location θ∗ by selecting points
that are expected to cause the largest reduction in entropy of
the distribution pmin(θ∗)
Probability of Improvement
PI(θ) = Φ((Jbest − μ(θ)) / σ(θ))
Φ : Gaussian cumulative density function
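For cost minimization, this acquisition is the posterior probability that an evaluation at θ beats the incumbent best cost. A small stdlib-only sketch (the numbers are made up):

```python
import math

def normal_cdf(z):
    # Gaussian cumulative distribution function Phi(z).
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def probability_of_improvement(mu, sigma, j_best):
    # PI(theta) = Phi((j_best - mu(theta)) / sigma(theta)) for minimization:
    # the posterior probability that an evaluation beats the incumbent j_best.
    return normal_cdf((j_best - mu) / sigma)

# A point whose posterior mean is well below the incumbent is almost
# certain to improve: Phi(3) ~ 0.9987.
pi = probability_of_improvement(mu=0.2, sigma=0.1, j_best=0.5)
```

PI greedily favors small expected improvements with high certainty, which is one reason entropy-based acquisitions like ES are preferred when the goal is to learn the minimizer's location rather than to improve the incumbent.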
Evolution of an example Gaussian process over three successive
function evaluations (orange dots); the approximated probability
distribution over the location of the minimum, pmin(θ), is shown in green