Comparison of covariance models for gaussian processes¶
The main goal of this example is to briefly review the most important covariance models and compare them in terms of regularity.
We first show how to define a covariance model, a temporal grid and a gaussian process. We first consider the squared exponential covariance model and show how the trajectories are sensitive to its parameters. We show how to define a trend. In the final section, we compare the trajectories from exponential and Matérn covariance models.
References¶
Carl Edward Rasmussen and Christopher K. I. Williams (2006) Gaussian Processes for Machine Learning. Chapter 4: “Covariance Functions”, www.GaussianProcess.org/gpml
The anisotropic squared exponential model¶
The SquaredExponential
class allows to define covariance models : * is the amplitude parameter, * is the scale.
[1]:
import openturns as ot
[2]:
# Amplitude values
amplitude = [3.5]
# Scale values
scale = [1.5]
# Covariance model
myModel = ot.SquaredExponential(scale, amplitude)
Gaussian processes¶
In order to create a GaussianProcess
, we must have * a covariance model, * a grid.
Optionnally, we can define a trend (we will see that later in the example). By default, the trend is zero.
We consider the domain . We discretize this domain with 100 cells (which corresponds to 101 nodes), with steps equal to 0.1 starting from 0:
[3]:
xmin = 0.0
step = 0.1
n = 100
myTimeGrid = ot.RegularGrid(xmin, step, n+1)
graph = myTimeGrid.draw()
graph.setTitle("Regular grid")
graph
[3]:
Then we create the gaussian process (by default the trend is zero).
[4]:
process = ot.GaussianProcess(myModel, myTimeGrid)
process
[4]:
GaussianProcess(trend=[x0]->[0.0], covariance=SquaredExponential(scale=[1.5], amplitude=[3.5]))
Then we generate 10 trajectores with the getSample
method. This trajectories are in a ProcessSample
.
[5]:
nbTrajectories = 10
sample = process.getSample(nbTrajectories)
type(sample)
[5]:
openturns.func.ProcessSample
We can draw the trajectories with drawMarginal
.
[6]:
graph = sample.drawMarginal(0)
graph.setTitle("amplitude=%.3f, scale=%.3f" % (amplitude[0], scale[0]))
graph
[6]:
In order to make the next examples easier, we define a function which plots a given number of trajectories from a gaussian process based on a covariance model.
[7]:
def plotCovarianceModel(myCovarianceModel,myTimeGrid,nbTrajectories):
'''Plots the given number of trajectories with given covariance model.'''
process = ot.GaussianProcess(myCovarianceModel, myTimeGrid)
sample = process.getSample(nbTrajectories)
graph = sample.drawMarginal(0)
graph.setTitle("")
return graph
The amplitude parameter sets the variance of the process. A greater amplitude increases the chances of getting larger absolute values of the process.
[8]:
amplitude = [7.]
scale = [1.5]
myModel = ot.SquaredExponential(scale, amplitude)
graph = plotCovarianceModel(myModel,myTimeGrid,10)
graph.setTitle("amplitude=%.3f, scale=%.3f" % (amplitude[0], scale[0]))
graph
[8]:
Modifying the scale parameter is here equivalent to stretch or contract the “time” .
[9]:
amplitude = [3.5]
scale = [0.5]
myModel = ot.SquaredExponential(scale, amplitude)
graph = plotCovarianceModel(myModel,myTimeGrid,10)
graph.setTitle("amplitude=%.3f, scale=%.3f" % (amplitude[0], scale[0]))
graph
[9]:
Define the trend¶
The trend is a deterministic function. With the GaussianProcess
class, the associated process is the sum of a trend and a gaussian process with zero mean.
[10]:
f = ot.SymbolicFunction(['x'], ['2*x'])
fTrend = ot.TrendTransform(f,myTimeGrid)
[11]:
amplitude = [3.5]
scale = [1.5]
myModel = ot.SquaredExponential(scale, amplitude)
process = ot.GaussianProcess(fTrend,myModel, myTimeGrid)
process
[11]:
GaussianProcess(trend=[x]->[2*x], covariance=SquaredExponential(scale=[1.5], amplitude=[3.5]))
[12]:
nbTrajectories = 10
sample = process.getSample(nbTrajectories)
graph = sample.drawMarginal(0)
graph.setTitle("amplitude=%.3f, scale=%.3f" % (amplitude[0], scale[0]))
graph
[12]:
Other covariance models¶
There are other covariance models. The models which are used more often are the following. * SquaredExponential
. The generated processes can be derivated in mean square at all orders. * MaternModel
. When , it converges to the squared exponential model. This model can be derivated times only if . In other words, when increases, then the trajectories are more and more regular. The particular case is the
exponential model. The most commonly used values are and , which produce trajectories that are, in terms of regularity, in between the squared exponential and the exponential models. * ExponentialModel
. The associated process is continus, but not differentiable.
The Matérn and exponential models¶
[13]:
amplitude = [1.0]
scale = [1.0]
nu1, nu2, nu3 = 2.5, 1.5, 0.5
myModel1 = ot.MaternModel(scale, amplitude, nu1)
myModel2 = ot.MaternModel(scale, amplitude, nu2)
myModel3 = ot.MaternModel(scale, amplitude, nu3)
[14]:
nbTrajectories = 10
graph1 = plotCovarianceModel(myModel1,myTimeGrid,nbTrajectories)
graph2 = plotCovarianceModel(myModel2,myTimeGrid,nbTrajectories)
graph3 = plotCovarianceModel(myModel3,myTimeGrid,nbTrajectories)
[15]:
from openturns.viewer import View
import pylab as pl
fig = pl.figure(figsize=(20, 6))
ax1 = fig.add_subplot(1, 3, 1)
_ = View(graph1, figure=fig, axes=[ax1])
_ = ax1.set_title("Matern 5/2")
ax2 = fig.add_subplot(1, 3, 2)
_ = View(graph2, figure=fig, axes=[ax2])
_ = ax2.set_title("Matern 3/2")
ax3 = fig.add_subplot(1, 3, 3)
_ = View(graph3, figure=fig, axes=[ax3])
_ = ax3.set_title("Matern 1/2")
We see than, when increases, then the trajectories are smoother and smoother.
[16]:
myExpModel = ot.ExponentialModel(scale, amplitude)
[17]:
graph = plotCovarianceModel(myExpModel,myTimeGrid,nbTrajectories)
graph.setTitle("Exponential")
graph
[17]:
We see that the exponential model produces very irregular trajectories.