Sensitivity analysis using Sobol’ indices

Consider the input random vector \inputRV = \left( X_1,\ldots,X_\inputDim \right) and let \vect{Y} = \left( Y_1,\ldots,Y_{d'} \right) be the output of the physical model:

\vect{y} = \model(\vect{x}).

We consider the output Y_k for any index k \in \{1, \ldots, d'\}. Sobol’ indices measure the influence of the input \inputRV to the output Y_k. The method considers the part of the variance of the output Y_k produced by the different inputs X_i.

In the first part of this document, we introduce the Sobol’ indices of a scalar output Y_k. Hence, the model is simplified to:

Y = \metaModel(\vect{x}).

In the second part of the document, we consider the general case where the output is multivariate. In this case, aggregated Sobol’ indices can be used [gamboa2013].

The Sobol’ decomposition is described more easily when the domain of the input is the unit interval [0,1]^\inputDim. It can be easily extended to any input domain using expectations, variances and variance of conditional expectations.

We assume that the input marginal variables X_1,\ldots,X_\inputDim are independent. This restrictive hypothesis implies that the only copula of the input random vector \bdX for which the Sobol’ indices are easy to interpret is the independent copula. If the input variables are dependent, then the Sobol’ indices can be defined, but some of their properties are lost.

Partition of the input

For any i\in\{1,\ldots, d\}, let \bdx_{\overline{\{i\}}} \in [0,1]^{d - 1} be the vector made of components of \bdx=(x_1,x_2, \ldots,x_p)\in [0,1]^{d } which indices are different from i. Hence, if \bdx\in[0,1]^\inputDim, then:

\bdx_{\overline{\{i\}}} = (x_1,x_2,\ldots,x_{i-1},x_{i+1},\ldots,x_p)^T\in [0,1]^{d - 1}.

Consider the function \operatorname{g} defined by the equation:

y = \operatorname{g}(\bdx)

where \bdx=(x_1,\ldots,x_p)^T \in [0,1]^\inputDim. With this notation, we can partition the input of g:

\operatorname{g}(\bdx) = \operatorname{g} \left(x_i,\bdx_{\overline{\{i\}}} \right).

The goal of sensitivity analysis is to measure the sensitivity of the variance of the output Y depending on the variable X_i. This may take into account the dependence of the output to the interactions of X_i and \bdX_{\overline{\{i\}}} through the function g.

More generally, let \bdu \subseteq \{1,2,\ldots,d\} be a group of variables. Therefore:

\operatorname{g}(\bdx) = \operatorname{g} \left(\bdx_\bdu,\bdx_{\overline{\bdu}} \right).

The goal of sensitivity analysis is to measure the sensitivity of the variance of the output Y depending on the group of variables \bdX_\bdu. This may take into account the dependence of the output to the interactions of \bdX_\bdu and \bdX_{\bdu} through the function g.

Sobol’ decomposition

In this section, we introduce the Sobol’-Hoeffding decomposition [sobol1993]. If \operatorname{g} can be integrated in [0,1]^\inputDim, then there is a unique decomposition:

y &= h_0 + \sum_{i=1,2,\ldots,d} h_{\{i\}}(x_i)
     \quad + \sum_{1\leq i < j \leq d} h_{\{i,j\}}(x_i,x_j) \nonumber \\
  & \quad+ \ldots +
         h_{\{1,2,\ldots,d\}}(x_1,x_2,\ldots,x_p),

where h_0 is a constant and the functions of the decomposition satisfy the equalities:

\int_0^1 h_{\{i_1,\ldots,i_s\}}(x_{i_1},\ldots,x_{i_s})dx_{i_k} = 0,

for any k=1,2,\ldots,s and any indices 1\leq i_1< i_2< \ldots< i_s\leq d and s=1,2,\ldots,d.

Extension to any input distribution with independent marginals

In this section, we extend the previous definitions to an input random vector that is not necessarily defined on the input unit cube [0,1]^\inputDim. To do this, we define the functions h_\bdu using conditional expectations.

The functions h_\bdu satisfy the equality:

\int_{[0,1]^{|\overline{\bdu}|}} \operatorname{g}(\bdx) d\bdx_{\overline{\bdu}}
= \sum_{\bdv \subseteq \bdu} h_\bdv(\bdx_\bdv),

for any group of variables \bdu \subseteq \{1,2,\ldots,\inputDim\} with size lower or equal to d, where |\overline{\bdu}| is the cardinal of the subset \overline{\bdu}. The functions h_\bdu can be defined recursively, using groups of variables of lower dimensionality:

h_\bdu(\bdx_\bdu)
= \int_{[0,1]^{|\overline{\bdu}|}} \operatorname{g}(\bdx_\bdu,\bdx_{\overline{\bdu}}) d\bdx_{\overline{\bdu}}
-  \sum_{\bdv \subsetneq \bdu} h_\bdv(\bdx_\bdv)

where \subsetneq denotes a proper subset. Let \boldsymbol{x} \in [0,1]^\inputDim be a point and let \bdu \subseteq \{1, \ldots, d\} be a group of variables. Therefore:

\Expect{Y|\bdX_\bdu=\bdx_\bdu}
= \sum_{\bdv \subseteq \bdu} h_\bdv(\bdx_\bdv).

The Möbius inversion formula implies (see [daveiga2022] Theorem 3.3 page 49):

h_\bdu(\bdx_\bdu)
= \sum_{\bdv \subseteq \bdu} (-1)^{|\bdu| - |\bdv|} \Expect{Y|\bdX_\bdv=\bdx_\bdv}.

The previous equation is a consequence of the Möbius inversion formula [rota1964] (also called the exclusion-inclusion principle).

Decomposition of the variance

The variance of the function \operatorname{g} can be decomposed into:

\Var{Y}=\sum_{i=1}^\inputDim V_{i}
+ \sum_{1\leq i < j\leq d} V_{\{i,j\}} + \ldots + V_{\{1,2,\ldots,\inputDim\}}

where the interaction variances are:

V_{i}        &= \Var{h_{\{i\}}(X_i)}, \label{eq-sde-varvi1-2} \\
V_{\{i, j\}}  &= \Var{h_{\{i, j\}}(X_i, X_j)}, \\
V_{\{i, j, k\}} &= \Var{h_{i, j, k}(X_i, X_j, X_k)}, \\
\ldots       & \\
V_{\{1, 2, \ldots, d\}} &= \Var{h_{\{1, 2, \ldots, d\}}(X_1, X_2, \ldots, X_p)}.

More generally, the interaction variance of a group of variables is:

V_\bdu = \Var{h_\bdu(\bdx_\bdu)},

for any \bdu \subseteq \{1,2,\ldots,d\}. Using the Hoeffding decomposition, we get:

\Var{Y} = \sum_{ \bdu \subseteq \{1, \ldots, d\} } V_\bdu.

The Möbius inversion formula implies (see [daveiga2022] corollary 3.5 page 52):

V_\bdu = \sum_{\bdv \subseteq \bdu} (-1)^{ |\bdu| - |\bdv| } \Var{\Expect{ Y \vert \mat{X}_\bdv} }.

Interaction sensitivity index of a variable

The first order interaction sensitivity indices are equal to:

S_i           &= \frac{V_{i}}{\Var{Y}} , \\
S_{\{i,j\}}   &= \frac{V_{\{i,j\}}}{\Var{Y}} , \\
S_{\{i,j,k\}} &= \frac{V_{\{i,j,k\}}}{\Var{Y}} , \\
\ldots & \\
S_{\{i_1,i_2,\ldots,i_s\}} &= \frac{V_{\{i_1,i_2,\ldots,i_s\}}}{\Var{Y}}, \\
\ldots & \\
S_{\{1,2,\ldots,p\}} &= \frac{V_{\{1,2,\ldots,p\}}}{\Var{Y}}.

The first order Sobol’ index S_i measures the part of the variance of Y explained by X_i alone. The second order Sobol’ index S_{i,j} measures the part of the variance of Y explained by the interaction of X_i and X_j.

More generally, the first order interaction Sobol’ index of a group of variables \bdu is:

S_\bdu = \frac{V_\bdu}{\Var{Y}} = \frac{\Var{h_\bdu(\bdX_\bdu)}}{\Var{Y}}.

where h_\bdu is the function of the input variables in the group \bdu of the functional Sobol’-Hoeffding ANOVA decomposition of the physical model. This index measures the sensitivity of the variance of the output explained by interactions within the group.

Total interaction sensitivity index of a group of variables

The total interaction sensitivity index of the group \bdu is (see [liu2006] eq. 8 page 714 where it is named “superset importance”):

S^{T,i}_\bdu = \sum_{\bdv \supseteq \bdu} S_{\bdv}

This index measures the sensitivity of the variance of the output explained by interactions within the group and groups of variables containing it.

First order Sobol’ sensitivity index of a variable

The first order Sobol’ sensitivity index is equal to the corresponding interaction index of the group \{i\}:

S_i &= S_{\{i\}}

for i=1,\ldots, d. The first order Sobol’ index S_i measures the sensitivity of the output variance explained by the effect of X_i alone. We can alternatively define the first order Sobol’ sensitivity index using the variance of a conditional expectation. The first order Sobol’ sensitivity index satisfies the equation:

S_i &= \frac{\Var{\Expect{Y|X_i}}}{\Var{Y}}

for i=1,\ldots, d.

Total sensitivity index of a variable

The total Sobol’ sensitivity index is:

S^T_i &= \frac{V_{i} + \sum_{\substack{j\in\{1,\ldots, d\}\\j\neq i}} V_{\{i,j\}} + \ldots
V_{1, 2,\ldots, d}}{\Var{Y}}

for i=1,\ldots, d. The total Sobol’ sensitivity index can be equivalently defined in terms of the variance of a conditional expectation. The total Sobol’ sensitivity index satisfies the equation:

S^T_i &= 1 - \frac{\Var{\Expect{Y|X_{\overline{\{i\}}}}}}{\Var{Y}}

for i=1,\ldots, d. For any i=1,\ldots,d, let us define

V_i^T   & = \sum_{\bdu \ni i} V_\bdu \\
V_{-i} & = \Var{ \Expect{Y \vert X_1, \ldots, X_{i-1}, X_{i+1}, \ldots X_\inputDim} }.

Total Sobol’ indices satisfy the equality:

S_i^T = \frac{V_i^T}{\Var{Y}} = 1 - \frac{V_{-i}}{\Var{Y}}

for i=1,\ldots,d.

The total Sobol’ index S_i^T measures the part of the variance of Y explained by X_i and its interactions with other input variables. It can also be viewed as the part of the variance of Y that cannot be explained without X_i.

First order closed sensitivity index of a group of variables

Let \bdu \subseteq \{1, \ldots, d\} be a group of input variables. The first order (closed) Sobol’ index of a group of input variables \bdu is:

S_{\bdu}^{\operatorname{cl}}
= \frac{\Var{\Expect{Y|\vect{X}_{\bdu}}}}{\Var{Y}}

The first order closed Sobol’ index of a group of input variables \bdu measures the sensitivity of the variance of Y explained by the variables within the group. This index is useful when the group contains random variables parameterizing a single uncertainty source (see [knio2010] page 139).

Total sensitivity index of a group of variables

The total Sobol’ index of a group of variables \bdu is:

S^T_\bdu
= \frac{\sum_{\bdv\cap\bdu\neq\emptyset} \Var{h_\bdv(\bdX_\bdv)}}{\Var{Y}}

where h_\bdv is the function of the variables in the group \bdv of the functional Sobol’-Hoeffding ANOVA decomposition of the physical model. The total Sobol’ index of a group of input variables \bdu measures the sensitivity of the variance of Y explained by the variables within the group and any group of variables containing any variable in the group. It can also be viewed as the part of the variance of Y that cannot be explained without X_\bdu.

For any group of variables \bdu, the total and first order (closed) Sobol’ indices are related by the equation:

S^T_\bdu + S_{\overline{\bdu}}^{\operatorname{cl}} = 1

where \overline{\bdu} is the complementary group of \bdu.

Summary of Sobol’ indices

The next table presents a summary of the 6 different Sobol’ indices that we have presented.

Single variable or group

Sensitivity Index

Formula

One single variable i

First order

S_i = \frac{\Var{\Expect{Y|X_i}}}{\Var{Y}}= \frac{V_i}{\Var{Y}}

Total

S^T_i = \sum_{\bdu \ni i} S_\bdu = 1 - S_{\overline{\{i\}}}^{\operatorname{cl}}

Interaction of a group \bdu

First order

S_\bdu = \frac{V_\bdu}{\Var{Y}}

Total interaction

S^{T,i}_\bdu = \sum_{\bdv \supseteq \bdu} S_{\bdv}

Group (closed) \bdu

First order closed

S_\bdu^{\operatorname{cl}} = \frac{\Var{\Expect{Y|\bdX_\bdu}}}{\Var{Y}} = \sum_{\bdv \subseteq \bdu} S_\bdv

Total

S^T_\bdu = \frac{\sum_{\bdv\cap\bdu\neq\emptyset} V_\bdv}{\Var{Y}} = 1 - S_{\overline{\bdu}}^{\operatorname{cl}}

Table 1. First order and total Sobol’ indices of a single variable i or a group \bdu.

Let us summarize the properties of the Sobol’ indices.

  • All these indices are in the [0, 1] interval.

  • The sum of interaction first order Sobol’ indices is equal to 1:

\sum_{\bdu \subseteq \{1,2,\ldots,d\}} S_\bdu = 1.

  • Each first order index is lower than its total counterpart:

S_\bdu & \leq S^{T,i}_\bdu \\
S_i & \leq S^T_i \\
S_\bdu^{\operatorname{cl}} & \leq S^T_\bdu

  • If S_i < S^T_i, there are interactions between the variable X_i and other variables.

  • If S_i = S^T_i for i = 1, \ldots, d, then the function is additive, i.e. the function g is the sum of functions g_1, \ldots, g_\inputDim of input dimension 1:

Y = \sum_{i = 1}^\inputDim g_i(X_i).

Example

Let us consider a function g which has d = 3 inputs (X_1, X_2, X_3). The full set of interaction indices is:

S_1, \; S_2, \; S_3, \; S_{\{1, 2\}}, \; S_{\{1, 3\}}, \; S_{\{2, 3\}},
\; S_{\{1, 2, 3\}}.

Each Sobol’ index combines a subset of the previous interaction indices. For example, the first and total Sobol’ indices are presented in the next table.

Variable

First order

Total

X_1

S_1

S_1^T = S_1 + S_{1,2} + S_{1,3} + S_{1,2,3}

X_2

S_2

S_2^T = S_2 + S_{1,2} + S_{2,3} + S_{1,2,3}

X_3

S_3

S_3^T = S_3 + S_{1,3} + S_{2,3} + S_{1,2,3}

Table 2. First order and total Sobol’ indices of the variables X_1, X_2 and X_3.

The list of possible groups is \{1,2\}, \{1,3\}, \{2,3\} and \{1,2,3\}. The next table presents the Sobol’ indices of the group \bdu = \{1, 2\}.

Sobol’ index of group \bdu = \{1, 2\}

Value

Group interaction

S_{\{1,2\}}

Group total interaction

S_{\{1,2\}}^{T,i} = S_{\{1,2\}} + S_{\{1, 2, 3\}}

Group first order (closed)

S_{\{1,2\}}^{\operatorname{cl}} = S_{1} + S_{2} + S_{\{1, 2\}}

Group total

S_{\{1,2\}}^T = S_1 + S_2 + S_{\{1, 2\}} + S_{\{1, 2, 3\}}

Table 3. Sobol’ indices of the group \bdu = \{1, 2\}.

Aggregated Sobol’ indices

For multivariate outputs i.e. when d'>1, the Sobol’ indices can be aggregated [gamboa2013]. Let V_i^{(k)} be the (first order) variance of the conditional expectation of the k-th output Y_k:

V_i^{(k)} & = \Var{ \Expect{Y_k \vert X_i} }

for i=1,\ldots,d and k=1,\ldots,d'. Similarly, let V_i^{(T, k)} be the total variance of the conditional expectation of Y^{(k)} for i = 1, \ldots, d and k = 1, \ldots, d'.

The indices can be aggregated with the following formulas:

S_i^{(a)}  & =  \frac{ \sum_{k=1}^{d'} V_{i}^{(k)} }{ \sum_{k=1}^{d'} \Var{Y_k} }  \\
S_i^{(T, a)} & =  \frac{ \sum_{k=1}^{d'} VT_{i}^{(k)} }{ \sum_{k=1}^{d'} \Var{Y_k} }

for i=1,\ldots,d.

Estimators

To estimate these quantities, Sobol’ proposes to use numerical methods that rely on two independent realizations of the random vector \vect{X}. This is known as the pick-freeze estimator.

Let N \in \Nset be the size of each sample. Let \mat{A} and \mat{B} be two independent samples of size N of \vect{X}:

\mat{A} = \left(
\begin{array}{cccc}
a_{1,1} & a_{1,2} & \cdots & a_{1, d} \\
a_{2,1} & a_{2,2} & \cdots & a_{2, d} \\
\vdots  & \vdots  & \ddots  & \vdots \\
a_{N,1} & a_{1,2} & \cdots & a_{N, d}
\end{array}
\right), \  \mat{B} = \left(
\begin{array}{cccc}
b_{1,1} & b_{1,2} & \cdots & b_{1, d} \\
b_{2,1} & b_{2,2} & \cdots & b_{2, d} \\
\vdots  & \vdots  & \vdots  & \vdots \\
b_{N,1} & b_{1,2} & \cdots & b_{N, d}
\end{array}
\right)

Each line is a realization of the random vector \vect{X}.

We are now going to mix these two samples to get an estimate of the sensitivity indices.

\mat{E}^i = \left(
\begin{array}{cccccc}
a_{1,1} & a_{1,2} & \cdots & b_{1,i} & \cdots & a_{1, d} \\
a_{2,1} & a_{2,2} & \cdots & b_{2,i} & \cdots & a_{2, d} \\
\vdots  & \vdots  &        & \vdots  & \ddots & \vdots \\
a_{N,1} & a_{1,2} & \cdots & b_{N,i} & \cdots & a_{N, d}
\end{array}
\right), \;
\mat{C}^i = \left(
\begin{array}{cccccc}
b_{1,1} & b_{1,2} & \cdots & a_{1,i} & \cdots & b_{1, d} \\
b_{2,1} & b_{2,2} & \cdots & a_{2,i} & \cdots & b_{2, d} \\
\vdots  & \vdots  &        & \vdots  & \ddots  & \vdots \\
b_{N,1} & b_{1,2} & \cdots & a_{N,i} & \cdots & b_{N, d}
\end{array}
\right)

Several estimators of V_i, V_i^T and V_{-i} are provided by the SobolIndicesAlgorithm implementations:

Specific formulas for \widehat{V}_i, \widehat{VT}_i and \widehat{V}_{-i} are given in the corresponding documentation pages.

The estimator \widehat{V}_{i,j} of V_{i,j} is the same for all these classes:

\widehat{V}_{i,j} = \frac{1}{N-1} \sum_{k=1}^{N} \tilde{g}(\vect{E}_k^i) \tilde{g}(\vect{C}_k^j) - \frac{1}{N} \sum_{k=1}^{N} \tilde{g}(\vect{A}_k) \tilde{g}(\vect{B}_k) - \widehat{V}_i - \widehat{V}_j.

Notice that the value of the second order conditional variance depends on the estimators \widehat{V}_i and \widehat{V}_j which are method-dependent. This implies that the value of the second order indices may depend on the specific Sobol’ estimator we use.

Centering the output

For the sake of stability, computations are performed with centered output. Let \overline{\vect{g}} be the mean of the combined samples \vect{g}(\mat{A}) and \vect{g}(\mat{B}). Let \tilde{\vect{g}} be the empirically centered function defined, for any \vect{x} \in \Rset^\inputDim, by:

\tilde{\vect{g}}(\vect{x}) = \vect{g}(\vect{x}) - \overline{\vect{g}}.

To estimate the total variance \Var{Y}, we use the computeVariance() method of the Sample \tilde{g}(\mat{A}).