Uncertainty ranking: SRC and SRRC
Standard Regression Coefficients (SRC) measure the influence that the random vector $X = (X_1, \ldots, X_n)$ has on a random variable $Y$ which is being studied for uncertainty. Here we attempt to measure the linear relationships that exist between $Y$ and the different components $X_i$.
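The principle of the multiple linear regression model consists in attempting to find the function that links the variable $Y$ to the $n$ variables $X_1, \ldots, X_n$ by means of a linear model:
$$Y = a_0 + \sum_{i=1}^{n} a_i X_i + \varepsilon,$$
where $\varepsilon$ describes a random variable with zero mean and standard deviation $\sigma_\varepsilon$, independent of the input variables $X_i$. If the random variables $X_1, \ldots, X_n$ are independent and with finite variance $\mathrm{Var}(X_i) = \sigma_i^2$, the variance of $Y$ can be written as follows:
$$\mathrm{Var}(Y) = \sum_{i=1}^{n} a_i^2 \sigma_i^2 + \sigma_\varepsilon^2.$$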
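From this we obtain the following coefficients:
$$\mathrm{SRC}_i = a_i \frac{\sigma_i}{\sigma}, \quad i = 1, \ldots, n,$$
where $\sigma = \sqrt{\mathrm{Var}(Y)}$ is the standard deviation of $Y$.
The estimators for the regression coefficients $a_0, \ldots, a_n$ and the standard deviation $\sigma$ are obtained from a sample of $(Y, X_1, \ldots, X_n)$. The SRC coefficients are defined as the estimators of the coefficients $\mathrm{SRC}_i$:
$$\widehat{\mathrm{SRC}}_i = \hat{a}_i \frac{\hat{\sigma}_i}{\hat{\sigma}},$$
where $\hat{a}_i$ denotes the estimate of the regression coefficient $a_i$, $\hat{\sigma}_i$ denotes the empirical standard deviation of the sample of the input variable $X_i$ and $\hat{\sigma}$ denotes the empirical standard deviation of the sample of the output variable $Y$. Under the independence assumption above, the variance decomposition implies that the absolute value of this contribution lies between 0 and 1. The closer it is to 1, the greater the impact the variable $X_i$ has on the dispersion of $Y$.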
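The estimator can be sketched in a few lines of plain Python. This is a minimal illustration rather than the implementation of any particular library: the helper names `solve_linear_system`, `fit_linear_model` and `src_coefficients` are hypothetical, the fit goes through the normal equations (reasonable only for small, well-conditioned problems), and the sample is a made-up 3 x 3 factorial design with an exactly linear output $Y = 1 + 3 X_1 + 4 X_2$.

```python
import statistics


def solve_linear_system(matrix, rhs):
    """Solve matrix * x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [row[:] + [b] for row, b in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for r in reversed(range(n)):
        tail = sum(aug[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (aug[r][n] - tail) / aug[r][r]
    return x


def fit_linear_model(inputs, output):
    """Least-squares estimates [a0, a1, ..., an] via the normal equations."""
    rows = [[1.0] + list(row) for row in inputs]  # leading 1 for the intercept
    p = len(rows[0])
    gram = [[sum(r[j] * r[k] for r in rows) for k in range(p)] for j in range(p)]
    moment = [sum(r[j] * y for r, y in zip(rows, output)) for j in range(p)]
    return solve_linear_system(gram, moment)


def src_coefficients(inputs, output):
    """Estimated SRC_i = a_i * std(X_i) / std(Y) for each input column."""
    coefficients = fit_linear_model(inputs, output)
    sigma_y = statistics.stdev(output)
    columns = list(zip(*inputs))
    return [a * statistics.stdev(col) / sigma_y
            for a, col in zip(coefficients[1:], columns)]


# Hypothetical sample: full factorial design on {-1, 0, 1}^2.
levels = [-1.0, 0.0, 1.0]
inputs = [[x1, x2] for x1 in levels for x2 in levels]
output = [1.0 + 3.0 * x1 + 4.0 * x2 for x1, x2 in inputs]

src = src_coefficients(inputs, output)
print(src)  # approximately [0.6, 0.8]
```

Because this toy output has no residual term and the two input columns are uncorrelated in the design, the squared coefficients add up to 1 (0.36 + 0.64).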
The square $\mathrm{SRC}_i^2$, which is the contribution of $X_i$ to the variance of $Y$, is sometimes described in the literature as the “importance factor”, because of the similarity between this approach to linear regression and the method of cumulative variance which uses the term importance factor.
It is a good idea to check the quality of the linear regression before estimating the SRC coefficients: if the linear regression model is a poor fit to the data, then the SRC coefficients are useless.
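One common way to check the fit is the coefficient of determination $R^2$ of the regression. Using $R^2$ is an assumption here, since the text does not prescribe a specific criterion, and the function name `r_squared_single_input` is hypothetical. The sketch below uses a single input and two made-up outputs: one exactly linear, and one symmetric parabola for which the fitted slope (and hence the SRC) is zero even though the output depends entirely on the input.

```python
import statistics


def r_squared_single_input(x, y):
    """R^2 of the least-squares line y ~ a0 + a1 * x."""
    mean_x, mean_y = statistics.fmean(x), statistics.fmean(y)
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    a1 = sxy / sxx
    a0 = mean_y - a1 * mean_x
    ss_res = sum((yi - (a0 + a1 * xi)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return 1.0 - ss_res / ss_tot


x = [-2.0, -1.0, 0.0, 1.0, 2.0]
r2_linear = r_squared_single_input(x, [1.0 + 2.0 * xi for xi in x])
r2_parabola = r_squared_single_input(x, [xi ** 2 for xi in x])

print(r2_linear)    # 1.0: the linear model explains everything
print(r2_parabola)  # 0.0: the linear model explains nothing
```

In the second case the SRC would suggest that the input has no influence, which is exactly the kind of misleading conclusion that the quality check is meant to catch.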
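Note that if there exists an affine map $g$ such that $Y = g(X)$, that is, if the linear model holds exactly with no residual term, and the inputs are independent, then the squared SRC coefficients are equal to the first-order Sobol’ indices.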
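The link with Sobol’ indices can be checked directly in the exactly linear case. The following sketch assumes independent inputs with $\mathrm{Var}(X_i) = \sigma_i^2$, no residual term, and the usual definition of the first-order Sobol’ index $S_i = \mathrm{Var}(\mathbb{E}[Y \mid X_i]) / \mathrm{Var}(Y)$, which is not stated in this section:

$$
\begin{aligned}
Y &= a_0 + \sum_{j=1}^{n} a_j X_j, \\
\mathbb{E}[Y \mid X_i] &= a_i X_i + a_0 + \sum_{j \neq i} a_j \, \mathbb{E}[X_j], \\
\mathrm{Var}\left(\mathbb{E}[Y \mid X_i]\right) &= a_i^2 \sigma_i^2, \\
S_i &= \frac{a_i^2 \sigma_i^2}{\mathrm{Var}(Y)} = \left( a_i \frac{\sigma_i}{\sqrt{\mathrm{Var}(Y)}} \right)^2.
\end{aligned}
$$

The last expression is the square of the theoretical SRC coefficient of $X_i$, and since $\mathrm{Var}(Y) = \sum_i a_i^2 \sigma_i^2$ in this case, these indices sum to 1.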
Standard Rank Regression Coefficients (SRRC) are SRC coefficients computed on the ranked input variables $rX = (rX_1, \ldots, rX_n)$ and the ranked output variable $rY$. They are useful when the relationship between $Y$ and $X$ is not linear (so SRC cannot be used), but only monotonic.
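The effect of ranking can be illustrated with a single input, for which the SRC reduces to the fitted slope times the ratio of standard deviations. This is a minimal sketch under stated assumptions: the helper names `average_ranks` and `src_single_input` are hypothetical, ties are given the average of their ranks (one common convention, not one imposed by the text), and the data are a made-up monotonic but strongly nonlinear output $Y = \exp(X)$.

```python
import math
import statistics


def average_ranks(values):
    """Ranks starting at 1; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while (end + 1 < len(order)
               and values[order[end + 1]] == values[order[start]]):
            end += 1
        shared = (start + end) / 2.0 + 1.0
        for k in range(start, end + 1):
            ranks[order[k]] = shared
        start = end + 1
    return ranks


def src_single_input(x, y):
    """SRC for one input: least-squares slope times std(x) / std(y)."""
    mean_x, mean_y = statistics.fmean(x), statistics.fmean(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    return (sxy / sxx) * statistics.stdev(x) / statistics.stdev(y)


x = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
y = [math.exp(v) for v in x]  # monotonic, far from linear

src = src_single_input(x, y)
srrc = src_single_input(average_ranks(x), average_ranks(y))

print(src)   # noticeably below 1 because the relationship is not linear
print(srrc)  # 1.0 because the relationship is perfectly monotonic
```

With several inputs the same idea applies: each column and the output are replaced by their ranks before the multiple regression is fitted.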