Pearson’s correlation test¶
This method deals with the modelling of a probability distribution of a
random vector . It
seeks to find a type of dependency (here a linear correlation) which may
exist between two components
and
.
The Pearson’s correlation coefficient , defined in
Pearson’s coefficient,
measures the strength of a linear relationship between two random
variables
and
. If we have a sample made up of
pairs
,
we denote
to be the estimated coefficient.
Even in the case where two variables and
have a
Pearson’s coefficient
equal to zero, the estimate
obtained from the sample may be non-zero:
the limited sample size does not provide the perfect image of the real
correlation. Pearson’s test nevertheless enables one to determine if the
value obtained by
is significantly
different from zero. More precisely, the user first chooses a
probability
. From this value the critical value
is calculated such that:
if
, one can conclude that the real Pearson’s correlation coefficient
is not zero; the risk of error in making this assertion is controlled and equal to
;
if
, there is insufficient evidence to reject the null hypothesis
.
An important notion is the so-called “-value” of the test. This
quantity is equal to the limit error probability
under which the null correlation hypothesis
is rejected. Thus, Pearson’s coefficient is supposed non zero if and
only if
is greater than the value
desired by the user. Note that the higher
, the more robust the decision.
(Source code
, png
)
![../../_images/pearson_test-1.png](../../_images/pearson_test-1.png)