Estimation of a quantile upper bound by Wilks’ method¶
We consider a random variable of dimension 1 and the unknown level quantile of its distribution (). We seek to evaluate an upper bound of with a confidence greater or equal to , using a given order statistics.
Let be some independent copies of . Let be the -th order statistics of which means that is the -th maximum of for . For example, is the minimum and is the maximum. We have:
Smallest rank for an upper bound to the quantile¶
Let be an i.i.d. sample of size of the random variable . Given a quantile level , a confidence level , and a sample size , we seek the smallest rank such that:
(1)¶
The probability density and cumulative distribution functions of the order statistics are:
(2)¶
We notice that where is the cumulated distribution function of the Binomial distribution and is the complementary cumulated distribution fonction (also named survival function in dimension 1). Therefore:
and equation (1) implies:
(3)¶
This implies:
The smallest rank such that the previous equation is satisfied is:
An upper bound of is estimated by the value of on the sample .
Minimum sample size for an upper bound to the quantile¶
Given , , and , we seek for the smallest sample size such that the equation (1) is satisfied. In order to do so, we solve the equation (3) with respect to the sample size .
Once the smallest size has been estimated, a sample of size can be generated from and an upper bound of is estimated using i.e. the -th observation in the ordered sample .