Estimation of a quantile upper bound by Wilks’ method¶
We consider a random variable of dimension 1 and the unknown
level quantile of its distribution (
).
We seek to evaluate an upper bound of
with a confidence greater or equal to
, using a given order statistics.
Let be some independent copies of
.
Let
be the
-th order statistics of
which means that
is the
-th maximum of
for
. For
example,
is the minimum
and
is the maximum. We have:
Smallest rank for an upper bound to the quantile¶
Let be an i.i.d. sample of size
of
the random variable
.
Given a quantile level
, a confidence level
, and a sample size
, we seek the smallest
rank
such that:
(1)¶
The probability density and cumulative distribution functions of the order
statistics are:
(2)¶
We notice that where
is the cumulated
distribution function of the Binomial distribution
and
is the
complementary cumulated distribution fonction (also named survival function in dimension
1).
Therefore:
and equation (1) implies:
(3)¶
This implies:
The smallest rank such that the previous equation is satisfied is:
An upper bound of is estimated by the value of
on the sample
.
Minimum sample size for an upper bound to the quantile¶
Given ,
, and
, we seek for the smallest sample size
such that the equation (1) is satisfied. In order to do so, we solve the
equation (3) with respect to the sample size
.
Once the smallest size has been estimated, a sample of size
can be
generated from
and an upper bound of
is estimated using
i.e. the
-th observation
in the ordered sample
.