GeneralizedParetoFactory¶
(Source code
, png
)
- class GeneralizedParetoFactory(*args)¶
Generalized Pareto factory.
See also
Notes
Several estimators are available to build a
GeneralizedPareto
distribution from a scalar sample (see [matthys2003] for the theory).The chosen algorithm depends on the size of the sample compared to the
ResourceMap
key GeneralizedParetoFactory-SmallSize:When the size of the sample is small the probability weighted method of moments is used, with a fallback on the exponential regression method
When the size of the sample is large the exponential regression method is used, with a fallback on the probability weighted moment method
Moments based estimator:
Lets denote:
the empirical mean of the sample,
its empirical variance.
Then we estimate using:
(1)¶
This estimator is well-defined only if , otherwise the second moment does not exist.
Probability weighted moments based estimator:
Lets denote:
the sample sorted in ascending order
Then we estimate using:
(2)¶
This estimator is well-defined only if , otherwise the first moment does not exist.
Maximum likelihood based estimator:
These estimators are not yet implemented.
For a given , we get by maximizing the likelihood of the sample :
The threshold is obtained by maximizing the optimal likelihood
The initial point of the optimisation problem is .
Exponential regression based estimator:
Lets denote:
for
Then we estimate using:
(3)¶
Where maximizes:
(4)¶
under the constraint .
The following
ResourceMap
entries can be used to tweak the parameters of the optimization solver involved in the different estimators:GeneralizedParetoFactory-DefaultOptimizationAlgorithm
GeneralizedParetoFactory-MaximumEvaluationNumber
GeneralizedParetoFactory-MaximumAbsoluteError
GeneralizedParetoFactory-MaximumRelativeError
GeneralizedParetoFactory-MaximumObjectiveError
GeneralizedParetoFactory-MaximumConstraintError
Methods
build
(*args)Build the distribution.
buildAsGeneralizedPareto
(*args)Build the distribution as a GeneralizedPareto type.
buildEstimator
(*args)Build the distribution and the parameter distribution.
Build the distribution based on the exponential regression estimator.
buildMethodOfMoments
(sample)Build the distribution based on the method of moments estimator.
Build the distribution based on the probability weighted moments estimator.
drawMeanResidualLife
(sample)Draw the mean residual life plot.
Accessor to the bootstrap size.
Accessor to the object's name.
getId
()Accessor to the object's id.
getName
()Accessor to the object's name.
Accessor to the solver.
Accessor to the object's shadowed id.
Accessor to the object's visibility state.
hasName
()Test if the object is named.
Test if the object has a distinguishable name.
setBootstrapSize
(bootstrapSize)Accessor to the bootstrap size.
setName
(name)Accessor to the object's name.
setOptimizationAlgorithm
(solver)Accessor to the solver.
setShadowedId
(id)Accessor to the object's shadowed id.
setVisibility
(visible)Accessor to the object's visibility state.
- __init__(*args)¶
- build(*args)¶
Build the distribution.
Available usages:
build()
build(sample)
build(param)
- Parameters:
- sample2-d sequence of float, of dimension 1
The sample from which the distribution parameters are estimated.
- paramsequence of float
The parameters of the
GeneralizedPareto
.
- Returns:
- dist
Distribution
The built distribution.
- dist
Notes
In the first usage, the default
GeneralizedPareto
distribution is built.In the second usage, the parameters are evaluated according the following strategy:
If the sample size is less or equal to GeneralizedParetoFactory-SmallSize from
ResourceMap
, then the method of probability weighted moments is used. If it fails, the method of exponential regression is used.Otherwise, the first method tried is the method of exponential regression, then the method of probability weighted moments if the first one fails.
In the third usage, a
GeneralizedPareto
distribution corresponding to the given parameters is built.
- buildAsGeneralizedPareto(*args)¶
Build the distribution as a GeneralizedPareto type.
Available usages:
build()
build(sample)
build(param)
- Parameters:
- sample2-d sequence of float, of dimension 1
The sample from which the distribution parameters are estimated.
- paramsequence of float,
A vector of parameters of the
GeneralizedPareto
.
- Returns:
- dist
GeneralizedPareto
The estimated distribution as a GeneralizedPareto.
In the first usage, the default GeneralizedPareto distribution is built.
- dist
- buildEstimator(*args)¶
Build the distribution and the parameter distribution.
- Parameters:
- sample2-d sequence of float
Data.
- parameters
DistributionParameters
Optional, the parametrization.
- Returns:
- resDist
DistributionFactoryResult
The results.
- resDist
Notes
According to the way the native parameters of the distribution are estimated, the parameters distribution differs:
Moments method: the asymptotic parameters distribution is normal and estimated by Bootstrap on the initial data;
Maximum likelihood method with a regular model: the asymptotic parameters distribution is normal and its covariance matrix is the inverse Fisher information matrix;
Other methods: the asymptotic parameters distribution is estimated by Bootstrap on the initial data and kernel fitting (see
KernelSmoothing
).
If another set of parameters is specified, the native parameters distribution is first estimated and the new distribution is determined from it:
if the native parameters distribution is normal and the transformation regular at the estimated parameters values: the asymptotic parameters distribution is normal and its covariance matrix determined from the inverse Fisher information matrix of the native parameters and the transformation;
in the other cases, the asymptotic parameters distribution is estimated by Bootstrap on the initial data and kernel fitting.
- buildMethodOfExponentialRegression(sample)¶
Build the distribution based on the exponential regression estimator.
- Parameters:
- sample2-d sequence of float, of dimension 1
The sample from which the distribution parameters are estimated.
- buildMethodOfMoments(sample)¶
Build the distribution based on the method of moments estimator.
- Parameters:
- sample2-d sequence of float, of dimension 1
The sample from which the distribution parameters are estimated.
- buildMethodOfProbabilityWeightedMoments(sample)¶
Build the distribution based on the probability weighted moments estimator.
- Parameters:
- sample2-d sequence of float, of dimension 1
The sample from which the distribution parameters are estimated.
- drawMeanResidualLife(sample)¶
Draw the mean residual life plot.
Let a random variable defined so the excess of a threshold follow a Generalized Pareto distribution . The mean of excesses of for is
So for all M_n(u) is linear wrt. . The threshold is the smallest value of for which the curve is linear.
The quantity is estimater par the empirical estimator of the mean:
The quantity is asymptotically Gaussian with mean and variance
- Parameters:
- sample2-d sequence of float, of dimension 1
The sample from which the distribution parameters are estimated.
- Returns:
- graph
Graph
The graph of and its confidence interval.
- graph
Notes
The confidence level can be set using the
ResourceMap
key GeneralizedParetoFactory-MeanResidualLifeConfidenceLevel The number of threshold points in the graph can be set with the key GeneralizedParetoFactory-MeanResidualLifePointNumber.
- getBootstrapSize()¶
Accessor to the bootstrap size.
- Returns:
- sizeinteger
Size of the bootstrap.
- getClassName()¶
Accessor to the object’s name.
- Returns:
- class_namestr
The object class name (object.__class__.__name__).
- getId()¶
Accessor to the object’s id.
- Returns:
- idint
Internal unique identifier.
- getName()¶
Accessor to the object’s name.
- Returns:
- namestr
The name of the object.
- getOptimizationAlgorithm()¶
Accessor to the solver.
- Returns:
- solver
OptimizationAlgorithm
The solver used for numerical optimization of the likelihood.
- solver
- getShadowedId()¶
Accessor to the object’s shadowed id.
- Returns:
- idint
Internal unique identifier.
- getVisibility()¶
Accessor to the object’s visibility state.
- Returns:
- visiblebool
Visibility flag.
- hasName()¶
Test if the object is named.
- Returns:
- hasNamebool
True if the name is not empty.
- hasVisibleName()¶
Test if the object has a distinguishable name.
- Returns:
- hasVisibleNamebool
True if the name is not empty and not the default one.
- setBootstrapSize(bootstrapSize)¶
Accessor to the bootstrap size.
- Parameters:
- sizeinteger
The size of the bootstrap.
- setName(name)¶
Accessor to the object’s name.
- Parameters:
- namestr
The name of the object.
- setOptimizationAlgorithm(solver)¶
Accessor to the solver.
- Parameters:
- solver
OptimizationAlgorithm
The solver used for numerical optimization of the likelihood.
- solver
- setShadowedId(id)¶
Accessor to the object’s shadowed id.
- Parameters:
- idint
Internal unique identifier.
- setVisibility(visible)¶
Accessor to the object’s visibility state.
- Parameters:
- visiblebool
Visibility flag.
Examples using the class¶
Fit an extreme value distribution