Note

Go to the end to download the full example code.

Select fitted distributions¶

In this example help to make a choice between several distributions fitted to a sample.

Several methods can be used:

the ranking by the Kolmogorov p-values (for continuous distributions),
the ranking by the ChiSquared p-values (for discrete distributions),
the ranking by BIC values.

import openturns as ot
import openturns.viewer as otv

Create a sample from a continuous distribution

distribution = ot.Beta(2.0, 2.0, 0.0, 1.0)
sample = distribution.getSample(1000)
graph = ot.UserDefined(sample).drawCDF()
view = otv.View(graph)

1. Specify the model only

Create the list of distribution estimators

factories = [ot.BetaFactory(), ot.TriangularFactory()]

Rank the continuous models by the Lilliefors p-values:

estimated_distribution, test_result = ot.FittingTest.BestModelLilliefors(
    sample, factories
)
test_result

class=TestResult name=Unnamed type=Lilliefors Beta binaryQualityMeasure=true p-value threshold=0.5 p-value=0.772489 statistic=0.0152829 description=[Beta(alpha = 1.82673, beta = 1.69512, a = 0.0117009, b = 0.983794) vs sample Beta]

Rank the continuous models wrt the BIC criteria (no test result):

ot.FittingTest.BestModelBIC(sample, factories)

[class=Beta name=Beta dimension=1 alpha=1.82673 beta=1.69512 a=0.0117009 b=0.983794, -0.1828338429879409]

Rank the continuous models wrt the AIC criteria (no test result)

ot.FittingTest.BestModelAIC(sample, factories)

[class=Beta name=Beta dimension=1 alpha=1.82673 beta=1.69512 a=0.0117009 b=0.983794, -0.20246486410386944]

Rank the continuous models wrt the AICc criteria (no test result):

ot.FittingTest.BestModelAICC(sample, factories)

[class=Beta name=Beta dimension=1 alpha=1.82673 beta=1.69512 a=0.0117009 b=0.983794, -0.2024246630988443]

2. Specify the model and its parameters

Create a collection of the distributions to be tested

distributions = [ot.Beta(2.0, 2.0, 0.0, 1.0), ot.Triangular(0.0, 0.5, 1.0)]

Rank the continuous models by the Kolmogorov p-values:

estimated_distribution, test_result = ot.FittingTest.BestModelKolmogorov(
    sample, distributions
)
test_result

class=TestResult name=Unnamed type=Kolmogorov Beta binaryQualityMeasure=false p-value threshold=0.05 p-value=0.029146 statistic=0.0458091 description=[Beta(alpha = 2, beta = 2, a = 0, b = 1) vs sample Beta]

Rank the continuous models wrt the BIC criteria:

ot.FittingTest.BestModelBIC(sample, distributions)

[class=Beta name=Beta dimension=1 alpha=2 beta=2 a=0 b=1, -0.21016253990533787]

Rank the continuous models wrt the AIC criteria:

ot.FittingTest.BestModelAIC(sample, distributions)

[class=Beta name=Beta dimension=1 alpha=2 beta=2 a=0 b=1, -0.21016253990533787]

Rank the continuous models wrt the AICc criteria:

ot.FittingTest.BestModelAICC(sample, distributions)

[class=Beta name=Beta dimension=1 alpha=2 beta=2 a=0 b=1, -0.21016253990533787]

Discrete distributions

Create a sample from a discrete distribution

distribution = ot.Poisson(2.0)
sample = distribution.getSample(1000)
graph = ot.UserDefined(sample).drawCDF()
view = otv.View(graph)

Create the list of distribution estimators

distributions = [ot.Poisson(2.0), ot.Geometric(0.1)]

Rank the discrete models with respect to the ChiSquared p-values:

ot.ResourceMap.SetAsBool("FittingTest-ChiSquaredCheckSample", False)
estimated_distribution, test_result = ot.FittingTest.BestModelChiSquared(
    sample, distributions
)
test_result

class=TestResult name=Unnamed type=ChiSquared Poisson binaryQualityMeasure=true p-value threshold=0.05 p-value=0.205524 statistic=8.47196 description=[Poisson(lambda = 2) vs sample Poisson]

Rank the discrete models with respect to the BIC criteria:

ot.FittingTest.BestModelBIC(sample, distributions)
otv.View.ShowAll()

OpenTURNS

An Open source initiative for the Treatment of Uncertainties, Risks'N Statistics

Previous topic

Next topic

This Page

Select fitted distributions¶