Distribution Fitting Software & Articles

Extreme Value Distributions

EasyFit: select the best fitting distribution and use it to make better decisions. learn more
EasyFit Screenshot - Click To Enlarge
Download Free Trial

The problem of modeling extreme or rare events arises in many areas where such events can have very negative consequences. Some examples of rare events include extreme floods and snowfalls, high wind speeds, extreme temperatures, large fluctuations in exchange rates, and market crashes. To develop appropriate probabilistic models and assess the risks caused by these events, business analysts and engineers frequently use the extreme value distributions (EVD).

Contents
Extreme Value Theory
Gumbel Distribution (EVD Type I)
Fréchet Distribution (EVD Type II)
Weibull Distribution (EVD Type III)
Generalized Extreme Value Distribution
Using Extreme Value Distributions in EasyFit

Extreme Value Theory

Extreme value theory is a separate branch of statistics that deals with extreme events. This theory is based on the extremal types theorem, also called the three types theorem, stating that there are only three types of distributions that are needed to model the maximum or minimum of the collection of random observations from the same distribution.

In other words, if you generate N data sets from the same distribution, and create a new data set that includes the maximum values from these N data sets, the resulting data set can only be described by one of the three models - specifically, the Gumbel, Fréchet, and Weibull distributions.

These models, along with the Generalized Extreme Value distribution, are widely used in risk management, finance, insurance, economics, hydrology, material sciences, telecommunications, and many other industries dealing with extreme events.

Gumbel Distribution

Emil Gumbel

One of the first scientists to apply the theory was a German mathematician Emil Gumbel (1891-1966). Gumbel's focus was primarily on applications of extreme value theory to engineering problems, in particular modeling of meteorological phenomena such as annual flood flows:

"It seems that the rivers know the theory. It only remains to convince the engineers of the validity of this analysis."

The Gumbel distribution, also known as the Extreme Value Type I distribution, is unbounded (defined on the entire real axis), and has the following probability density function:

Gumbel Distribution PDF
 

where z=(x-μ)/σ, μ is the location parameter, and σ is the distribution scale (σ>0). The shape of the Gumbel model does not depend on the distribution parameters:

Gumbel Distribution PDF
 

The graph above shows the Gumbel PDF for σ=1 and μ=0.

Fréchet Distribution

Maurice Fréchet

Maurice Fréchet (1878-1973) was a French mathematician who had identified one possible limit distribution for the largest order statistic in 1927. The Fréchet distribution, also known as the Extreme Value Type II distribution, is defined as

Fréchet Distribution PDF
 

where α is the shape parameter (α>0), and β is the scale parameter (β>0). This distribution is bounded on the lower side (x>0) and has a heavy upper tail. The animation below shows the Fréchet PDF graph for β=1 and various values of α:

Fréchet Distribution PDF
 

Weibull Distribution

Waloddi Weibull

Waloddi Weibull (1887-1979) was a Swedish engineer and scientist well-known for his work on strength of materials and fatigue analysis. The Weibull distribution, also known as the Extreme Value Type III distribution, first appeared in his papers in 1939. The two-parameter version of this distribution has the density function

Two-Parameter Weibull Distribution PDF
 

The Weibull distribution is defined for x>0, and both distribution parameters (α - shape, β - scale) are positive. The two-parameter Weibull distribution can be generalized by adding the location (shift) parameter γ:

Three-Parameter Weibull Distribution PDF
 

In this model, the location parameter γ can take on any real value, and the distribution is defined for x>γ.

Even though the Weibull distribution was originally developed to address the problems arising in material sciences, it is widely used in many other areas thanks to its flexibility. When α=1, this distribution reduces to the Exponential model, and when α=2, it mimics the Rayleigh distribution which is mainly used in telecommunications. In addition, it resembles the Normal distribution when α=3.5:

Weibull Distribution PDF
 

It's worth noting that the Gumbel and Fréchet models described above relate to maxima (largest extreme value), while the Weibull model relates to minima (smallest extreme value). This form of the Weibull distribution is commonly used in practice.

Generalized Extreme Value Distribution

The Generalized Extreme Value (GEV) distribution is a flexible three-parameter model that combines the Gumbel, Fréchet, and Weibull maximum extreme value distributions. It has the following PDF:

GEV Distribution PDF
 

where z=(x-μ)/σ, and k, σ, μ are the shape, scale, and location parameters respectively. The scale must be positive (sigma>0), the shape and location can take on any real value. The range of definition of the GEV distribution depends on k:

GEV Distribution Domain
 

Various values of the shape parameter yield the extreme value type I, II, and III distributions. Specifically, the three cases k=0, k>0, and k<0 correspond to the Gumbel, Fréchet, and "reversed" Weibull distributions. The reversed Weibull distribution is a quite rarely used model bounded on the upper side. For example, for k=−0.5, the GEV PDF graph has the form:

GEV Distribution PDF - The Reversed Weibull Case
 

When fitting the GEV distribution to sample data, the sign of the shape parameter k will usually indicate which one of the three models best describes the random process you are dealing with.

Using Extreme Value Distributions in EasyFit

EasyFit supports the entire family of extreme value distributions, including the Gumbel, Fréchet, Weibull, and GEV models. Like most distributions in EasyFit, you can fit these models to your data or use them in Excel-based Monte Carlo simulations.

The Gumbel distribution is available in two forms: Gumbel Max (maximum extreme value) and Gumbel Min (minimum extreme value), enabling you to model left-skewed and right-skewed data:

Gumbel Distribution PDF
 

In addition to the "classical" two-parameter Fréchet distribution, EasyFit supports the three-parameter model which has the location parameter γ:

Three-Parameter Fréchet Distribution PDF
 

In this model, α and β have the same meaning as in the two-parameter model, but the distribution is defined for x>γ (γ can take on any real value). Similarly, EasyFit supports both two-parameter and three-parameter Weibull distributions.

Since the range of definition of the Generalized Extreme Value distribution depends on the shape parameter k, this model falls into the Advanced distributions class, according to the classification used in EasyFit.

Distribution Fitting

The extreme value distributions can be easily fitted to your data using either automated or manual fitting capabilities of EasyFit.

In automated fitting mode, EasyFit will fit both forms of the Weibull and Fréchet distributions unless you specify otherwise in the Distribution Fitting Options dialog. A similar capability is available in manual fitting mode.

When fitting the three-parameter Fréchet or Weibull models, you can either have EasyFit estimate the three parameters from data, or manually specify the location parameter and estimate α and β only. This feature can be useful if γ is known and doesn't need to be estimated.

Selecting The Best Fitting Distribution

To compare the fit of the extreme value distributions and select the best fitting model, you can use the goodness of fit tests and distribution graphs displayed by EasyFit. In general, the GEV distribution frequently provides better fit than the Gumbel, Fréchet, and Weibull models.

Applying The Selected Distribution

Once you select the best fitting model, you can use it to perform specific calculations and make appropriate decisions based on the analysis results. Some typical applications include calculating probabilities and making estimates and projections.

References

  • Kotz, S., Nadarajah, S. (2000). "Extreme Value Distributions: Theory and Applications." London: Imperial College Press.
  • Fisher, R.A., Tippett, L.H.C. (1928). "Limiting forms of the frequency distribution of the largest and smallest member of a sample." Proc. Cambridge Philosophical Society 24:180-190.
  • Gumbel, E.J. (1958). "Statistics of Extremes." Columbia University Press, New York.
  • Weibull, W. (1951). "A statistical distribution function of wide applicability" J. Appl. Mech.-Trans. ASME 18(3), 293-297.
  • www.wikipedia.org
  • www.barringer1.com