Conditional Dependence of Trivariate Generalized Pareto Distributions

Barro, Diakarya

Research Article

Conditional Dependence of Trivariate Generalized Pareto Distributions

Diakarya Barro
Department of Academic, Laboratoire LANIBIO, UFR-SEA, Université de Ouagadougou, Burkina Faso

ABSTRACT

In this study we consider the dependence of the family of multivariate generalized Pareto distributions under given conditions on lower dimensional margins. A new function which describes this conditional dependence is built via Pickands dependence function. This function provides a new characterization of the basic subfamilies of trivariate generalized Pareto distributions.

PDF Abstract XML References Citation

How to cite this article

Diakarya Barro, 2009. Conditional Dependence of Trivariate Generalized Pareto Distributions. Asian Journal of Mathematics & Statistics, 2: 20-32.

DOI: 10.3923/ajms.2009.20.32

URL: https://scialert.net/abstract/?doi=ajms.2009.20.32

INTRODUCTION

Extreme Values Theory (EVT) is based on modelling and measuring events which occur with very small probability. Mainly two methods have been developed in this theory: the block maxima method (Beirlant et al., 2005) and the Peacks-Over-Threshold (POT) approach (Coles, 2001; Resnick, 1987). The block maxima method is interested to asymptotic behavior of the laws of the component-wise maxima appropriately normalized under the condition that the univariate margins are independent and identically distributed (iid). This method shows that these asymptotic laws are the Multivariate Extreme Value Distributions (MEVDs). Suggested originally by hydrologists, the POT approach is rather based on modelling of exceedances of a random sample over a large threshold within a time period.

Earlier studies have developed statistical structures to describe the dependence of the multivariate distributions arising from these two approaches. In many latest books and reviews on the topic (Beirlant et al., 2005; Gaume, 2005), it has been shown that no single parametric family can summary the MEVDs like does do the Generalized Extreme Value (GEV) family in the univariate EVT. Nevertheless, if the univariate margins are given the dependence of these distributions can be characterized by equivalent measures like Pickands dependence function, exponent measure or stable tail dependence function (Degen, 2006). Furthermore, Tajvidi has shown (Tajvidi, 1996) that for a sample of random vectors the law H of the exceedances over a large threshold is the multivariate Generalized Pareto Distribution (GPD). Moreover, this excess distribution H is linked to the asymptotic component-wise maxima model G of the same sample by Eq. 1:

(1)

The aim and particularity of this study is to build a new structure which describes the dependence of multivariate GPDs but under given conditions made on the marginal distributions. This new conditional dependence function enable us to characterize the basic parametric subfamilies of three-dimensional GPDs.

MATERIALS AND METHODS

In this study, we consider the following problem: Let consider a situation where X = {(X₁,...,X_n);n≥1} is a random vector with a multivariate GPD function H and we are interested to model a structure which describes both the dependence of H under the condition X_I>x_0,I and the dependence of the survival function of H under the condition X_J≤x_0,J; x_0,I and x_0,J being given realizations of the complementary lower dimensional margins X_I and X_J of X. Therefore, it is desirable to model the structure which gives at any realization x = (x₁,...,x_n) of X the probability of the discordances For this purpose Pickands dependence function of a MEVD would be useful (Coles, 2001; Beirlant et al., 2005). Let:

be the component-wise maxima of a random vector with univariate iid variables and with distribution function F. A m-dimensional continuous and non-degenerate function G is a MEVD if there exists vectors of normalizing sequences with such that (in component-wise algebraic notations), for all

(2)

If Eq. 2 holds F is said to belong to the max-domain of attraction of G. Therefore, from the link established between G and H by Eq. 1, we obtain, for all x = (x₁,...,x_m)εIR^m, the following characterization:

Image for - Conditional Dependence of Trivariate Generalized Pareto Distributions

(3)

where, A is the Pickands dependence function of H defined on the unit simplex

in IR^m-1 such that:

verifying for tεS_m-1, the condition:

μ being the angular measure of H on S_m-1 (Beirlant et al., 2005). In addition, the y_i are defined by the transformations:

(4)

where, μ_iεIR. ξ_i. εIR and σ_i>0 are respectively the location, shape and scale parameters of the univariate margins G_i of G. The different values of the parameter ξ_i allow the GEV distribution defined by:

(5)

to describe the three types of asymptotic extreme behavior such as:

(6)

The laws Λ, Φ and Ψ are from Gumbel, Fréchet and Weibull, respectively.

RESULTS

Here, the main three theorems of this study will be presented and proved.

Angular Distribution of Multivariate GPDs
The following theorem gives the angular distribution of a multivariate GPD.

Theorem 1
Let H be a multivariate GPD. Then, there exists a function L(.) defined on S_m-1 in IR^m-1 such as, for all x = (x₁,...,x_m)εIR^m:

(8)

Moreover, if H is continuously differentiable of order m, the density function l of L fulfills, for all t = (t₁,...,t_m-1)εS_m-1, the equality:

(9)

The function L(.) is the angular distribution of the multivariate GPD H.

Proof
Let V be the exponent measure function of the distribution H with unit Fréchet margins (Michel, 2006; Resnick, 1987). Therefore, for all x_i>0 we have:

(10)

It is known that:

(11)

Where:

l(.) being the angular density of H. Furthermore, we have:

This result inserted in Eq. 11 gives :

By taking:

We have:

Replacing l(t₁,...,t_m-1) in Eq. 10 we see that Eq. 9 assertion holds.

For proving the following theorems we define a conditional dependence measure for the family of multivariate distributions.

A Conditional Measure of a Multivariate Distribution
Let n, k be natural numbers such that {n≥2;1≤k<n} and let N_k be a given subset of k elements of N = {1,...,n}, the set of the first n natural numbers.

Definition 1
We define N_k-partition of a random vector X ={(X₁,...,X_n), n≥2} (or the partition of X in the direction of N_k) by the pairwise vector as:

•	is the k-dimensional marginal vector of X whose component indexes are ordered in the subset N_k

•	is the (n-k)-dimensional marginal vector of X whose component indexes are ordered in the complementary of N_k in N
	Similarly, every realization x = (x₁,..,x_n) of X can be decomposed into two parts

are, respectively realizations of vectors et . If H, HN_k and denote the distribution functions of the random vectors then for all realization x = (x₁,...,x_n) of X we have

are the upper endpoints of the functions HN_k and

Definition 2
Given a N_k-partition of X = (X_l,...,X_n) we define the upper N_k-discordance degree of X as the conditional probability given for all x = (x_l,..., x_n) εIRⁿ by Similarly, the lower N_k-discordance degree of X is defined, for all x = (x_l,...,x_n) in IRⁿ by .

The following definition characterizes the probability that one of the margins and exceeds 1/2, while the values taken by the other are less than 1/2.

Definition 3
Given the distribution H of a multivariate random vector X = {(X₁,...,X_n), n≥2} with univariate margins H_i, 1≤i≤n we define the upper N_k-median discordance degree of H by the real number denoted by : s quantile function of H_i. Similarly, the lower N_k-median discordance degree of H is defined by

Example 1
Let X = (X₁, X₂, X₃) be a trivariate random vector. Let’s consider N₂ = {1, 3}. The lower N₂-discordance degree of X is given, for x = (x₁, x₂, x₃) ε IR³ by . Particularly, if H is a continuous distribution function of X we verify easily that , while for N₁ = {1}, the upper N₁-median discordance degree is given by where H₂>0 and H_1,3>0 are, respectively the distribution functions of the margins X₂ and (X₁, X₃); being the inverse of the survival function of H_i. Let’s suppose, in addition that:

for all x_i ε]0,1] (distribution whose univariate margins are uniform on 0,1]), we check easily that, for θ = 1/2 we get :

The following result shows that the N_k-marginal distribution of a MEVD is also a MEVD.

Theorem 2
Suppose there exists a MEVD G describing the asymptotic behavior of the component-wise maxima of X suitably normalized. Then, there exists a k-dimensional MEVD G_k and a (n-k)-dimensional MEVD G_n-k associated, respectively to the component-wise maxima of the marginal vectors and Moreover G_k and G_n-k are the marginal distributions of G.

Proof
Let σ_n = (σ₁,...,σ_n)εIRⁿ; σ_i>0 and μ_n = (μ₁;...;μ_n) εIRⁿ be the vectors of normalizing sequences of the component-wise maxima M_n = (M₁,...,M_n) associated to the MEVD G by previous Eq. 2. Then, if

is the upper endpoint of marginal vector we have:

Therefore, there exists two vectors of normalizing sequences σi, N_k>0 and μN_k = (μ1,N_k;...;μk, N_k) ε IR^k such as the marginal component-wise maxima MN_k = (M1, N_k,...,Mk,N_k) of M_n converges to GN_k according to Eq. 2. Thereby G_k is a MEVD with k variables.

Similarly, we establish that G_n-k also arises as the limiting distribution of the marginal component-wise maxima linearly normalized with vectors of sequences with

A Conditional Dependence Function of a Multivariate GPD
Note that, in the above, each of the discordance degrees and of a random vector X can be obtained by functional transformations of the other. Therefore, the following characterizations will be restricted to the upper which will be denoted by δ in the simplest case i.e., k = 1.

Theorem 3
Let G be a MEVD with discordance degree δ. Then, there exists a convex function D defined on the unit simplex S_m-1 by:

(7)

for all (x₁,...,x_m) in IRⁿ ; where the y_i(x_i) satisfy Eq. 4, for i = 1,…, m.

D is called the discordance function of G or of its corresponding GPD H.

Taking a MEVD with unit Gumbel margin, G_i(x_i) = exp{-exp(-x_i)}; x_i>0, the following corollary characterizes the simplest upper median degree, .

Corollary
Let G be a MEVD with unit Gumbel. Then, the upper median discordande degree of G, denoted by is given by

Example 2
Let G_θ, θ>1 be the logistic model of MEVD given for (x₁, x₂, x₃)εIR³ by

y_i(x_i) satisfying Eq. 4. Its discordance function is given, for all (t₁, t₂)εS₂ by

and the median discordance degree

Particularly, for θ = 2, we get = 0.864.

Proof of Theorem 3
Let's suppose that, for i = 1,…,m, the univariate margins G_i of the MEVD G have the generalized form: G_i(x_i) = exp{-y_i(x_i)} where the y_i (x_i) satisfy Eq. 4. Therefore, for all (x_l,...,x_n)εIRⁿ;

(12)

Thus, being the joint distribution of the margin vector (X₂,...,X_n) of X, we have:

Furthermore, due to theorem 1, the function is a k-dimensional extreme value distribution. Therefore, if A and are the Pickands dependence functions of G and respectively, then, in Eq. 12 we have:

Furthermore, we have:

where, D is the convex function such that 0≤1-t_l≤1 and defined on the unit simplex

by

for all tεS_m-1. Particularly for the trivariate case, we have:

defined on S₂.

APPLICATION TO THE TRIVARIATE MODELS OF GPDs

The logistic model is the most important family of multivariate GPDs.

The Family of Trivariate GPD of Logistic Type
Let X = (X₁, X₂, X₃) be a trivariate random vector with a parametric distribution H_θ, θ>1. The above Eq. 3 enable us to characterize H_θ by its discordance function D_θ via its Pickands dependence function A_θ (Michel, 2006).

Definition 4
The trivariate parametric function H_θ is a MGPD of Logistic Type if H_θ has, for all (x₁, x₂, x₃)εIR³ the representation:

With the y_i(x_i) satisfying Eq. 4 and where the discordance function D_θ of H_θ is given for all:

We give here three basic trivariate GPDs of Logistic Type (Joe, 1997; Husler and Reiss, 1989) and we build their discordance functions:

•	The trivariate family of GPD of Logistic Type of Gumbel

Particularly if θ→1¯ we obtain the trivariate Pareto independent model H(x₁, x₂, x₃)=1+{-y₁(x₁) +y₂(x₂)+y₃(x₃)]} with D(t₁, t₂) = 2-t₁ for (t₁, t₂)εS₂

•	The trivariate family of GPD of Logistic Type of Galambos

•	The trivariate family of GPD of Logistic Type of Husler-Réiss

The GPD which describes the behavior of the exceedances of the trivariate normal distribution over a large threshold is given for (x₁, x₂, x₃)εIR³; θ = (θ₁, θ₂, θ₃) by:

where, Φ notes the distribution function of the standard normal law and the survival function of the bivariate normal distribution function with covariance matrix:

Thus, for all (t₁, t₂) the corresponding discordance function is:

where, R(t₁, t₂, θ) is an integral rest defined for all (t₁, t₂)εS₂.

The Family of Trivariate GPD of Nested Type
The Nested Logistic Type is an asymmetric subfamily of logistic model. It generalizes this model to allow different degrees of dependence between the components of the underlying random vector. For (x₁, x₂, x₃)εIR³ and θ₁, θ₂>1, define now, recursively the following norm

where, ||.||_θ is the usual θ-norm with the convention that the absolute value is taken if the norm does not have an index (Joe, 1997; Michel, 2006).

Definition 5
The distribution function given, for all (x₁, x₂, x₃)εIR³, by

is called the generalized Pareto distribution of Nested logistic type.

The basic trivariate GPD of Nested Logistic Type is given, for θ₁, θ₂≥1 by:

The Family of Trivariate GPD of Asymmetric Logistic Type
The asymmetric distributions arise as the models which describe the asymptotic behavior of the maxima of storms recorded at different locations along a coastline (Gaume, 2005). They generalize the logistic model but does not include the nested logistic model from the previous section (Michel, 2006).

Definition 6
Let B be a non-empty subset of {1,2,3} and let λ_C≥1 be arbitrary numbers for every C⊂B with |C|≥2 and λ_C = 1 if |C| = 1. Furthermore, let 0≤p_i,C≤1 where, p_i,C = 0 if I∉C and the side condition

is fulfilled for i=1,2,3. Then the distribution function

is a generalized Pareto distribution of Asymmetric Logistic Type.

The basic trivariate GPDs of Asymmetric Logistic Type with their discordance functions follow (Joe, 1997):

•	The trivariate Asymmetric GPD of Logistic Type of Gumbel

•	The trivariate Asymmetric GPD of Logistic Type of Galambos

for (x₁, x₂, x₃)εIR³ and θ₁, θ₂>0. We have, for (t₁, t₂), εS₂

DISCUSSION

The results of the study show that the dependence of all trivariate GDP, under given conditions on the lower dimensional margins, is totally described by its discordance function. These results are similar to the characterizations of the multivariate GPDs developed by Tajvidi (1996) or to the equivalent dependence measures for MEVDs (Resnick, 1987; Beirlant et al., 2005). But the particularity of this study is the that the new measure and function describe the joint dependence under any condition made on the support of a lower dimensional margin. Moreover, the applications of the study determine clearly the three main subfamilies of the models of trivariate GPDs by characterizing them by their discordance function.

We found that the results conform to the solution of the problem considered earlier. This is seen at all realisations x = (x_l,...,x_n) of the random vector X. We also note that the theorem 3 establishes a link between the new dependence structure and the previous dependence measures via Pickands dependence one.

CONCLUSION

In this research we have investigated about characterization of a conditional dependence of multivariate families of generalized Pareto distributions. We have built a new measure and function which describe this conditional dependence. Basic trivariate subfamilies of multivariate GPDs have been characterized by this function. Moreover, we have computed the expressions of this function for the usual trivariate subfamilies of GPDs.

REFERENCES

Beirlant, J., Y. Goegebeur, J. Segers and J. Teugels, 2005. Statistics of Extremes Theory and Applications. John Wiley and Sons, Chichester, ISBN-13: 978-0-471-97647-9.
Coles, S., 2001. An Introduction to Statistical Modeling of Extreme Values. 1st Edn., Springer, New York, ISBN-13: 978-1852334598.
Direct Link
De Haan, L. and A. Ferreira, 2006. Extreme Value Theory: An Introduction. Springer, Berlin.
Gaume, E., 2005. On the asymptotic behavior of flood peak distributions-theoretical Hydrol. Earth Syst. Sci. Discuss., 2: 1835-1864.
Direct Link
Degen, M., 2006. On multivariate generalised pareto distributions and high risk scenarios. Diploma Thesis, Department of Mathematics, ETH Zurich.
Husler, J. and R.D. Reiss, 1989. Extreme Value Theory: Proceedings of a Conference Held in Oberwolfach Dec. 6-12, 1987. Springer, Berlin, ISBN-13: 9780387969541, pp: 279.
Joe, H., 1997. Multivariate Models and Dependence Concepts. Monographs on Statistics and Applied Probabilty. Vol. 73, Chapman and Hall, London, ISBN-13: 9780412073311.
Michel, R., 2006. Simulation and Estimation in Multivariate Generalized Pareto. Dissertation, Fakultat fur Mathematik und Informatikn, Universitat Wurzburg, Wurzburg.
Resnick, S., 1987. Extreme Values, Regular Variation and Point Processes.Extreme Values, Regular Variation and Point Processes.Extreme Values, Regular Variation and Point Processes. Springer Series of the Applied Probability Trust, Springer, New York.
Tajvidi, N., 1996. Characterisation and Some Statistical Aspects of Univariate and Multivariate Generalised Pareto Distributions. Dissertation, Department of Mathematics, Chalmers Tekniska Hogskola Goteborg, Swedish.

Asian Journal of Mathematics & Statistics

Research Article

Conditional Dependence of Trivariate Generalized Pareto Distributions

ABSTRACT

How to cite this article

Search

INTRODUCTION

MATERIALS AND METHODS

RESULTS

DISCUSSION

CONCLUSION

REFERENCES

Search

Related Articles

Leave a Comment