Parameter Estimation of the Extended Generalized Gaussian Family Distributions using Maximum Likelihood Scheme

Lee, June-Yule

ABSTRACT

An extended generalized Gaussian distribution which can describe a family of symmetric and asymmetric distributions is considered. Parameter estimation of this function using maximum likelihood scheme is proposed. By measured the tail length and skewness of the observed data, the method integrates a pre-calculated table of initial values for parameters estimation. This allows a fast convergence of the presented model for real-time applications. The simulation results also show that the proposed scheme is an asymptotically unbiased estimator in terms of Cramer- Rao lower bound criterion.

PDF Abstract XML References Citation

INTRODUCTION

In the field of science and engineering, the family of the Generalized Gaussian Distributions (GGD) are applied to model and detect the Gaussian and non-Gaussian signals (Miller and Thomas, 1972; Kassam, 1988). For example, in natural environments, the performance of communication systems can significantly degrade due to the atmospheric noise caused by the lightning activity in local and distant storm, or the ambient noise in the ocean. In seismic research (Walden, 1985), the non-Gaussian distribution has been used to model the amplitude distribution of the primary reflection coefficients. The aim is to estimate the reflection coefficient sequences as accurately as possible. In image processing, the fundamental work is fitting the image intensity histogram data to gain underlying knowledge of data source. Fan and Lin (2009) developed an expectation maximization algorithm to estimate the parameter of the GGD mixture model that best fit the observed image intensity histogram data. Other useful applications of the GGD also include, video coding (Sharifi and Leon-Garcia, 1995), handwritten character identification (Solihin and Leedham, 1999), speech recognition (Gazor and Zhang, 2003), etc. Therefore, the choice between different families of distribution is important and the estimation of the designed probability density function from observed data is the major task.

In this study, the parameter estimator for the extended generalized Gaussian distribution to include asymmetry is applied. The parameter estimation is determined by the maximum likelihood scheme with iteration method. The key factor of the iteration method depends on the choice of the initial and step values. Soderlind (2006) presented and discussed the approaches to step size selection in control theory and signal processing. Tlelo-Cuautle et al. (2007) proposed a procedure to control the step-size, where the step size is determined by the eigenvalues of its state matrixes. For saving the computing cost and improving the accuracy, we proposed a table look-up method.

THE MODEL

The generalized Gaussian distribution to include asymmetry proposed by Nandi and Mampel (1995) is:

Image for - Parameter Estimation of the Extended Generalized Gaussian Family Distributions using Maximum Likelihood Scheme

(1)

and

(2)

where, α_n>0, α_p>0, β_n>0 and Γ is gamma function. By setting:

(3)

then the variance of Eq. 1 is chosen as one. By changing the parameters in Eq. 1, one can obtain symmetric (α_n = α_p) and asymmetric (α_n ≠ α_p) distributions. For symmetric cases, Fig. 1, (α_n , α_p) are (1,1) and (2,2) represent the Laplace distribution and Gaussian distribution, respectively. When α_n tends to 0 the distribution tends to impulse and when α_n tends to infinite the distribution tends to Uniform distribution. For asymmetric cases, (Fig. 2), the right skew distribution can be modeled by reducing the parameter α_p and as α_p decreases the tail length in the right side increases. Therefore, Eq. 1 provides a wide range of uni-modal distributions with symmetry and asymmetry.


Fig. 1:	Symmetric model for varying the parameters of (α_n = α_p)


Fig. 2:	Asymmetric model for varying the parameters of (α_n = α_p)

THE ESTIMATOR

Given a series samples x_i, where i = 1,2,3,...,N, the likelihood function of Eq. 1 is:

(4)

where, N₁ (N_r) is the number of x_i samples <0 (≥0). Taking the logarithm of Eq. 4, one gets:

(5)

If ∂ InL/∂α_n, ∂InL/∂β_n and ∂InL/∂α_p are equate to zero, then the maximum likelihood estimates α_n, β_n and α_p are obtained, i.e.,

(6)

(7)

(8)

Where,

(9)

and Ψ(•) is digamma function defined as:

(10)

To fit the observed data and approximate the solutions α_n, β_n and α_n, one need to solve (Eq. 6-8). In the following two iterated methods are developed.

Iterated method 1: Given a reasonable initial values α_n0, β_n and α_p, (Eq. 6-8) can be solved iteratively for α_n, β_n and α_p. First step, compute Eq. 6 with (α_n0±Δα_n) by fixed β_n0 and α_p0, then α_n1 is selected by the value of (α_n0±Δα_n) or (α_n0-Δα_n) so that the likelihood function of Eq. 6 approaches zero. Second step, compute Eq. 7 with (β_n0±Δβ_n) by fixed α_n1 and α_p0, then β_n1 can be updated by the value of (β_n0±Δβ_n) or (β_n0-Δβ_n) so that Eq. 7 approaches zero. Third step, compute Eq. 8 with (α_n0±Δα_n) by fixed α_n1 and β_n1, then α_p1 can be updated by the value of (α_n0+Δα_n) or (α_n0-Δα_n) so that Eq. 8 approaches zero. Thus, one iteration is completed for updating α_n1, β_n1 and α_n1. This process continues until Eq. 6-8 simultaneously approach zero with some error tolerances. The accuracy of the estimation of this method depends on the choice of the step values Δα_n, Δβ_n and Δα_p. Therefore, small step value leads to accuracy of the estimation but requires more computing cost.

Iterated method 2: In this section the gradient method for iteration is developed. In the j-th iteration the Taylor series of Eq. 11 is applied to update the parameters α_nj, β_nj and α_pj.

(11)

where, θ = (α_n, β_n, α_p)^T, grad In L = (∂In L/∂α_n, ∂In L/∂β_n, ∂In L/∂α_p)^T and:

(12)

is a Fisher information matrix. The inverse of the Fisher information matrix plays a key role for the iteration of Eq. 11. Given N samples of the observed data, the Fisher matrix can be obtained from Eq. 12. For each iteration, the expected value of the information matrix is calculated with updating parameters α_nj, β_nj and α_pj. The estimation of the parameter is obtained as the value V(θ^(j))^-1 converge to zero with some error tolerances.

Cramer-Rao lower bound: It is important to understand the unbiasedness of an estimator. The most common way to decide whether or not an estimator’s distribution is unbiased is by looking at its expected value. The Cramer-Rao Lower Bound (CRLB) for the standard deviation of every unbiased estimator is defined as (Hogg and Tanis, 1989):

(13)

(14)

(15)

where, N is the samples of the every realizations and the E[•] is the expected value. Note that Eq. 13-15 are equivalence to the inverse of the square root of the diagonal elements in Eq. 12.

RESULTS AND DISCUSSION

In the following simulation, the digamma function of Eq. 10 is computed using the algorithm in Char et al. (1991) and for further details of the algorithms one can refer to Abramowitz and Stegun (1972). The step value for method 1 is 0.005 and the error tolerance for method 2 is 0.05. All simulation results are based on 50 Monte Carlo runs.

To obtain a good initial value, the tail length and skewness of the observed data are measured and indicated by Q₀ (tailness) and Q₂ (skewness) (Lee, 2003):

(16)

and

(17)

where, are the means of right α_R and left α_L intervals and m(α) is α-trimmed mean. The Q₀ for Gaussian distribution is 2.58 and for Laplace distribution is 3.30, and Q₂ is 1 for all symmetric cases. Distributions with Q₂>1 have relatively longer right tail (right skewness), while those with 0<Q₂<1 have relatively longer left tail (left skewness). By choosing the parameter values (with α_n = α_p for symmetry and α_n ≠ α_p for asymmetry), the Q₀-Q₂ plot (Fig. 3) for various tail length and skewness can be pre-calculated which corresponds to the parameters under the Eq. 1. For example, the values for Gaussian model the (Q₀, Q₂) is (2.58, 1) and the corresponding (α_n, β_n, α_p) is (2, 1.14, 2). The results shown in Fig. 3 are various Q₀-Q₂ plots which cover the tailness up to 5.5 and right skewness up to 10. Using the table look-up of Q₀-Q₂ plots and the corresponding (α_n, β_n, α_p), a reasonable initial value of estimated parameter can be interpolated.

The samples applied in Table 1 are Gaussian distributions, the results shown both iterated methods are reach the exact values of the parameters with low bias. As the sample size increases the standard deviation of the estimation decreases. It appears that the estimation is asymptotically unbiased and satisfies the Cramer-Rao lower bound, where the standard deviations approach but greater than the CRLB (Fig. 4). The results are shown in Table 2 and Fig. 5 are for the samples of Laplace distributions. They are also in agreement with the theoretical values of the parameters.


Fig. 3:	The Q₀-Q₂ plots

The parameter estimation of extended GGD model include asymmetric distribution is achieved in this study. This differs from the previous study of GGD mode, where more data can be fitted in the proposed model. Furthermore, the pre-calculated table of the Q₀-Q₂ plots save the computing cost and improve the accuracy of the estimation. The key factor of the iteration method depends on the choice of the initial and step values, in this study a table look-up method is provided for fast and accuracy estimation.

Table 1:	Gaussian distribution with 50 realizations


Fig. 4:	Standard deviation of the estimated parameters of Gaussian samples


Fig. 5:	Standard deviation of the estimated parameters of Laplace samples

Table 2:	Laplace distribution with 50 realizations

REFERENCES

Abramowitz, M. and I.A. Stegun, 1972. Handbook of Mathematical Functions. Dover Publications, New York, ISBN: 0486612724.
Char, B.W., K.O. Geddes, G.H. Gannet, B.L. Leong, M.B. Monagan and S.M. Watt, 1991. Maple V Manual. Springer-Verlag, New York.
Gazor S. and W. Zhang, 2003. Speech probability distribution. IEEE Signal Proc. Lett., 10: 204-207.
CrossRef
Hogg, R.V. and E.A. Tanis, 1989. Probability and Statistical Inference. 3rd Edn., Maxwell Macmillan, London, ISBN: 0029465001.
Fan, S.K. and Y. Lin, 2009. A fast estimation method for the generalized gaussian mixture distribution on complex images. Comput. Vision Image Understand., 113: 839-853.
CrossRef
Lee, J.Y., 2003. Low bias mean estimator for symmetric and asymmetric unimodal distributions. Comput. Method Programs Biomed., 72: 99-107.
CrossRef PubMed Direct Link
Miller, J.H. and J.B. Thomas, 1972. Detectors for discrete-time signals in non-gaussian noise. IEEE Tran. Inform. Theory, 18: 241-250.
Direct Link
Nandi, A.K. and D. Mampel, 1995. An extension of the generalised gaussian distribution to include asymmetry. J. Franklin Inst., 332: 67-75.
CrossRef
Sharifi K. and A. Leon-Garcia, 1995. Estimation of shape parameter for generalised Gaussian distributions in subband decompositions of video. IEEE Trans. Circuits Syst. Video Technol., 5: 52-56.
CrossRef
Soderlind, G., 2006. Time-step selection algorithms: Adaptivity, control and signal processing. Applied Numerical Mathematics, 56: 488-502.
CrossRef
Solihin, Y. and C.G. Leedham, 1999. Integral ratio: A new class of global thresholding techniques for handwriting images. IEEE Trans. Pattern Anal. Mach. Intell., 21: 761-768.
CrossRef
Tlelo-Cuautle, E., J.M. Munoz-Pacheco and J. Martinez-Carballido, 2007. Frequency scaling simulation of Chua's circuit by automatic determination and control of step-size. Applied Mathematics and Computation, 194: 486-491.
CrossRef Direct Link
Walden, A.T., 1985. Non-gaussian reflectivity, entropy and deconvolution. Geophysics, 50: 2862-2888.
CrossRef
Kassam, S.A., 1988. Signal Detection in Non-Gaussian Noise. Springer-Verlag, New York.

Information Technology Journal

Research Article

Parameter Estimation of the Extended Generalized Gaussian Family Distributions using Maximum Likelihood Scheme

ABSTRACT

How to cite this article

Search

INTRODUCTION

RESULTS AND DISCUSSION

CONCLUSION

ACKNOWLEDGMENT

REFERENCES

Search

Related Articles

Leave a Comment