HOME JOURNALS CONTACT

Journal of Applied Sciences

Year: 2011 | Volume: 11 | Issue: 17 | Page No.: 3192-3197
DOI: 10.3923/jas.2011.3192.3197
Estimation of P (Y<X) in the Rayleigh Distribution in the Presence of k Outliers
Afshin Ghanizadeh

Abstract: In present study, we considered the problem of estimating R = P (Y<X), where Y had Rayleigh distribution with parameter λ and X had Rayleigh distribution with presence of k outliers with parameters θ and β, such that X and Y are independent. The moment, maximum likelihood and mixture estimators of R are derived and had been shown that the moment estimator of R is asymptotically unbiased estimator. At the end, we concluded that mixture estimators are better than the maximum likelihood and moment estimators.

Fulltext PDF Fulltext HTML

How to cite this article
Afshin Ghanizadeh , 2011. Estimation of P (Y<X) in the Rayleigh Distribution in the Presence of k Outliers. Journal of Applied Sciences, 11: 3192-3197.

Keywords: mixture estimator, outliers, Newton-Raphson method, maximum likelihood estimator, Monte-Carlo simulation, Rayleigh distribution and moment estimator

INTRODUCTION

In reliability context inferences about R = P (Y<X), where X and Y independent distribution, are a subject of interest. For example in mechanical reliability of a system if X is the strength of a component which is subject to stress Y, then R is a measure of system performance. The system fails, if at any time the applied stress is greater than its strength. Stress-strength reliability has been discussed by Kapur and Lamberson (1977). Some other aspects of inference about R are given in Al-Hussaini et al. (1997) and Sathe and Dixit (2001) have been estimate of R = P (Y<X) in the negative binomial distribution and recently, Nasiri and Pazira (2010b) have done estimation of R = P (Y<X) with presence of outliers in exponential case.

Dixit et al. (1996) assumed that random variables (X1, X2,..., Xn) represent the distance of an infected sampled plant from a plant in a plot of plants inoculated with a virus. Some of the observations are derived from the airborne dispersal of the spores and are distributed according to the exponential distribution. The other observations out of n random variables (say k) are present because aphids which are know to be carriers of Barley Yellow Mosaic Dwarf Virus (BYMDV) have passed the virus into the plants when the aphids feed on the sap. Dixit and Nasiri (2001) considered estimation of parameters of the exponential distribution in the presence of outliers generated from uniform distribution (Khamis et al., 2006; Elahi et al., 2009; Muiru et al., 2010; Shittu and Shangodoyin, 2008; Xie et al., 2007).

In present study, we obtain the moment, maximum likelihood and mixture estimators of R in Rayleigh distribution with presence of k outliers generated from the same distribution. Rayleigh distribution is a special case of the Weibull distribution. It has been used to study the scattering of radiation, wind speeds or to make certain transformation. The probability density function of the Rayleigh distribution with parameter of θ is given by:

The Maximum Likelihood Estimator (MLE) of θ is:

Thus, we assume that the random variables (X1, X2,..., Xn) are such that k of them are distributed with p.d.f f1 (x, θ, β):

(1)

and the remaining (n-k) random variables are distributed with p.d.f f2 (x, θ):

(2)

METHOD OF MOMENT

Let Y1, Y2,..., Ym be a random sample for Y with P.d.f.:

(3)

and X1, X2,..., Xn be a random sample for X with P.d.f.:

(4)

with presence of k (Know) outliers (Dixit, 1989; Dixit and Nasiri, 2001; Nasiri and Pazira, 2010a; Abu-Shawiesh et al., 2009; Goegebeur et al., 2005; Guo and Zhang, 2011; Ismail, 2008). The parameter R we want to estimate is:

(5)

For k = 0, R was proposed by Surles and Padgett (1998). Since P (Y<X) + P (Y>X) = 1, we consider P (Y>X):

(6)

Where:

The moment estimator of R can be shown to be:

(7)

where, , and can be obtained as;

From Eq. 4 we get:

(8)

(9)


(10)

and:

(11)

From Eq. 11, we have:

(12)

(13)

Where:

If Δ = ξ22-4 ξ1ξ3 is non-negative then the roots are real. Therefore:

(14)

and from Eq. 10:

(15)

Thus, we can obtain the moment estimator of β and θ by using Eq. 14 and 15, respectively.

Here, we shall show that and are asymptotically unbiased estimators. Let:

and:

then:

Here, we can write as a function of W1, W2:

(16)

Let:

Expand the function h (w1, w2) around (μ, v) by Taylor series:

(17)

Then:

and similarly:

Note that if we consider:

and , It is easy to show that .

METHOD OF MAXIMUM LIKELIHOOD

The maximum likelihood estimator of R can be shown to be:

(18)

Where:

and to obtain the maximum likelihood estimator of θ and β, we consider the joint distribution of X with presence of k outliers:

(19)

where,

and:

For the more details see Dixit and Nasiri (2001) and Nasiri and Pazira (2010a).

If L (θ, β) = In [L (x, θ, β)], then from Eq. 19:

(20)

The solve for our MLEs of θ and β we take the derivative of the log likelihood (L (θ, β)) with respect to each parameter set the partial derivatives equal to zero and solve for and :

(21)

(22)

There is no closed-from solution to this system of equations, so we will solve for and iteratively, using the Newton-Raphson method, a tangent method for root finding. In our case we will estimate α = (θ, β) iteratively:

(23)

where, g is the vector of normal equations for which we want g = [g1, g2] with:

(24)

(25)

and G is the matrix of second derivatives:

(26)

Where:

(27)

(28)

(29)

With:

The Newton-Raphson algorithm converges, as our estimates of θ and β change by less than a tolerated amount with each successive iteration, to and . For k = 0, was proposed by Surles and Padgett (1998).

Note: Here, the number of k outlying observations is known, k = 1, 2,..., n. But if k is unknown, then k can be selected by evaluating the likelihood for different value of k choosing the one that maximizes the likelihood.

MIXTURE OF METHOD OF MOMENT AND MAXIMUM LIKELIHOOD

Read (1981) proposed the methods which avoid the difficulty of complicated equations. According to Read (1981), replacement of some but not all, of the equations in the system of likelihood may make it more manageable. From Eq. 15, we have:

(30)

and from Eq. 22:

(31)

SIMULATION STUDY

In present study, we have addressed the problem of estimating P (Y<X) for the Rayleigh distribution with presence of k outliers. The moment, maximum likelihood and mixture estimators of R are derived and has been shown that the moment estimator of R is asymptotically unbiased estimator. All the results are base on 1000 replications and are given in Table 1 and 2 for k = 1 and 2, respectively. In this case as expected when m = n and m, n increase then the average biases and the MSEs decrease. For fixed m as n increase the MSEs decrease and also for fixed n as m increase the MSEs decrease.

Table 1: Biases and Mean Squared Errors (MSE's) of the point estimates from Rayleigh distribution, when k = 1, β = 2, θ = 5, λ = 3

Table 2: Biases and Mean Squared Errors (MSE's) of the point estimates from Rayleigh distribution, when k = 2, β = 2, θ = 5, λ = 3

CONCLUSION

From Table 1 and 2, we conjecture that the moment estimator of R is asymptotically unbiased. On the other hand, the moment and mixture estimators are underestimation but the maximum likelihood estimator is overestimation, this is true for k = 1 and 2. The MSEs of any three estimators are tending to zero and when m = n and m, n increase then the MSEs decrease and for fixed m as n increase the MSEs decrease and also for fixed n as m increases the MSEs decrease, this is true for k = 1 and 2.

Table 1 and 2 showed that the mixture estimator has the smallest estimated MSEs as compared with the moment and maximum likelihood estimators. We strongly feel mixture estimator is better and easy to calculate than the maximum likelihood and moment estimations. From the previous observations, we suggest to use mixture method for estimating R = P (Y<X) in Rayleigh case with presence of k outliers because it is easy to calculate than the rest.

ACKNOWLEDGMENTS

With special thanks of Islamic Azad University Kermanshah branch, Iran for the financial support of this present work and Dr. Adel Fatemi for his helpful guides through the work.

REFERENCES

  • Abu-Shawiesh, M.O., F.M. Al-Athari and H.F. Kittani, 2009. Confidence interval for the mean of a contaminated normal distribution. J. Applied Sci., 9: 2835-2840.
    CrossRef    Direct Link    


  • Al-Hussaini, E., M.A.M.A. Mousa and K.S. Sultan, 1997. Parametric and nonparametric estimation of for finite mixtures of log normal components. Commun. Statist. Theory Methods, 26: 1269-1289.
    Direct Link    


  • Dixit, U.J., 1989. Estimation of parameters of the gamma distribution in the presence of outliers. Commun. Statist.-Theory Methods, 18: 3071-3085.


  • Dixit, U.J., K.L. Moor and V. Barnett, 1996. On the estimation of the power of the scale parameter of the exponential distribution in the presence of outliers generated from uniform distribution. Metron, 54: 201-211.


  • Dixit, U.J. and P.F. Nasiri, 2001. Estimation of parameters of the exponential distribution in the presence of outliers generated from uniform distribution. Metron, 49: 187-198.
    PubMed    


  • Goegebeur, Y., V. Planchon, J. Beirlant and R. Oger, 2005. Quality assessment of pedochemical data using extreme value methodology. J. Applied Sci., 5: 1092-1102.
    CrossRef    Direct Link    


  • Guo, Y., Y. Gu and Y. Zhang, 2011. Invariant feature point based ICP with the RANSAC for 3D registration. Inform. Technol. J., 10: 276-284.
    CrossRef    Direct Link    


  • Ismail, S.O., 2008. Accommodation of outliers in time series data: An alternative method. Asian J. Math. Statist., 1: 24-33.
    CrossRef    Direct Link    


  • Kapur, K.C. and L.R. Lamberson, 1977. Reliability in Engineering Design. John Wiley and Sons, New York


  • Khamis, A., Z. Ismail, K. Haron and A.T. Mohammed, 2006. Modeling oil palm yield using multiple linear regression and robust M-regression. J. Agron., 5: 32-36.
    CrossRef    Direct Link    


  • Elahi, M., L. Xinjie, M.W. Nisar and H. Wang, 2009. Distance based outlier for data streams using grid structure. Inform. Technol. J., 8: 128-137.
    CrossRef    Direct Link    


  • Muiru, W.M., B. Koopmann, A.V. Tiedemann, E.W. Mutitu and J.W. Kimenju, 2010. Evaluation of genetic variability of Kenyan, German and Austrian isolates of Exserohilum turcicum using amplified fragment length polymorphism DNA marker. Biotechnology, 9: 204-211.
    CrossRef    Direct Link    


  • Nasiri, P.F. and H. Pazira, 2010. Bayesian and non-Bayesian estimations on the generalized exponential distribution in the presence outliers. J. Stat. Theory Practice, 4: 453-475.


  • Nasiri, P. and H. Pazira, 2010. Estimation of P(Y<X) for exponential distribution in the presence of k outliers. J. Applied Sci.,


  • Read, R.R., 1981. Representation of certain covariance matrices with application to asymptotic efficiency. J. Am. Statist. Assoc., 76: 148-154.
    Direct Link    


  • Sathe, Y.S. and U.J. Dixit, 2001. Estimation of P[XY] in the negative binomial distribution. J. Stat. Plann. Inference, 93: 83-92.
    CrossRef    Direct Link    


  • Shittu, O.I. and D.K. Shangodoyin, 2008. Detection of outliers in time series data: A frequency domain approach. Asian J. Scientific Res., 1: 130-137.
    CrossRef    Direct Link    


  • Surles, J.G. and W.J. Padgett, 1998. Inference for P(Y < X) in the Burr Type X model. J. Appl. Stat. Sci., 26: 225-238.


  • Xie, Z., S. Wang, D. Zhang, F.L. Chung and Hanbin, 2007. Image segmentation using the enhanced possibilistic clustering method. Inform. Technol. J., 6: 541-546.
    CrossRef    Direct Link    

  • © Science Alert. All Rights Reserved