Proceedings of the 3rd International Conference on Industrial Application Engineering 2015
© 2015 The Institute of Industrial Applications Engineers, Japan.
Parameter Estimation of the Burr Type XII Distribution with a
Progressively Interval-Censored Scheme Using Genetic Algorithm
Tzong-Ru Tsaia*, Jyun-You Jiangb, Yuhlong Lioc, Nan Jiangc and Ya-Yen Fana
a
Department of Statistics, Tamkang University, Taiwan
b
School of Statistics, Southwestern University of Finance and Economics, China
c
Department of Mathematical Sciences, University of South Dakota, USA *Corresponding Author: [email protected]
Abstract
Burr type XII distribution (BXIID) has been widely used to model lifetime data sets. The flexibility of the BXIID is established due to its two shape parameters. To save test time and cost, the BXIID parameters can be inferred by using the maximum likelihood estimation method based on a date set with incomplete lifetime information. But the maximum likelihood estimates (MLEs) of BXIID parameters could have a big bias and mean squared error (MSE) if the sample size is small or the MLEs are evaluated with improper initial parameters. In this study, a progressively interval-censored (PIC) scheme is used to implement the life test, and the Genetic Algorithm (GA) is applied to reduce the bias and MSEs of the MLEs of the BXIID parameters. An extensive Monte Carlo simulation was conducted to evaluate the estimation performance of the typical maximum likelihood estimation method (TMLEM) and GA. Simulation results show that the GA is competitive with the TMLEM in terms of resulting in a smaller bias and MSE in parameter estimation.
Keywords: Burr type XII distribution; Genetic Algorithm; maximum likelihood estimation; Monte Carlo simulation; progressively interval-censored scheme.
Abbreviations and Acronyms
BXIID Burr type XII distribution MSE mean squared error MLE maximum likelihood estimateGA genetic algorithm
PIC progressively interval-censored
TMLEM typical maximum likelihood estimation method
Notations
BXIID lifetime
density function of distribution function of
ith inspection time
number of failed subjects in , number of surviving subjects removed at
. likelihood function
. log-likelihood function MLE of obtained using TMLEM
MLE of obtained using TMLEM MLE of obtained using GA MLE of obtained using GA
. generated value of in simulation
. generated value of in simulation
/
largest integer equal or smaller than
1.
Introduction
Tadikamalla [2] showed that the BXIID connects to lots of useful lifetime distributions. He concluded that the BXIID can be used to model almost all data sets. Many existing studies considered the BXIID as a favorable baseline distribution for performing reliability assessment [3]-[16]. The density function and distribution function of BXIID are, respectively, defined by
| , 1 , (1)
and
| , 1 1 , 0, 0, 0, (2)
where and are both shape parameters.
Because the duration to collect a complete data set of failure times of highly reliable products is time consuming and could not be affordable by the manufacturers. Some schemes for saving testing time, such as censoring and truncated schemes, have been proposed to conduct the life test, and then a censoring sample or truncated sample can be used to implement reliability assessment for highly reliable products.
Censoring often is implemented with a time censoring scheme, failure censoring scheme, or interval censoring scheme. The time censoring is type I censoring, which terminates the life test at a predetermined time. The observed failure times and the termination time are used for the type I censored sample to implement reliability assessment. The failure censoring is type II censoring, a life test under the type II censoring is terminated when a predetermined number of failure times is collected. All the failure times and the last surviving time are used for the type II censored sample to implement reliability assessment.
The number of failed subjected in a type I censoring scheme is a random variable, and the termination time in a type II censoring scheme is a random variable. The type I censoring scheme is favorable by practitioner because the termination time of life test can be predetermined. However, the practitioner may only inspect the life test at specific times, and only the number of failed subjects in individual time intervals are taken into account. The exact failure times of subjects are unknown. Such inspection method is named interval-censoring method. The termination time of life test also can be predetermined by using an interval-censoring scheme, and such scheme can be conducted easier than conducting a type I censoring.
To avoid only observing the shortest lifetimes of subjects
in a life test, the practitioner would like to occasionally removing some surviving subjects during the life test. In this paper, an interval-censoring scheme of constant removals, named a PIC scheme, is taken for the life test to implement reliability assessment for highly reliable products. Progressively censoring schemes have been widely used for life testing [8], [11]-[13], [15] and [17].
A PIC sample is collected as follows: subjects are subjected to life testing at the initial time 0. The number of failed subjects are taken into account at the scheduled times , , … , , respectively, and surviving subjects are randomly removed from the life test at time . The life test terminates at time . Let denote the number of subjects failed in the interval , . Aggarwala [17] derived the likelihood function, based on a PIC sample, by
c, k ∝ ∏ | , | , 1
| , , (3)
where | , 0. No closed forms of solutions for Equation (3) can be found. Iterative numerical search methods are suggested to search for the MLEs of parameters. The MLEs of and , denoted by ̂ and , can be obtained, by using the TMLEM, as follows:
̂ , aug max
, ℓ , , (4)
where ℓ , log , denotes the log-likelihood function. Two difficulties for using (4) to obtain the MLEs: 1. As the true values of or close to 0, the TMLEM
often fails due to the values of log and log could be undefined to maximize the log-likelihood function.
2. Because initial parameters of and need to be pre-determined for the TMLEM, the TMLEM is sensitive to the initial parameters. The identification of initial parameters is difficult to do if full knowledge about the lifetime data is not complete, or too many multiple parameters need to be solved simultaneously.
To overcome these two difficulties of using the TMLEM to find the MLEs of the parameters of the BXIID based on a PIC sample, GA can act a potential candidate method to search the MLEs of the parameters and of the BXIID. This study denotes the MLEs of and , obtained by using the GA, by ̂ and , respectively. The principal of GA is briefly given as follows:
1. Choose an initial population.
2. Determine the fitness of each individual. 3. Perform selection.
4. Perform crossover and mutation
5. Determine the fitness of each individual. 6. Perform selection.
7. The generational process is repeated until a termination condition has been reached.
Common termination condition(s) can be one of the following condition, or combinations of them:
1. A solution is reached to meet the specific criteria.
2. The fixed number of generations is reached. 3. The allocated budget is reached.
4. The highest ranking solution’s fitness is reached, or the solutions cannot be improved by
successive iterations.
The GA is used to find the MLEs of and to maximize (3) not to maximize the log-likelihood function. Moreover, the GA does not need to set up initial parameters for obtaining the MLEs. Numerous existing studies about using GA for computation can be found [18]-[25].
The rest of this paper is organized as follows. In Section 2, an extensive Monte Carlo simulation was conducted to compare the estimation performance of the TMLEM and GA for PIC samples, which were generated from the baseline of BXIID based on an algorithm. Conclusions drawn from this study are provided in Section 3
2.
Monte Carlo Simulation
The simulation, using the TMLEM and GA to obtain the MLEs of the parameters of the BXIID, was conducted with two algorithms. Algorithm 1 is provided to generate PIC samples, each sample contains failed subjects and removals, denoted by , , … , and , , … , . The MLEs of the BXIID distribution parameters are evaluated by using the generated PIC samples. The construction of Algorithm 1 is similar to the algorithm provided by Aggarwala [17].
Algorithm 1.
Step 1. Let 0, Determine the inspection times
1 An instruction to employ the R package “optim” can be found via the link
http://127.0.0.1:29635/library/stats/html/optim.html.
by , , … , and withdraw probabilities by
, , … , , , where 0 1 for
1,2, … , 1 , and 1;
Step 2. Let 0 and . . 0;
Step 3. Let 1; Generate from the binomial distribution with sample size . . and the success probability
, ,
, ;
Let ∑ , where
is the largest integer equal or smaller than ;
Step 4. Let . . and . . ;
Step 5. If , revert to Step 3; otherwise stop the algorithm.
The R package “optim”1 provides a general-purpose optimization based on Nelder–Mead, quasi-Newton and conjugate-gradient algorithms, and the optimization procedure includes an option for box-constrained optimization and simulated annealing. In this paper, the R package “optim” was used to evaluate the MLEs of the parameters of the BXIID for the TMLEM.
Because the target function in (3) is nonlinear and complicated, search the MLEs of parameters by using the TMLEM could be failed. In this study the MLEs are also searched by using the GA over the parameter space. The R package “GA”2, which was published in 2014, was used to implement the GA procedure. The package “GA” provides a flexible general-purpose set of tools for implementing GA search in both the continuous and discrete case, whether constrained or not. The simulation was conducted using the algorithm 2.
Algorithm 2.
Step 1. Generating a PIC sample of , , … , and
, , … , for the baseline BXIID, which
defined by (1) and (2), using Algorithm 1.
Step 2. Obtain the MLEs ̂ , and ̂ , by using the TMLEM with given initial parameters of , and GA.
Step 3. Repeat Steps 1-2 1000 times, and denote the resulting MLEs by ̂ ,, , and ̂ ,, , for
1,2, … ,1000. Evaluating the bias and MSEs of the 2 See the menu of using R package GA for more
information via the link http://cran.r-project.org/web/packages/GA/.
estimators, respectively, by
Bias ∑ ,
and
MSE ∑ ,
where denotes the MLE of the parameter for the the simulation run . If , then can be
̂ , or ̂ ,; and if , then can be , or ,.
The parameters, population size 50, probability of crossover 0.8, probability of mutation 0.1, and the iterations 100, are used to implement the GA.
Table 1. Bias and MSEs of the MLEs of the parameters and
Bias MSE Bias MSE
GA 2.645 15.847 2.304 12.987
TMLEM 4.876 133.417 3.370 41.934
True parameters , 3,5 , 30, 5,
0.05 for 1,2,3,4 and 1 were given for the BXIID in this simulation study. The common initial
parameters , 1,1 are used for the TMLEM
method, and the GA search the MLEs of parameters over the interval of (0.01, 10). All simulation results were prepared in Table 1, Fig. 1 and 2. Table 1 shows that the MLEs ̂ and
have a smaller bias and MSE than that of the ̂ and . In particular, the GA helps to greatly reduce the MSEs of the MLEs, and that means the GA is useful to obtain stable MLEs of the parameters and of the BXIID based on PIC samples.
The boxplots of 1000 MLEs of the parameters and , those were obtained from the simulation, are given in Fig. 1 and 2. Both boxplots indicate that the MLEs, obtained by using the TMLEM, are more spread than that obtained by using the GA. The MLEs of ̂ in Fig. 1 spread over a wide range, and the values could be much larger than the true parameter. The MLEs of in Fig. 2 are much spread over a wide range compared with that of . All these simulation results show that the TMLEM work unstably to reach the MLEs of the parameters and , and the GA works better and can be considered as a competitive method instead of the
TMLEM.
3.
Conclusions
1. In this paper, the TMLEM and GA are used to study the estimation performance of obtaining the maximum likelihood estimates of the parameters of BXIID based on PIC samples. An extensive Monte Carlo simulation was conducted, and the simulation results indicate that the GA based MLEs have a smaller bias and MSE than that MLEs obtained by using the TMLEM.
2. The GA is competitive with the TMLEM to obtain the MLEs of parameters, and therefore we recommend to use the GA to obtain MLEs when the parameters of the BXIID are estimated with using a PIC scheme. 3. Using the GA for other baseline distributions or other
testing schemes are also favorable. Other evaluation computation methods, for example the Differential Evaluation, Evolution Strategies and Particle Swarm Optimization methods, could also be helpful to reduce the bias and MSE in parameter estimation. All these topics can be addressed in future studies.
Acknowledgment
This study is supported by the grant of Ministry of Science and Technology, Taiwan MOST 103-2221-E-032-054.
Fig. 1 Boxplot of the MLEs of obtained by using the TMLEM and GA TMLEM GA 01 0 2 0 3 0 Estimation methods E s ti m a te o f c
Fig. 2 Box plot of the MLEs of obtained by using the TMLEM and GA
References
(1) I.W. Burr, “Cumulative frequency functions”, Annals of Mathematical Statistics, Volume 13, pp. 215-232, 1946. (2) P.-R. Tadikamalla, “A look at the Burr and related
distributions”, International Statistical Review, Volume 48, pp. 373-344, 1980.
(3) D.R. Wingo, “Maximum likelihood methods for fitting the Burr type XII distribution to life test data”, Biometrical Journal, Volume 25, pp. 77-84, 1983.
(4) D.R. Wingo (1993). “Maximum likelihood estimation of Burr XII distribution parameters under type II censoring”, Microelectronics and Reliability, Volume 33, pp. 1251-1257, 1993.
(5) M.A.M. Ali Mousa, “Empirical Bayes estimators for the Burr type XII accelerated life testing model based on type-2 censored data”, Journal of Statistical Computation and Simulation, Volume 52, no. 2, pp. 95-103, 1995. (6) P.L. Gupta, R.C. Gupta, S.J. Lvin, “Analysis of failure
time data by Burr distribution”, Communications in Statistics-Theory and Methods, Volume 25, pp. 2013-2024, 1996.
(7) W.J. Zimmer, J.B. Keats and F.K. Wang, “The Burr XII distribution in reliability analysis”, Journal of Quality Technology, Volume 30, pp. 386-394, 1998.
(8) M.A.M. Ali Mousa and Z.F. Jaheen, “Statistical inference for the Burr model based on progressively censored data”, Computers and Mathematics with Applications, Volume 43, pp. 1441-1449, 2002.
(9) S.K. Upadhyay, I.A. Javed and M. Peshwani, “Bayesian analysis of generalized four-parameter Burr distribution via Gibbs sampler”, METRON, Volume LXII, pp. 115-135, 2004.
(10) Wu J-W and H.-Y. Yu, “Statistical inference about the shape parameter of the Burr type XII distribution under the failure-censored sampling plan”, Applied Mathematics and Computation, 163: 443-482, 2005. (11) S.-J. Wu, Y.-J. Chen and C.-T. Chang, “Statistical
inference based on progressively censored samples with random removals from the Burr Type XII distribution”, Journal of Statistical Computation and Simulation, Volume 77, pp. 19-27, 2007.
(12) S.-F. Wu, C.-C. Wu, Y.-L. Chen, Y.-R. Yu and Y.P. Lin, “Interval estimation of a two-parameter Burr-XII distribution under progressive censoring”, Statistics, Volume 44, pp. 77–88, 2010.
(13) Y.L. Lio and T.-R. Tsai, “Estimation of δ P X for Burr XII distribution based on the progressively first failure-censored samples”, Journal of Applied Statistics, Volume 39, pp. 309-322, 2012.
(14) M. Aslam, Y.L. Lio, C.-H. Jun, “Repetitive acceptance sampling plans for Burr type XII distribution”, International Journal of Advanced Manufacturing Technology,Volume 68, pp. 495-507, 2013.
(15) A.A. Soliman, A.H. Abd Ellah, N.A. Abou-Elheggag and A.A. Modhesh, “Estimation from Burr type XII distribution using progressive first-failure censored data”, Journal of Statistical Computation and Simulation, Volume 83, no. 12, pp. 2270-2290, 2013.
(16) T.-R. Tsai, Y.L. Lio, N. Jiang, Y.-J. Lin and Y.-Y. Fan, “Economical sampling plans with warranty based on truncated data from Burr type XII distribution”, Journal of the Operational Research Society, to appear.
(17) R. Aggarwala, “Progressively interval censoring: some mathematical results with application to inference”, Communications in Statistics-Theory and Methods, Volume 30, pp. 1921-1935, 2001.
(18) C.-K. Ting, "On the mean convergence time of multi-parent genetic algorithms without selection", Advances in Artificial Life, Volume 3630, pp. 403–412, 2005. (19) J. ZHANG, H. Chung and W. L. Lo, “Clustering-based
adaptive crossover and mutation probabilities for genetic algorithms”, IEEE Transactions on Evolutionary
TMLEM GA 46 8 1 0 1 2 1 4 Estimation methods E s ti m a te o f k
Computation, Volume11, no.3, pp. 326–335, 2007. (20) Z. Akbari, "A multilevel evolutionary algorithm for
optimizing numerical functions", International Journal of Industrial Engineering Computations, Volume 2, Number 2, pp. 419–430, 2011
(21) C. Ferreira, "Gene expression programming: a new adaptive algorithm for solving problems". Complex Systems, Volume 13, Number 2, pp. 87-129, 2001. (22) M. Nicholson, “Genetic algorithms and grouping
problems”, Software: Practice and Experience, Volume 28, Number 10, 1998.
(23) B. Baudry, F. Fleurey, J.M. Jézéquel, and Y. Le Traon, "Automatic test case optimization: a bacteriologic algorithm" IEEE Software, Volume 22, Number 2, pp. 76–82, 2005.
(24) P. Civicioglu, "Transforming geocentric cartesian coordinates to geodetic coordinates by using differential search algorithm". Computers & Geosciences, Volume 46, pp. 229–247, 2012.
(25) G. Kjellström, "On the efficiency of Gaussian adaptation". Journal of Optimization Theory and Applications, Volume 71, Number 3, pp. 589–597, 1991.