Fit Tests for the Generalized Exponential Distribution
Suzanne Abdel-Rahman Allam Mathematical Statistics Department
Institute of Statistical Studies and Research Cairo University, Egypt
Abstract
The goodness-of-fit test of Towhidi and Salmanpour (2007), which is based on the empirical characteristic function is used for testing the fit of the generalized exponential distribution. Monte Carlo procedures are employed to obtain the empirical percentage points for the test statistic. In addition, an extensive power study is conducted to compare the performance of this test against the well known goodness-of-fit tests which are based on empirical distribution function.
Keywords: goodness-of-fit tests, empirical characteristic function, generalized exponential distribution, maximum likelihood estimation, empirical distribution function.
1. Introduction
A statistical problem encountered in many areas of research is the need to assess whether a sample of observations comes from a specified distribution. Typically such situations are known as 'Goodness of Fit' problems. Many goodness-of-fit tests were introduced in the literature. The tests based on the empirical distribution function (EDF) are the most commonly used goodness-of-fit tests. In addition, there are goodness-of-fit tests that are based on the empirical characteristic function (ECF).
The generalized exponential distribution has been introduced and studied quite extensively by Gupta and Kundu (1999, 2001, 2002, 2003). It is observed that this distribution can be considered for situations where a skewed distribution for a non-negative random variable is needed. Also, it is observed that it can be used quite effectively to analyze lifetime data in place of gamma, Weibull and log-normal distributions. The generalized exponential distribution has the following distribution function:
( ; , ) (1 x)
F x e ; , 0 (1.1)
for x0 and 0 otherwise. The corresponding density function is:
1
( ; , ) (1 x) x
f x e e ; , 0 (1.2)
for x0 and 0 otherwise. In addition, the CF of the generalized exponential distribution is given by:
1( )
1
it t
it
Here
is the shape parameter and is the scale parameter. If the shape parameter 1, then the generalized exponential distribution coincides with the exponential distribution with a scale parameter . The generalized exponential distribution with the two parameters
and will be denoted by GE(
, ).Hassan (2005) used different EDF statistics for testing the goodness of fit for the GE( , ) with unknown parameters. She obtained the critical values for these test statistics after estimating the unknown parameters by ML method. In addition, Hassan (2005) conducted a power comparison between the Kolmogorov-Smirnov (K-S), Cramer-von Mises (C-M), Anderson-Darling(A-D), Watson (WA), and Liao and Shimokawa (L-S) tests for the GE( , ) with the two parameters unknown. She considered different alternatives that are; the standard normal, Weibull, Gamma, exponential, chi-square and uniform distributions. From this power study, Hassan (2005) concluded that the L-S test is generally superior to other tests. Also, she mentioned that the WA test is not appearing to be powerful across this group of alternative distributions.
The present work is interested in applying the test of Towhidi and Salmanpour (2007) for testing the null hypothesis:
0
H : The sample x x1, ,...,2 xn comes from GE(
, ) with unknown parameters
and . (1.4)
The unknown parameters of this distribution will be estimated by the maximum likelihood (ML) method. Also, a power comparison between the ECF test and the EDF tests is conducted.
The organization of this article is as follows. Section 2 discusses estimation of the unknown parameters of the GE(
, ) by the ML method. The goodness-fit-test based on ECF for the GE(
, ) is provided in section 3. EDF tests for the GE(
, ) are discussed in Section 4. Then, a power study is conducted at section 5. Finally, conclusions are included in section 6.2. ML Estimation of the unknown parameters of the Generalized Exponential distribution
For testing H0 given in (1.4) the unknown parameters of the GE(
,) must beestimated first. In this section the ML method for parameter estimation is considered.
Gupta and Kundu (2001b) estimated the two parameters,
and
, of the GE(
, ) using the ML method. They concluded the following two equations:ˆ
ˆ
ln(1 ) n
x
n e
ˆ ˆ ˆ ˆ
1 1 1 1
ˆ{[ ln(1 i)][ {( i) (1 i)}] [ ][ ln(1 i)]}
n n n n
x x x x
i i
i i i i
n e x e e x e
ˆ
1ln 1 0
i n
x
i
n e
. (2.2)Where x x1, ,...,2 xn is a random sample from the GE(
, ). Equation (2.2) has tobe solved for the unknown
ˆ by iteration to obtain the ML estimator of . Then, the ML estimator of
, ˆ, can be easily obtained as a function of
ˆ from equation (2.1).3. Goodness-of-fit tests based on ECF for the generalized exponential distribution
There are different forms of test statistics that are based on ECF. These tests depend on measuring the distance between the ECF of the sample data and the CF of the hypothesized distribution. Therefore; according to the technique of measuring this distance; these tests can be divided into two major types. First; tests based on the integral of the squared modulus of the difference between the ECF and the CF of the null distribution, such as the test of Wong and Sim (2000). Second, tests based on the difference between the real (or imaginary) part of the ECF and the real (or imaginary) part of the CF of the hypothesized distribution, such as the test of Towhidi and Salmanpour (2007).
The first type of these tests can be applied only for distributions which have CF in closed form but for other distributions which do not have CF in closed form the computations will be difficult if not impossible. This is because the algorithm will contain more than one integration that does not converge to a solution. On the other hand, the second type of these tests can be applied for any distribution whether its CF is in closed form or not.
Since the CF of the generalized exponential distribution is not in a closed form, then the test of Towhidi and Salmanpour (2007) is the most appropriate in this case. This test is explained in details as follows.
The test of Towhidi and Salmanpour (2007) is given by:
* 1
0 0
( ) ( ) ( ) ( )
n n m m k k k n m m
T Z T Z T B B Z T Z T (2.3)
Where
1
2
1
2
Re ( ) Re ( )
Re ( ) ( )
Im ( ) Im ( )
Im ( ) n n n m n m n n t t t Z T t t t , 1 2 0 1 2
Re ( ) Re ( )
Re ( ) ( )
Im ( ) Im ( )
i) Tm ( , ,..., )t t1 2 tm is an arbitrary point in m, where m is a positive integer
that is equal to the sample sizen.
ii) ( )t is the CF of the hypothesized distribution and n( )t is the ECF.
iii) Imn( )t =
1
1 n sin( ) j j
tX n
and Ren( )t =
1
1 n cos( ) j j
tX n
are the imaginary
and real parts of n( )t , respectively.
iv) Im ( )t =E[sin( )]tX and Re ( ) t =E[cos( )]tX are the imaginary and real parts of ( )t , respectively.
v) k is a k k diagonal matrix whose diagonal elements, 1, ,...,2 k , are
the k largest eigen values of 0.0 be the 2m2m variance-covariance
matrix
of Y Tj( )m under H0. Where
1 1
1 1
cos( ) cos( )
cos( ) cos( ) ( )
sin( ) sin( )
sin( ) sin( )
j j
m j m j
j m
j j
m j m j
t X E t X
t X E t X
Y T
t X E t X
t X E t X
(2.5)
And the integer value k is chosen such that the ratio 2
1 1 k m j j j j
is nearto 1.
vi) Bk is a 2m k matrix whose columns are the eigen vectors, B B1, 2,...,Bk,
corresponding to 1, ,...,2 k.
It is known that the standard procedure for testing H0 , given in (1.4), is to
compare the value of the test statistic for the observed sample with its theoretical critical values (percentage points) under H0.
The exact distribution of the test statistic, *
n
T , under H0 is difficult; if not
impossible; to obtain. In this section, Monte Carlo procedures are employed to obtain the empirical critical values of *
n
T under the GE( , ) null distribution. Two significance levels will be considered that are 0.05 and 0.01. Different sample sizes are considered n 10, 20 and 30 with N= 10,000 replications. In addition, different values for the shape parameter
were considered,0.5,1,1.5, 2, 2.5,
To find the empirical critical values for the different significance levels, the following steps are performed:
Step (1): Generate N random samples of size n from the GE( , ). This can be done by firstly generating a random sample from the uniform(0,1) distribution,
n
u u
u1, 2,..., . Then the uniform random numbers can be transformed to
generalized exponential random numbers using the following transformation:
] 1ln[ ) 1
( 1
i
i u
x , i 1,2,…,n (2.6)
Step (2): Estimate the parameters of the GE( , ) using the ML method. Step (3): Calculate the value of *
n
T for each random sample. This is done as follows:
i) Select the values t1,...,tm by the relation: ti F1mi1, i 1,...,m
,
where F1 is the inverse distribution function of the GE( , ) and
m n (the sample size).
ii) Define the vector Tm
t1,...,tm
and evaluate the vector Z Tn
m Z T0
m .Where Z Tn
m and Z T0
m are defined in (2.4).iii) Determine the matrix 0 which is the 2m2m variance-covariance matrix
of Y Tj( )m , which is defined in (2.5), under H0. Then derive its eigen
values and eigen vectors.
iv) Arrange the eigen values, i, such that 12 ... 2m. Then, choose an
integer k so that the ratio 1 2
1
k i i
m i i
is large enough (near to one).Construct the matrices k and Bk.
v) Calculate the value of *
n
T as in (2.3).
Step (4): Arrange the Nvalues of *
n
T in an ascending order Step (5): Calculate the 100(1- )thpercentile from the ordered *
n
T values.
4. Goodness-of-fit tests based on EDF for the generalized exponential distribution
The most commonly used goodness-of-tests are the tests based on EDF. These test statistics generally measure in a different way the distance between a CDF;
0( )
F x ; and the EDF ; denoted byF xn( ); which is calculated from the sample.
There are many tests based on EDF statistics such as the K-S, C-M and A-D tests. In the last two decades, many authors (Lawless, 1982; Liao and Shimokawa, 1999b; Littell et al., 1979; Park and Seoh, 1994; Stephens, 1974) have reported that the A-D and C-M tests are more powerful than the K-S test. At present, the A-D test is mostly applied in the statistical analysis of lifetime data (Liao and Shimokawa, 1999a).
When F x0( ) is completely specified (i.e. does not contain unknown parameters)
and the data are uncensored, the EDF tests are all distribution free and percentage points for these test statistics are generally known. However, this is no longer the case when data are censored or F x0( ) involves unknown parameters. Therefore; if the hypothesized distribution contains unknown parameters that must be estimated from the sample data, then the standard tables used for the EDF statistics are no longer valid and the resulting test would be inaccurate. In this case, these statistics can be modified by inserting estimates of unknown parameters in F x0( ). However, it is not possible to obtain an analytic expression for the distribution of these modified statistics. Hence, the critical values for these modified statistics are usually obtained by Monte Carlo simulation methods (Abd-Elfattah et al., 2010).
In this section; the empirical percentage points of several modified EDF statistics, under the null hypothesis that the data are from the GE(
,), will be obtained by Monte Carlo simulations. The considered EDF statistics are:1. The K-S test statistic:
max ,
D D D , (4.1)
where:D maxi
i n F x0
i , , ˆ ˆ
and
0 ˆ ˆ 1
maxi i , , i
D F x
n
0 i, ,ˆ ˆ
2. The C-M statistic:
22
0 1
1 , ,ˆ ˆ 2 1
12 2
n
i i
i
W F x
n n
. (4.2)3. The WA statistic:
20
2 2 1
ˆ ˆ
, , 1 2 n
i i F x
U W n
. (4.3)4. The A-D statistic:
20 0 1
1
1 n 2 1 ln , ,ˆ ˆ ln 1 , ,ˆ ˆ
i n i
i
A n i F x F x
n
. (4.4)5. The L-S statistic:
0 0 1 0 0 1 ˆ ˆ ˆ ˆmax , , , , ,
1
ˆ ˆ
ˆ ˆ
, , 1 , ,
i i
n n
i
i i
i F x F x i
n n
L
n F x F x
. (4.5)The steps for obtaining critical values for these five EDF statistics are the same steps followed for obtaining critical values for the *
n
T statistic in the previous section. Except for step (5) where the D,W 2, U2, A2 and
n
L statistics must be calculated from each sample. These steps were performed using the Mathcad (version 13) software. Different sample sizes were considered: n 10, 20 and 30. Two significance levels considered: 0.05 and 0.01. For the GE(
, ); different values for the shape parameter
were considered that are:
0.5(0.5) 3, while was always considered to be 1.The critical values for the different modified EDF statistics are listed in table (1) in the appendix with the critical values of the *
n
Table (1): Critical values of the EDF and ECF Statistics
n Statistic Value of0.5
1 the shape parameter of the null distribution1.5 2 2.5 310
D 0.05 0.748 0.847 0.899 0.938 0.962 0.976
0.01 0.942 0.991 0.999 1 1 1
2
W 0.050.01 2.079 2.679 2.973.071 3.303 3.33 3.1263.333 3.2133.333 3.2623.333
2
A 0.05 13.303 22.269 31.263 40.1090.01 30.878 59.165 90.529 125.136 164.16449.083 207.31658.086
2
U 0.050.01 1.917 2.472 2.7432.839 3.054 3.08 2.8893.083 3.0832.97 3.0163.083
n
L 0.05 10.64 18.60 31.64 57.34 104.83 202.07
0.01 103.44 253.50 1344.43 13997.04 226276.89 5730672.82
*
n
T 0.05 9.044 8.957 8.38 8.38 8.267 8.114
0.01 50.361 22.702 20.467 20.467 21.183 19.106
20
D 0.050.01 0.489 0.569 0.6210.660 0.759 0.818 0.6600.857 0.6900.884 0.7140.906
2
W 0.050.01 1.827 2.627 3.1873.432 4.636 5.280 3.6165.675 3.9575.947 4.2386.131
2
A 0.05 10.012 15.377 20.063 24.3730.01 19.671 32.797 44.691 55.855 28.54467.393 32.46678.512
2
U 0.050.01 1.753 2.521 3.0573.297 4.455 5.075 3.4695.455 3.7985.718 4.0685.897
n
L 0.05 4.964 6.372 7.917 9.440 11.199 13.068
0.01 16.043 17.001 22.258 30.864 45.057 66.663
*
n
T 0.05 2.942 3.598 4.582 4.831 4.968 4.61
0.01 9.754 11.161 10.644 10.296 9.437 8.658
30
D 0.05 0.388 0.453 0.497 0.529 0.554 0.570
0.01 0.512 0.599 0.652 0.688 0.718 0.740
2
W 0.05 1.668 2.436 3.022 3.498 3.892 4.155
0.01 3.113 4.509 5.464 6.131 6.611 6.846
2
A 0.050.01 16.647 26.291 34.883 43.1319.178 13.797 17.530 21.200 24.23550.966 26.48455.744
2
U 0.05 1.625 2.372 2.941 3.407 3.788 4.040
0.01 3.031 4.388 5.319 5.969 6.437 6.665
n
L 0.05 3.971 4.943 5.786 6.650 7.420 7.844
0.01 8.085 9.499 11.323 13.282 15.796 17.586
*
n
T 0.05 1.687 1.833 2.298 2.552 2.669 2.742
5. Power Study
The power of a goodness-of-fit test is defined as the probability that a statistic will lead to the rejection of the null hypothesis, H0, when it is false. The power of a
goodness-of-fit test at a significance level is denoted by 1, where is the probability of committing a type-II error, that is failing to reject a false null hypothesis.
In this section a power comparison is conducted between the different EDF tests and the ECF test of Towhidi and Salmanpour (2007), for the GE( , ) with unknown parameters. For the alternative hypothesis, Ha, that the random
sample comes from another distribution; several alternative distributions were considered. These distributions are:
i) Chi-Square distribution with shape parameter
(=1.5 and 3).ii) Exponential distribution with scale parameter (=1).
iii) Gamma distribution with shape parameter
(=1.5 and 2.5). iv) Lognormal distribution with scale parameter (=1).v) Weibull distribution with shape parameter
(=1.5).The steps for obtaining the power of any test are as follows:
Step (1): Generate N (=10,000) random samples of size n from each alternative distribution.
Step (2): Calculate the value of the test statistic from each sample. Step (3): Arrange theNvalues of the test statistic in an ascending order.
Step (4): From table (1), and for each significance level ( ); get the critical value, call itc, of each test statistic under the null distribution GE(
,).Step (5): Calculate L = the number of the test statistic values that are greater thanc.
Step (6): Calculate the power of the test:
1= LN
These steps were performed using the Mathcad (version 13) software. Different sample sizes were considered: n10, 20 and 30 with two significance levels:
0.01 and 0.05. The results of the power comparison are shown, for each alternative distribution, in tables (2) to (9).
Table (2): Power values when the alternative distribution is Chi-square (
1.5)n Test Value ofof the null distribution
the shape parameter0.5 1 1.5 2 2.5 3
10
D 0.05 0.277 0.007 0.004 0.003 0.002 0.002
0.01 0.003 0.001 0.001 0 0 0
2
W 0.050.01 0.0220.003 0.0060.001 0.0040.001 0.0030 0.0020 0.0010
2
A 0.050.01 0.2050.049 0.0900.024 0.0480.020 0.0320.005 0.0260.003 0.0240.003
2
U 0.050.01 0.0220.003 0.0060.001 0.0040.001 0.0030 0.0020 0.0010
n
L 0.05 0.366 0.302 0.256 0.217 0.185 0.158
0.01 0.185 0.149 0.101 0.062 0.037 0.022
*
n
T 0.05 0.247 0.247 0.256 0.256 0.257 0.259
0.01 0.105 0.161 0.169 0.169 0.166 0.175
20
D 0.05 0.264 0.107 0.045 0.019 0.008 0.004
0.01 0.019 0.001 0 0 0 0
2
W 0.050.01 0.2480.011 0.00020.075 0.0220 0.0070 0.0030 0.0010
2
A 0.050.01 0.4840.238 0.3280.092 0.2330.044 0.1670.027 0.1250.017 0.0940.011
2
U 0.050.01 0.2480.011 0.0750 0.0220 0.0070 0.0030 0.0010
n
L 0.05 0.600 0.543 0.496 0.460 0.428 0.400
0.01 0.368 0.359 0.325 0.292 0.260 0.229
*
n
T 0.05 0.579 0.543 0.494 0.483 0.479 0.493
0.01 0.340 0.311 0.322 0.328 0.346 0.366
30
D 0.050.01 0.5030.146 0.2990.029 0.1820.005 0.1150.002 0.0760.001 0.00020.056
2
W 0.050.01 0.5270.148 0.2970.014 0.1650.001 0.00010.089 0.0460 0.0290
2
A 0.050.01 0.6780.457 0.5370.255 0.4340.144 0.3480.085 0.2880.050 0.2510.037
2
U 0.050.01 0.5270.149 0.2970.014 0.1660.001 0.00010.089 0.0470 0.0300
n
L 0.05 0.751 0.694 0.645 0.606 0.579 0.562
0.01 0.554 0.518 0.476 0.441 0.410 0.391
*
n
T 0.05 0.790 0.777 0.745 0.727 0.719 0.714
Table (3): Power values when the alternative distribution is Chi-square (
3)n Test Value ofof the null distribution
the shape parameter0.5 1 1.5 2 2.5 3
10
D 0.050.01 0.3320.009 0.1040.001 0.0230 0.0100 0.0050 0.0030
2
W 0.050.01 0.3370.017 0.0780.001 0.0280 0.0120 0.0060 0.0030
2
A 0.050.01 0.6570.336 0.4760.119 0.3310.082 0.2290.058 0.1620.024 0.1230.020
2
U 0.05 0.337 0.078 0.028 0.012 0.006 0.003
0.01 0.017 0.001 0 0 0 0
n
L 0.05 0.716 0.643 0.579 0.517 0.461 0.410
0.01 0.462 0.394 0.296 0.198 0.125 0.080
*
n
T 0.05 0.586 0.588 0.599 0.599 0.600 0.604
0.01 0.241 0.397 0.418 0.418 0.412 0.433
20
D 0.050.01 0.9060.570 0.8000.225 0.6890.075 0.5700.025 0.4730.007 0.3850.002
2
W 0.050.01 0.9180.610 0.8110.185 0.6870.041 0.5510.008 0.4300.002 0.3240.001
2
A 0.050.01 0.9520.864 0.9090.699 0.8600.526 0.8110.394 0.7550.282 0.7040.206
2
U 0.050.01 0.9180.609 0.8110.185 0.6880.041 0.5520.008 0.4310.002 0.3250.001
n
L 0.05 0.954 0.935 0.915 0.899 0.878 0.857
0.01 0.830 0.823 0.786 0.744 0.690 0.639
*
n
T 0.05 0.928 0.913 0.892 0.886 0.884 0.891
0.01 0.795 0.767 0.777 0.783 0.799 0.814
30
D 0.05 0.984 0.964 0.936 0.908 0.873 0.847
0.01 0.924 0.783 0.617 0.471 0.339 0.246
2
W 0.050.01 0.9880.943 0.9720.797 0.9470.595 0.9180.395 0.8820.250 0.8510.182
2
A 0.050.01 0.9910.975 0.9820.936 0.9730.884 0.9600.818 0.9470.743 0.9350.691
2
U 0.050.01 0.9880.943 0.9720.798 0.9470.595 0.9180.395 0.8810.251 0.8510.183
n
L 0.05 0.992 0.987 0.982 0.977 0.973 0.969
0.01 0.968 0.956 0.943 0.928 0.911 0.899
*
n
T 0.05 0.948 0.945 0.931 0.925 0.920 0.919
Table (4): Power values when the alternative distribution is Exponential (1)
n Test Value ofof the null distribution
the shape parameter0.5 1 1.5 2 2.5 3
10
D 0.050.01 0.0870.025 0.0500.010 0.0350.005 0.0260.000 0.0190.000 0.0150.000
2
W 0.050.01 0.0920.027 0.0500.010 0.0340.005 0.0240.003 0.0180.003 0.0130.003
2
A 0.050.01 0.1010.031 0.0500.010 0.0300.005 0.0200.002 0.0140.001 0.0100.001
2
U 0.050.01 0.0920.027 0.0500.010 0.0340.005 0.0240.003 0.0180.003 0.0130.003
n
L 0.05 0.082 0.050 0.034 0.023 0.016 0.012
0.01 0.016 0.011 0.005 0.002 0.001 0.001
*
n
T 0.050.01 0.4630.186 0.4650.305 0.4760.323 0.4760.323 0.4790.317 0.4830.332
20
D 0.050.01 0.0900.025 0.0520.010 0.0340.005 0.0250.003 0.0190.002 0.0150.001
2
W 0.050.01 0.0910.027 0.0500.009 0.0320.005 0.0230.003 0.0170.002 0.0130.001
2
A 0.050.01 0.1010.031 0.0510.009 0.0290.004 0.0180.002 0.0120.001 0.0090.001
2
U 0.050.01 0.0910.027 0.0500.009 0.0320.005 0.0230.003 0.0170.002 0.0130.001
n
L 0.05 0.084 0.050 0.030 0.022 0.017 0.012
0.01 0.010 0.009 0.006 0.004 0.003 0.002
*
n
T 0.05 0.099 0.079 0.060 0.056 0.055 0.059
0.01 0.021 0.017 0.018 0.019 0.022 0.025
30
D 0.05 0.093 0.048 0.032 0.024 0.018 0.016
0.01 0.027 0.011 0.007 0.004 0.003 0.002
2
W 0.050.01 0.0950.032 0.0500.011 0.0340.006 0.0240.003 0.0190.002 0.0150.002
2
A 0.050.01 0.1000.035 0.0490.011 0.0300.005 0.0200.002 0.0150.001 0.0110.001
2
U 0.050.01 0.0940.032 0.0500.011 0.0340.006 0.0240.003 0.0190.002 0.0150.002
n
L 0.05 0.093 0.052 0.034 0.023 0.016 0.014
0.01 0.013 0.009 0.006 0.004 0.003 0.002
*
Table (5): Power values when the alternative distribution is Exponential (2)
n Test Value ofof the null distribution
the shape parameter0.5 1 1.5 2 2.5 3
10
D 0.050.01 0.3800.130 0.2480.054 0.1840.024 0.1350.000 0.1030.000 0.0820.000
2
W 0.050.01 0.3880.136 0.2400.051 0.1640.024 0.1200.014 0.0900.014 0.0720.014
2
A 0.050.01 0.3800.138 0.2180.045 0.1360.019 0.0900.009 0.0630.005 0.0470.004
2
U 0.050.01 0.3880.136 0.2400.052 0.1650.025 0.1200.014 0.0910.014 0.0720.014
n
L 0.05 0.213 0.128 0.083 0.056 0.039 0.028
0.01 0.040 0.025 0.013 0.006 0.004 0.002
*
n
T 0.050.01 0.7980.480 0.7990.668 0.8070.687 0.8070.687 0.8080.679 0.8100.697
20
D 0.050.01 0.6690.360 0.5260.203 0.4340.128 0.3600.085 0.3090.063 0.2670.047
2
W 0.050.01 0.6800.374 0.5230.189 0.4190.112 0.3420.074 0.2870.050 0.2460.036
2
A 0.050.01 0.6590.374 0.4850.168 0.3650.088 0.2810.048 0.2180.028 0.1710.017
2
U 0.050.01 0.6800.373 0.5230.189 0.4200.112 0.3420.074 0.2880.051 0.2460.036
n
L 0.05 0.478 0.329 0.230 0.165 0.123 0.094
0.01 0.064 0.058 0.036 0.022 0.012 0.007
*
n
T 0.05 0.240 0.197 0.156 0.147 0.144 0.155
0.01 0.064 0.054 0.058 0.060 0.068 0.076
30
D 0.050.01 0.8390.597 0.7230.405 0.6300.293 0.5610.225 0.5050.177 0.4690.145
2
W 0.050.01 0.8530.622 0.7390.403 0.6380.269 0.5590.193 0.4970.148 0.4560.128
2
A 0.050.01 0.8310.606 0.6930.370 0.5820.228 0.4850.141 0.4150.088 0.3660.066
2
U 0.050.01 0.8520.621 0.7380.403 0.6380.269 0.5580.193 0.4970.148 0.4560.129
n
L 0.050.01 0.7230.225 0.5630.152 0.4440.095 0.3430.061 0.2730.036 0.2420.026
*
n
Table (6): Power values when the alternative distribution is Gamma (
1.5)n Test Value ofof the null distribution
the shape parameter0.5 1 1.5 2 2.5 3
10
D 0.050.01 0.0750.022 0.0450.011 0.0320.006 0.0240.000 0.0190.000 0.0160.000
2
W 0.050.01 0.0770.026 0.0450.012 0.0310.006 0.0230.004 0.0180.004 0.0150.004
2
A 0.050.01 0.1110.034 0.0550.013 0.0330.007 0.0230.004 0.0170.003 0.0130.002
2
U 0.050.01 0.0770.025 0.0450.012 0.0310.006 0.0230.004 0.0180.004 0.0150.004
n
L 0.050.01 0.1210.028 0.0790.019 0.0540.010 0.0380.005 0.0280.003 0.0210.002
*
n
T 0.050.01 0.3010.050 0.3030.149 0.3160.166 0.3160.166 0.3190.161 0.3230.174
20
D 0.050.01 0.0840.016 0.0370.007 0.0220.004 0.0160.003 0.0110.002 0.0090.002
2
W 0.050.01 0.0820.017 0.0340.007 0.0210.004 0.0140.003 0.0110.002 0.0090.002
2
A 0.050.01 0.1310.034 0.0630.010 0.0330.004 0.0200.003 0.0140.002 0.0110.002
2
U 0.050.01 0.0820.017 0.0340.007 0.0210.004 0.0140.003 0.0110.002 0.0090.002
n
L 0.050.01 0.1460.027 0.0990.025 0.0680.020 0.0520.014 0.0410.011 0.0340.007
*
n
T 0.050.01 0.2790.102 0.2440.086 0.2040.091 0.1960.095 0.1930.105 0.2040.115
30
D 0.050.01 0.1150.020 0.0480.006 0.0250.003 0.0160.002 0.0110.001 0.0080.001
2
W 0.050.01 0.1210.022 0.0450.006 0.0240.003 0.0140.002 0.0100.001 0.0080.001
2
A 0.050.01 0.1690.054 0.0810.016 0.0470.006 0.0280.002 0.0190.002 0.0150.001
2
U 0.050.01 0.1210.022 0.0450.006 0.0240.003 0.0140.002 0.0100.001 0.0080.001
n
L 0.05 0.183 0.120 0.085 0.064 0.052 0.047
0.01 0.044 0.033 0.024 0.018 0.014 0.012
*
n
Table (7): Power values when the alternative distribution is Gamma (
2.5)n Test Value ofof the null distribution
the shape parameter0.5 1 1.5 2 2.5 3
10
D 0.05 0.090 0.033 0.020 0.016 0.013 0.012
0.01 0.016 0.009 0.006 0.000 0.000 0.000
2
W 0.050.01 0.0860.018 0.0300.009 0.0210.006 0.0160.004 0.0140.004 0.0120.004
2
A 0.050.01 0.2360.053 0.1060.014 0.0510.008 0.0300.006 0.0200.004 0.0150.003
2
U 0.050.01 0.0860.018 0.0300.009 0.0210.006 0.0160.004 0.0140.004 0.0120.004
n
L 0.05 0.262 0.185 0.135 0.098 0.072 0.053
0.01 0.072 0.048 0.025 0.012 0.006 0.004
*
n
T 0.05 0.220 0.221 0.230 0.230 0.232 0.235
0.01 0.069 0.121 0.129 0.129 0.125 0.134
20
D 0.05 0.390 0.218 0.125 0.073 0.042 0.027
0.01 0.073 0.009 0.003 0.001 0.001 0.001
2
W 0.050.01 0.4070.077 0.2150.005 0.1100.002 0.0570.001 0.0290.001 0.0140.001
2
A 0.050.01 0.4940.255 0.3480.087 0.2480.031 0.1790.013 0.1270.005 0.0900.002
2
U 0.050.01 0.4070.076 0.2150.005 0.1110.002 0.0570.001 0.0290.001 0.0140.001
n
L 0.05 0.492 0.404 0.328 0.277 0.237 0.205
0.01 0.168 0.158 0.120 0.089 0.064 0.046
*
n
T 0.05 0.253 0.222 0.184 0.177 0.173 0.183
0.01 0.093 0.080 0.085 0.088 0.096 0.104
30
D 0.05 0.601 0.442 0.322 0.246 0.192 0.162
0.01 0.285 0.107 0.041 0.017 0.006 0.003
2
W 0.050.01 0.6300.321 0.4620.106 0.3400.028 0.2490.006 0.1850.001 0.1480.001
2
A 0.050.01 0.6710.470 0.5430.268 0.4480.158 0.3640.090 0.3030.047 0.2640.032
2
U 0.050.01 0.6300.320 0.4620.107 0.3400.028 0.2480.006 0.1850.001 0.1480.001
n
L 0.05 0.668 0.580 0.506 0.444 0.395 0.372
0.01 0.360 0.301 0.242 0.200 0.164 0.147
*
n
T 0.05 0.567 0.554 0.517 0.506 0.501 0.499
Table (8): Power values when the alternative distribution is Lognormal (1)
n Test Value ofof the null distribution
the shape parameter0.5 1 1.5 2 2.5 3
10
D 0.050.01 0.1140.027 0.0590.012 0.0380.006 0.0280.000 0.0220.000 0.0170.000
2
W 0.050.01 0.1270.032 0.0620.012 0.0400.006 0.0280.004 0.0210.004 0.0170.004
2
A 0.050.01 0.2300.101 0.1380.070 0.1000.057 0.0840.018 0.0750.013 0.0700.012
2
U 0.050.01 0.1260.032 0.0620.012 0.0390.006 0.0280.004 0.0210.004 0.0170.004
n
L 0.05 0.326 0.274 0.240 0.211 0.188 0.167
0.01 0.189 0.161 0.128 0.098 0.074 0.058
*
n
T 0.05 0.602 0.602 0.609 0.609 0.610 0.611
0.01 0.500 0.542 0.548 0.548 0.546 0.552
20
D 0.05 0.222 0.126 0.082 0.057 0.041 0.033
0.01 0.057 0.020 0.009 0.006 0.004 0.003
2
W 0.050.01 0.2460.071 0.1400.021 0.0880.011 0.0590.007 0.0430.004 0.0330.003
2
A 0.050.01 0.3410.180 0.2360.101 0.1760.078 0.1400.068 0.1190.049 0.1030.033
2
U 0.050.01 0.2450.071 0.1400.021 0.0880.010 0.0590.007 0.0430.004 0.0330.003
n
L 0.05 0.450 0.405 0.371 0.348 0.329 0.315
0.01 0.296 0.292 0.275 0.252 0.234 0.219
*
n
T 0.050.01 0.4030.152 0.3570.128 0.3050.136 0.2920.141 0.2870.158 0.3030.175
30
D 0.050.01 0.3010.117 0.1940.048 0.1340.025 0.1000.013 0.0800.008 0.0680.006
2
W 0.050.01 0.3460.152 0.2270.063 0.1620.029 0.1210.014 0.0910.009 0.0770.007
2
A 0.050.01 0.4270.268 0.3190.162 0.2550.113 0.2100.088 0.1780.074 0.1600.065
2
U 0.050.01 0.3450.152 0.2260.063 0.1620.029 0.1200.013 0.0910.009 0.0770.007
n
L 0.05 0.537 0.489 0.458 0.432 0.415 0.407
0.01 0.402 0.379 0.360 0.346 0.332 0.323
*
Table (9): Power values when the alternative distribution is Weibull (
1.5)n Test Value ofof the null distribution
the shape parameter0.5 1 1.5 2 2.5 3
10
D 0.05 0.442 0.3198 0.253 0.203 0.168 0.142
0.01 0.198 0.103 0.058 0.000 0.000 0.000
2
W 0.050.01 0.457 0.3204 0.2440.212 0.104 0.060 0.0402 0.0402 0.04020.194 0.159 0.134
2
A 0.050.01 0.4660.233 0.3180.105 0.2290.056 0.1730.035 0.1350.023 0.1080.016
2
U 0.050.01 0.457 0.3204 0.2440.212 0.104 0.061 0.0405 0.0405 0.04050.194 0.159 0.135
n
L 0.05 0.335 0.240 0.180 0.137 0.107 0.083
0.01 0.108 0.077 0.049 0.030 0.019 0.013
*
n
T 0.05 0.061 0.063 0.083 0.083 0.086 0.092
0.01 0.001 0.001 0.001 0.001 0.001 0.001
20
D 0.05 0.696 0.568 0.481 0.418 0.370 0.333
0.01 0.418 0.267 0.185 0.137 0.108 0.085
2
W 0.050.01 0.7140.442 0.5730.264 0.4820.179 0.4130.131 0.3610.100 0.3170.079
2
A 0.050.01 0.7070.461 0.5560.262 0.4530.165 0.3740.111 0.3110.076 0.2650.053
2
U 0.050.01 0.7130.442 0.5730.264 0.4820.180 0.4130.131 0.3610.100 0.3180.079
n
L 0.05 0.577 0.454 0.352 0.285 0.234 0.193
0.01 0.152 0.141 0.105 0.074 0.048 0.033
*
n
T 0.05 0.466 0.409 0.343 0.330 0.324 0.342
0.01 0.150 0.118 0.129 0.136 0.158 0.180
30
D 0.05 0.828 0.733 0.655 0.591 0.542 0.509
0.01 0.624 0.448 0.343 0.276 0.227 0.193
2
W 0.050.01 0.8460.654 0.7490.458 0.6670.336 0.6040.257 0.5490.210 0.5090.188
2
A 0.050.01 0.8340.654 0.7220.451 0.6360.316 0.5560.223 0.4910.164 0.4470.135
2
U 0.050.01 0.8450.653 0.7480.458 0.6670.336 0.6040.257 0.5490.210 0.5090.188
n
L 0.05 0.769 0.649 0.553 0.460 0.392 0.361
0.01 0.345 0.263 0.196 0.145 0.106 0.086
*
n
When comparing the power values of the EDF tests with the ECF test, from tables (2) to (9), it can be noticed that:
1. When n 10, The ECF test is the most powerful test in most of the cases. Some exceptions happen as in the case for the Weibull ( 1.5) alternative where it can be seen that the EDF tests are powerful than the ECF test.
2. When n 20,
i. The ECF test is the most powerful test when the alternative distribution is gamma ( 1.5). Also, it is the most powerful test when the alternative is the exponential (1) except when the shape parameter of the null distribution 0.5 where the A-D test has the highest power value.
ii. The L-S test is the most powerful test when the alternative distribution is lognormal. Also, it is the most powerful test when the alternative is the Chi-square ( 3) distribution except when =0.01 and the shape parameter of the null distribution 0.5 where the A-D test has the highest power value.
iii. For the Chi-square ( 1.5) alternative, the ECF test is the most powerful when the shape parameter of the null distribution 2, 2.5 and 3. On the other hand, the L-S test is the most powerful test when the shape parameter of the null distribution 0.5, 1 and 1.5.
iv. For the Chi-square ( 3) alternative, exponential (2) and Weibull ( 1.5) alternatives, the EDF tests have higher power than the ECF test. Except for large values of the shape parameter value of the null distribution
2.5 and 3 where the ECF test has higherpower.
3. When n 30,
i. The ECF test is the most powerful test when the alternatives are the Chi-square ( 1.5) and exponential
1
.ii. The L-S test is the most powerful test when the alternative is the lognormal distribution.
iii. For the Chi-square ( 3) alternative; the L-S test is the most powerful except when 0.01 and the shape parameter of the null distribution is 0.5 where the A-D test is the most powerful.
iv. For the exponential ( 2) alternative, the following is noticed:
a. At 0.05 and when the shape parameter of the null distribution is 0.5,1 and 1.5; the C-M test is the most powerful. In addition; when the shape parameter of the null distribution is
b. At 0.01 and when the shape parameter of the null distribution is 0.5; the C-M test is the most powerful. In addition; when the shape parameter of the null distribution is
1 the K-S test is the most powerful.c. For the rest of the cases, the ECF test has the greatest power. v. For the gamma
1.5
alternative; the ECF test is the mostpowerful test in most of the cases except when the value of the shape parameter of the null distribution is
0.5 where the L-S test is the most powerful at 0.05, but, when 0.01 the A-D test is the most powerful.vi. For the gamma
2.5
alternative; the ECF test is the most powerful test in most of the cases except when the value of the shape parameter of the null distribution is
0.5 where the A-D test is the most powerful. Also, when 0.05 and
1 the L-S test is the most powerful.vii. For the Weibull ( 1.5) alternative, it can be seen that the EDF tests are powerful than the ECF test. The only exception happens is when the value of the shape parameter of the null distribution is 3, where the ECF test is the most powerful.
6. Conclusions
Generally it can be concluded that there is no one test that is optimal for all of the cases. It was shown that the most powerful test depends on the sample size, the alternative distribution, the significance level and the value of the shape parameter of the null distribution. Also, it is concluded that the ECF test is the best when the sample size is small (n 10) except when the alternative distribution is Weibull (
1.5) where the EDF tests are better.References
1. Abd-Elfattah, A. M., Fergany, H. A. and Omima, A. M. (2010). Goodness of fit tests for the generalized Frechet distribution. Australian Journal of Basic and Applied Sciences, 4(2): 286-301.
2. Gupta, R.D. and Kundu, D. (1999). Generalized exponential distributions,
Australian and New Zealand Journal of Statistics, 41(2), 173-188.
3. Gupta, R.D. and Kundu, D. (2001a). Exponentiated exponential Family: an alternative to gamma and Weibull distributions, Biometrical Journal, 43(1), 117-130.
5. Gupta, R.D. and Kundu, D. (2002). Generalized exponential distributions: statistical inferences, Journal of Statistical Theory and Applications, 1(1), 101-118.
6. Gupta, R.D. and Kundu, D. (2003a). Closeness of gamma and generalized exponential distribution, Communication in Statistics – Theory and Methods, 32, 705-721.
7. Gupta, R.D. and Kundu, D. (2003b), Discriminating between Weibull and generalized exponential distribution, Computational statistics and data analysis, 43, 179-196.
8. Hassan, A. S. (2005). Goodness-of-Fit for the Generalized Exponential Distribution. Interstat electronic Journal:
http://interstat.statjournals.net/YEAR/2005/abstracts/0507001.php
9. Lawless, J. F. (1982). Statistical models and methods for lifetime data. John Wiley & Sons.
10. Liao, M. and Shimokawa, T. (1999a). A new goodness-of-fit test for Type-I extreme-value and 2-parameter Weibull distributions with estimated parameters". Journal of Statistical Computation and Simulation, 64, 23-48. 11. Liao, M. and Shimokawa, T. (1999b). Goodness-of-fit test for the Type-I
extreme-value and two-parameter Weibull distributions with unknown parameters estimated by graphical plotting techniques, Part 2: Power study.National Aerospace Laboratory, Technical Report, TR-1372T. 12. Littell, R. C., McClave, J. T. and Offen, W. W. (1979). Goodness-of-Fit
tests for two parameters Weibull distribution. Communications in Statistics-Simulation and Computation, volume 8, 257-269.
13. Park, W. J. and Seoh, M. (1994). More goodness-of-fit tests for the power law process.IEEE Transactions On Reliability, 43, 275-278.
14. Stephens, M. A. (1974). EDF statistics for goodness-of-fit and some comparisons.Journal of the American Statistical Association, 69, 730-737. 15. Towhidi, M. and Salmanpour, M. (2007). A New Goodness-of-Fit Test Based on the Empirical Characteristic Function. Communications in Statistics-Theory and Methods, 36, 2777-2785.