• No results found

The Distribution of the Concentration Ratio for Samples from a Uniform Population

N/A
N/A
Protected

Academic year: 2020

Share "The Distribution of the Concentration Ratio for Samples from a Uniform Population"

Copied!
15
0
0

Loading.... (view fulltext now)

Full text

(1)

http://dx.doi.org/10.4236/am.2015.61007

The Distribution of the Concentration Ratio

for Samples from a Uniform Population

Giovanni Girone, Antonella Nannavecchia Faculty of Economics, University of Bari, Bari, Italy Email: [email protected], [email protected]

Received 24 October 2014; revised 20 November 2014; accepted 16 December 2014 Copyright © 2015 by authors and Scientific Research Publishing Inc.

This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/

Abstract

In the present paper we derived, with direct method, the exact expressions for the sampling proba-bility density function of the Gini concentration ratio for samples from a uniform population of size n = 6, 7, 8, 9 and 10. Moreover, we found some regularities of such distributions valid for any sample size.

Keywords

Gini Concentration Ratio, Uniform Distribution, Order Statistics, Probability Density Function

1. Introduction

In 1914 Corrado Gini [1] introduced the concentration ratio R for the measure of inequality among values of a frequency distribution. The Gini index is widely used in fields as diverse as sociology, health science, engineer-ing, and in particular, economics to measure the inequality of income distribution.

(2)

ased distributions. They showed that these estimators are strongly consistent for the Gini index. Also, they ob-tained an asymptotic normality for the corresponding Gini index.

Girone (1968) [10] focused on the study of the sampling distribution of the Gini index and in 1971 [11] de-rived the exact expression for samples drawn from an exponential population. In 1971 Girone [12] obtained, with direct method, the sampling distribution function of the Gini ratio for samples of size n≤ 5 drawn from a uniform population.

In the present note (Section 2), we calculate the joint probability density function (p.d.f.) of the random sam-ple of size n and, then, the joint p.d.f. of the n order statistics. Hence, we transform one of the order statistics in their average and the remaining n‒ 1 order statistics are divided by the same average. We calculate the joint p.d.f. of the new n variables and integrating with respect to the average we obtain the joint p.d.f. of the other n‒ 1 variables. One of these variables is transformed in the concentration ratio. We calculate the joint p.d.f. of the concentration ratio and of the other n‒ 2 variables and at last we integrate this p.d.f. with respect to the n – 2 va-riables obtaining the marginal p.d.f. of the concentration ratio. The main difficulty of this procedure consists in the identification of the region of integration of the n‒ 2 variables, for two reasons: firstly the need to decom-pose this region into subregions which allow identifying directly the limits of integration and secondly the growing number of such subregions that makes the derivation heavy.

In Sections 3-7, using the software Mathematica, we derive the exact distributions of the concentration ratio for samples from a uniform distribution of size n = 6, 7, 8, 9 and 10. Moreover (Section 8), we find some regu-larities of such distributions valid for any sample size.

2. The Procedure to Derive the Distribution of the Concentration Ratio

Let random variables X X1, 2, , Xn from a uniform population have p.d.f.

( )

1, 0 1,

0, elsewhere.

x f x =  < <

 (1) The joint p.d.f. of the variables is

(

1 2

)

1, 0 1, for 1,2, , , , , ,

0, elsewhere.

i n

x i n

h x x x =  < < =

 (2)

The joint p.d.f. of the order statistics X X( )1, ( )2, , X( )n is

( ) ( ) ( )

(

)

( )1 ( )2 ( )

1 2

!, 0 1,

, , ,

0, elsewhere.

n n

n x x x h x x x =  < < < < <



 (3)

By transforming the variables

( )1 ( )2 ( )n,

S X= +X + + X

( )i ( )i , for 1,2, , 1, X

D i n

S

= =  −

whose Jacobian is

1,

n

J S=

we obtain the joint p.d.f. of the variables S and D D( )1, ( )2, , D(n1) that can be written as

( ) ( ) ( )

(

)

(

)

1

1 2 1

, , , , 1 ! n ,

n

g s d d d n s

− = −

 (4)

( )1 ( )2 ( 1)

(

( )1 ( )2 ( 1)

)

for 0<sd <sd <<sdn− <s 1−dd − − dn− <1.

We integrate expression [4] with respect to the variable S and obtain the joint p.d.f. of the variables , , ,

(3)

( ) ( ) ( )

(

)

(

)

( ) ( ) ( )

(

)

1 2 1

1 2 1

1 ! , , , , 1 n n n n f d d d

d d d − − − = − − − − 

 (5)

( )1 ( )2 ( 1) ( )1 ( )2 ( 1) for 0<d <d <<dn− < −1 dd − − dn− .

By transforming the variable D(n1) in the variable Ri.e. the concentration ratio

(

)

( ) 1 1 2 1 , 1 n i i

R n i D n

=

= − −

from which we get

( )

(

)(

)

(

)

( ) 2 1 1 1 1 , 2 n n i i n R

D − − n i D

=

− −

= −

the Jacobian of the transformation is

1 2

n

J= −

and the joint p.d.f. of the variable R and D D( )1, ( )2, , D(n2) is

( ) ( ) ( )

(

)

(

)

(

)(

)

(

)

( )

1 2 2 2

1

2 1 !

, , , , ,

2 1 1 2 1

n

n n n

i i

n h d d d R

n R n i d

= − =  + − −    

 (6)

for

( ) ( ) ( )

(

)(

)

(

)

( )

(

)(

)

(

)

( )

2 2

1 2 2

1 1

1 1 1 1

0 1 1 .

2 2

n n

n i i

i i

n R n R

d d d − − n i dn i d

= =

− − − −

< < << < −

− < − +

− − (7)

By integrating expression [6] with respect to the variables D D( )1, ( )2 , , D(n2) over the regions determined

by inequalities [7], we get the marginal p.d.f. of the concentration ratio R.

3. The Distribution of the Concentration Ratio for n = 6

The procedure indicated in Section 2 is used to obtain the following p.d.f. (Figure 1) of the concentration ratio R for random samples of size n = 6:

( )

(

)

2 2 2 2

2 3375 , for

2 1

27 1728 19683 55296 0 1,

5

1 2 3 4

50 25 25 25

5 5 5 5

R

R R R

g

R R

R

− + − + < <

+   +   +   +              +    = 

( )

1252 7853 2 449523 2 677147 2 55296 2, 1 2,

5 5

144 75 1 400 2 225 3 25 4 for

5 5 5 5

R R R R g R R R

− + − + − < <

+   +   +   +

       

       

=

( )

32 2 111312 33773 2 449523 2 19683 2, 2 3,

5 5

240

1 1 2 3

75 75

for

400 25

5 5 5 5

R R

R R R R

g R − + − + < <

− +   +   +   +

       

      

=

( )

27 2 437 2 111312 7853 2 1728 2, 3 4,

5 5

240

2 1 1 2

400 75 75 25

5 5 5 5

for R R R R g R R R

− + − + − < <

− +  − +   +   +

       

   

=

(4)
[image:4.595.72.511.79.548.2]

Figure 1.Probability density function of the concentration ratio R for random samples of size n = 6 from a uniform population.

( )

1 2 27 2 32 2 1252 27 2,fo 4 1.

5 144

3 2 1 1

450 400 75 50

5 5 5 5

r

g R

R R

R

R R R

− + − + < <

− +  − +  − +   +

       

       

=

Characteristic values of the distribution are:

mean

( )

3 11696log 2 243log3 1250log5 0.35222,

15 2 3

E R = − + + − =

second moment

( )

2 16 69056log 2 24219log3 18125log5 0.13716,

25 50 12

E R = − − + =

third moment

( )

3 1368 678688log 2 156006log3 12625log5 0.05785,

25 125 125 4

E R = − + + − =

fourth moment

( )

4 89237 25434496log 2 7691436log3 15290log5 0.02606,

625 3125 3125 3

E R = − − + =

standard deviation σ

( )

R =0.11444,

index of skewness γ1

( )

R =0.20793, index of kurtosis γ2

( )

R = −0.16767.

The distribution of the concentration ratio R for samples of size n = 6 from a uniform population shows a slight positive skewness and platykurtosis.

4. The Distribution of the Concentration Ratio for n = 7

The procedure indicated in Section 2 is used to obtain the following p.d.f. (Figure 2) of the concentration ratio R

for random samples of size n = 7:

( )

(

)

2 2 2 2

2 2

117649 117649 1058841 3764768

1 2 3 4

518400 1620 640 405

6 6 6 6

367653125 1058841 , f 0 1, 6 100 1

5 20736

6

or

R R R R

R R

R

R

g − + − +

+   +   +   +

       

       

− + < <

+  +

 

 

(5)
[image:5.595.150.436.329.722.2]

Figure 2. Probability density function of the concentration ratio R for random samples of size n = 7 from a uniform population.

( )

2 2 2 2

2 2

9 14356253 2450309 2099205

20 103680 1 810 2 128 3

6 6 6

49230947 367653125 , 1 2,

6 6

4 5

1620 2

fo 6

6

r 073

6

R

R R R

R R

R g

R

− + −

+   +   +

     

     

+ − < <

+   +

   

  

=

( )

2 2 2 2

2 2

3125 7853 46303907 7450309

90

1 1 2

10368 25920 810

6 6 6

2099205 3764768 , 2 3,

6 6

3 4

128 405

6

f 6

or

g R

R

R R R

R

R R

− + − +

− +   +   +

     

     

− + < <

+   +

=

   

   

( )

2 2 2 2

2 2

32 1064201 33773 46303907

90

2 1 1

405 51840 25920

6 6 6

2450309 1058841 , 3 4,

6 6

2 3

810 64

for 0

6 6

R

R R R

R g R

R R

− + −

− +  − +   +

     

     

+ − < <

+   +

   

   

=

( )

2 2 2 2

2 2

9 463 1064201 7853

90

3 2 1

1280 324 51840

6 6 6

14356253 117649 , 4 5,

6 6

1 2

103680

fo 162

r 0

6 6

R R

R R

R

R g

R R

− + − +

− +  − +  − +

     

     

− + < <

+   +

   

   

(6)

( )

2 2 2 2

2 2

1 9 32 3125

4 3 2 1

8100 1280 405 10368

6 6 6 6

9 117649 , 5 1.

6 20 518 00 1 r

6

o 4

f

g R

R R R R

R R

R

− + −

− +  − +  − +  − +

       

       

+ − < <

+

 

 

=

Characteristic values of the distribution are:

mean

( )

7 35072log 2 21797log3 359375log5 823543log 7 0.34951,

2 15 160 288 1440

E R = − + − + + =

second moment

( )

2 763 3806128log 2 451251log3 359375log5 15647317log 7 0.13291,

36 405 80 72 6480

E R = + + − − =

third moment

( )

3 18179 2771144log 2 4266351log3 18546875log5 5764801log 7 0.05417,

216 135 320 1728 960

E R = − − − + + =

fourth moment

( )

4 165193 40601588log 2 7638867log3 263234375log5 895191241log 7 0.02342,

648 1215 320 15552 77760

E R = + + + − =

standard deviation σ

( )

R =0.10367,

index of skewness γ1

( )

R =0.18545, index of kurtosis γ2

( )

R = −0.14535.

The distribution of the concentration ratio R for samples of size n = 7 from a uniform population shows slight positive skewness and platykurtosis, both lower than those obtained for samples of size n = 6.

5. The Distribution of the Concentration Ratio for n = 8

The procedure indicated in Section 2 is used to obtain the following p.d.f. (Figure 3) of the concentration ratio R

for random samples of size n = 8:

( )

(

)

2 2 2 2

2 2 2

8192 2097152 663552 536870912

1 2 3 4

99225 33075 245 19845

7 7 7 7

128000000 169869312 137682944 , 0 1, 7 2025 1

5 6

1323 1225 7

f

7

or

R R R R

R R

R R

g R − + −

+   +   +   +

       

       

+ − + < <

+

+   +

   

  

=

( )

2 2 2 2

2 2 2

16807 9613463 2548367503 386863168 86400 66150 1 423360 2 6615 3

7 7 7

173031518287 19007619787 169869312 , 1 2,

7 7

4 5 6

846720 66150 1225 7

r

7 7

fo

R

R R R

R R

g R

R R

− + − +

+   +   +

     

     

− + − < <

+   +   +

     

     

=

( )

2 2 2 2

2 2 2

81 14356253 31094051 18601017487 120960

1 1 2

490 6615 423360

7 7 7

1969741961 173031518287 128000000 , 2 3,

7 7

3 4 5

13230 846720

for 1323

7 7 7

R

R R R

R

R R

g

R

R − + −

− +   +   +

     

     

+ − + < <

+   +   +

   

=

 

(7)
[image:7.595.131.486.284.697.2]

Figure 3.Probability density function of the concentration ratio R for random samples of size n = 8 from a uniform population.

( )

2 2 2 2

2 2 2

15625 274531 46303907 268282153

30240

2 1 1

254016 6615 19845

7 7 7

18601017487 386863168 536870912 , 3 4,

7 7

2 3 4

423

for

360 6615 19845

7 7 7

R

R R R

R R

R

R R

g − + − +

− +  − +   +

     

     

− + − < <

+   +   +

     

     

=

( )

2 2 2 2

2 2 2

64 2472719 1254307 46303907

30240

3 2 1

6615 423360 6615

7 7 7

31094051 2548367503 663552 , 4 5,

7 7

1 2 3

6615 423360 245

7

r 7

fo 7

g

R

R R R

R R R

R

R

− + −

− +  − +  − +

     

     

+ − + < <

+   +   +

     

     

=

( )

2 2 2 2

2 2 2

81 15949 2472719 274531

4 3 2 6615 1

7 7 7 7

14356253 9613463 2097152 , for5 6,

7 7

12

156800 66150 423360

66

0960 1 2

7

150 33 7

7 0 5

R R R R

R R

g R

R

R

− + − +

− +  − +  − +  − +

       

       

− + − < <

+   +

 

=

 

( )

2 2 2 2

2 2 2

1 81 64 15625

5 4 3 2

7 7 7 7

81 16807

198450 156800 6615 25

8192 , 6 1.

40

7 86400

1 1

490

7 7

16

for 99225

g

R R R R

R R

R R

R

− + −

− +  − +  − +  − +

       

       

+ − + < <

− +   +

   

   

=

(8)

mean

( )

4 3475456log 2 2775303log3 2421875log5 3411821log 7 0.34747,

315 560 1008 720

E R = − + + − − =

second moment

( )

2 190 3184576log 2 8776431log3 4140625log5 7882483log 7 0.12985,

7 63 392 392 360

E R = − − + + =

third moment

( )

3 6016 658405376log 2 1492752159log3 129921875log5 13596863log 7 0.05160,

49 5145 27440 5488 240

E R = − + + − − =

fourth moment

( )

4 145475 3729880384log 2 2307400911log3 533984375log5 9851303log 7 0.02162,

343 15435 24010 14406 90

E R = − − + + =

standard deviation σ

( )

R =0.09544,

index of skewness γ1

( )

R =0.16867, index of kurtosis γ2

( )

R = −0.12824.

The distribution of the concentration ratio R for samples of size n=8 from a uniform population shows slight positive skewness and platykurtosis, both lower than those obtained for samples of size n=6 and 7.

6. The Distribution of the Concentration Ratio for n = 9

The procedure indicated in Section 2 is used to obtain the following p.d.f. (Figure 4) of the concentration ratio R

for random samples of size n = 9:

( )

2 2 2 2

2 2 2

2125

531441 531441 10460353203

1 2 3 35 4

8 8 8 8

41518828125 10460353203 437664515463 544195584

5 6 7

764

20070400 11200 2867200

114688 11200 40 00

8 9

8 8 6

R R R R

R R g R R − + − +  +   +   +   +                  − + − +  +   +   +              =

(

)

2 for

1

, 0 ,

8 1225 1+R < <R

( )

2 2 2 2 2

2 2

1024 9819117949 15242823 14412664 857395628

77414400 9289728

2867200 9072

27931

14175 1 1600 2 3 945 4

8 8 8 8

6604248318741 2358496602787 437664515463

5 0 6

8 0 8

g R

R

R R R R

R R − + − +  +   +   +   +                  − + −  +   +          = 2 for 409600 1 2 , , 8 8 7 8 R R < <  +     

( )

2 2 2 2

2 2 2

823543 9613463 715845826021 1262942489 75600

1 1 2

8 8 8

258217250722

11059200 77414400 8640

309

7 156844988749 6604248318741 104

3 4 5

8 8 8

6576 75600 2867200

R R R g R R R R R − + − + − +   +   +              − + − +  +   +   +              = 2

60353203 , 2 3,

8 8 6 11200 8 for R R < <  +     

( )

2 2 2 2

2 2 2

81 183689449 31094051 193012968877 7560

1 2 1

2240

8 8 8

2971536239 258217250722

3096576 3096576

857395628 30

7 41518828125

2 3 4 5

8640 945

8 96576 8 8 114688 8

g

R

R R R

R R R R

R − + −

− +  − +   +              + − + −  +   +   ++       =     

2 fo 3 4

, ,

8 8

r < <R

 

 

(9)
[image:9.595.117.512.320.718.2]

Figure 4. Probability density function of the concentration ratio R for random samples of size n = 9 from a uniform population.

( )

2 2 2 2

2 2 2

78125 783437 2575720699 268282153

22680

3 60480 2 1

8 8 8

1

9289728 3096576

2125764 30965

93012968877 1262942489 1441266427931

1 8640 2 3 35 4

8 8 8

76 9 72

8 289 8

R

R R R

R R R

g R − + − +

− +  − +  − +

     

     

− + − +

+   +   +

     

   

=

 

2 for 4

5

, ,

8 R 8

R

< <

+

 

 

( )

2 2 2 2

2 2 2

77414400 302400 3096576

4 90129527 19593601 2575720699

4 3 2 1

4725

8 8 8 8

31094051 715845826021 15242823 10460353203 756

77414

0 400 1 1600 2 28672 0 3

8 0

8 8

R R R R

R

R R

g

R

R − + −

− +  − +  − +  − +

       

       

+ − + −

+   ++

   

   

=

2 fo 5

6

, ,

8 8

r < <R

 

 

 

( )

2 2 2 2

2 2 2 2

2867200 75600 77414400 6048

81 2251 90129527 783437

5 4 3 2

8 8 8 8

18

0

f 3689449 9613463 9819117949 531441 ,

75600

1 1 11200 2 o

3096576 774144

8 0

8 0 8

R R R R

R

R R R

g R − + − +

− +  − +  − +  − +

       

       

− + − +

− +   +   +

     

     

=

6 7 ,

8 r

8

R

< <

( )

2 2 2 2

2 2 2 2

1 81 4 78125

6 5 4725 4 3

8 8 8 8

81 823543 1024 531441 , 7 1.

8 14175

2 1 1

2

6350400 2867200 9289728

for

11059200 20070400

240

8 8 8

R R R R

R R

R R

g

R

R − + −

− +  − +  − +  − +

       

       

+ − + − < <

− +  − +   +

 

=

(10)

Characteristic values of the distribution are:

mean

( )

9 18425272log 2 1948617log3 1953125log5 109531219log 7 0.34589,

2 315 1120 576 5760

E R = − − − + + =

second moment

( )

2 1083 31176874log 2 209048769log3 150390625log5 1101076991log 7 0.12754,

32 105 35840 9216 11520

E R = + + − − =

third moment

( )

3 43983 69161579log 2 394860663log3 947265625log5 31654522291log 7 0.04969,

256 84 57344 24576 122880

E R = − − + + + =

fourth moment

( )

4 2732815 58613649log 2 36660941253log3 36018359375log5 739122430613log 7 0.02032,

4096 35 458752 589824 1474560

E R = + − − − =

standard deviation σ

( )

R =0.08889,

index of skewness γ1

( )

R =0.15559, index of kurtosis γ2

( )

R = −0.11467.

The distribution of the concentration ratio R for samples of size n = 9 from a uniform population shows slight positive skewness and platykurtosis, both lower than those obtained for samples of size n = 6, 7 and 8.

7. The Distribution of the Concentration Ratio for n = 10

The procedure indicated in Section 2 is used to obtain the following p.d.f. (Figure 5) of the concentration ratio R

for random samples of size n = 10:

( )

2 2 2 2 2

2 2

78125 10000000 234375 5120000000

10287648 321489 4592

762939453125

1 2 56 3 4 5

9 9 9 9 9

450375078125 26214400000

7 734832

30000000

52488 321

00

6 7

7

9 9 489

R R R R R

R g R

R

− + − +

+   +   +   +   +

         

         

− + −

+   +

   

   

=

(

)

2 46132031252, 0 19,

1568 1 fo

8 9

r R

R R

+ < <

+

+

 

 

( )

2 2 2 2 2

2

59049 6102360983 29486161727807 282890791328125

2508800 1 2 280 3 4

9 9 9 9

112482309211387 781606837730

91871139

64297800 2351462400 94058496

61

5 793800 6

9 85 00

9 91 4

R

R R

g

R R

R

R

− + − + −

+   +   +   +

       

       

+ −

+

 

 

=

+ 2 2 2

1465749080718869 2621440000000, 1 2,

9 9

7 for

64297 8

9 9

800 321489

R

R R R

+ − < <

   +   +

     

     

( )

2 2 2

2 2 2

32768 9819117949 133551832049 87091200

1 400 1

9 9

34956443674391 227124686777 4488452581986787

2 3 4

1161216

9 9 9

2394237

1148175 9185

68040 335923200

91854

80652283 0

g

R

R R

R R

R

R

− +

− +   +

   

   

− + −

+   +   +

     

     

+ =

2 2 2

450375078125, 2 3,

9 9

5 358400 6 7

8684520419229 fo

9

r

0 52488

9 9

R

R R R

− + < <

+   +   +

     

(11)
[image:11.595.140.483.260.722.2]

Figure 5.Probability density function of the concentration ratio R for random samples of size n = 10 from a uniform population.

( )

2 2 2

2 2 2

335923200 9185400

1837

5764801 605566249 715845826021 87091200

2 1

9 9

373815952381 167366716643141 2371170173011

1 2 3

9 9 9

4488452581986787

080 94058496 340200

33

R

R R

R R

g

R

R − + −

− +  − +

   

   

+ − +

+   +   +

     

    

− =

2 2 2

112482309211387 , 3 4,

9 9

4 5 7 6

9 9

30000000 for

5923200 9185400

9

R

R R R

+ − < <

+   +   +

     

     

( )

2 2 2

2 2 2

2

3 1859135071 4305254531

3 2 1

560

9 9 9

193012968877 429101140253 167366716643141 3483648

94058496 1837080

918540 1 9405 2

9 9

2

8496

680

27124686777 40

28 3

9

R R R

R

R R

g R

R

− +

− +  − +  − +

     

     

− + −

+   +

   

   

+ −

+

 

 

=

2 2

2890791328125 76293 for

94058496 734

9453125 4

8

5

, ,

9 9

9 32

4 5

9

R

R R

+ < <

+   +

   

   

( )

2 2 2

2 2 2

94058496 68040 94058496

1837

78125 195677 29538353821

4 3 2

9 9 9

12705254531 193012968877 373815952381 3483648

1

080 1837080

940584

1

9 9

349564 96

43674391 2 9

R R R

R

R R

g R

R

− + −

− +  − +  − +

     

     

+ − +

− +   +

   

   

+ =

 

 

2 2 2

512000

91871139 for

4

0000 , 5 6,

9 9

3 4

280

9 5927 9

R

R R

+ − < <

+   +

    

(12)

( )

2 2 2

2 2 2

2

64 404606309 779623

5 4 3

1148175 2351462400 48600

9 9 9

29538353821 4305254531 715845826021 87091200

2 1

94058496 1837080

9 9

133551832049 294 1

9185400 9

R R R

R

R R

R

g R − +

− +  − +  − +

     

     

− + −

− +  − +

   

   

+ −

+

 

 

=

2 2

86161727807 234375 , 6 7,

9 9

2 3

2351462400 56

9 9

for R

R R

+ < <

+   +

   

   

( )

2 2 2

2 2 2

2

3 180731 404606309

6 5 4

2508800 64297800 2351462400

9 9 9

195677 1859135071 605566249

3 2 1

68040 94058496 9185400

9 9 9

9819117949 6102360983 87091200

6

R R

g R

R

R R R

R

− + −

− +  − +  − +

     

     

+ − +

− +  − +  − +

     

     

− +

=

2 10000000 2, 79 89,

1 2

4297800 32148

fo 9

9 9

r R

R R

− < <

+   +

   

   

( )

2 2 2

2 2 2

2 2

1 3 64

7 6 5

257191200 2508800 1148175

9 9 9

78125 3 5764801

4 3 2

94058496 560 335923200

9 9 9

32768 59049 78125

2508800

1 1

1148175 10287648

9 9

R R R

R R R

R R

g R

R

− +

− +  − +  − +

     

     

− + −

− +  − +  − +

     

     

+ − +

 

=

− + +

 

  

2

8

, for 1.

9< <R

 

Characteristic values of the distribution are:

mean

( )

5 686419424log 2 88683579log3 60546875log5 40353607log 7 0.34462,

2835 1120 6048 810

E R = − + − + − =

second moment

( )

2 4977475650505 5506009449897293log 2 3870789417061723log3

1666598976 57868020 115736040

23520914453125log5 155338805614063log 7 0.12574,

5143824 8266860

E R = − + −

+ − =

third moment

( )

3 56500 302665066912log 2 1609375311log3

243 76545 1120

99482421875log5 133852914419log 7 0.04823,

489888 174960

E R = − + −

+ − =

(13)

( )

4 6581554 10398297681152log 2 1892901777log3 10382748828125log5

6561 1240029 560 19840464

1189583980753log 7 0.01933, 787320

E R = − + −

+ =

standard deviation σ

( )

R =0.08352,

index of skewness γ1

( )

R =0.14505, index of kurtosis γ2

( )

R = −0.10366.

The distribution of the concentration ratio R for samples of size n = 10 from a uniform population shows slight positive skewness and platykurtosis, both lower than those obtained for samples of size n=6,7,8 and 9.

8. Some Regularities of the Distributions

The analysis of the p.d.f. for n=2,3, ,10 shows some regularities:

● The p.d.f. of the concentration ratio R, for 0< <R 1n and for samples of size n, can be expressed by

( )

( )

(

)

(

)

1 1

1

2 2

1

2

1 1

1 ; 1 !

1

n i

n

i

n n n

i g R

n i

n R

n

− +

− =

 

−  

 

=

 

− +

 

 

● Furthermore, the p.d.f. of the concentration ratio R, for

(

n−1

)

n R< <1 and for samples of size n, can be expressed by

( )

( )

(

)

(

)

1 1

1

2 2

1

2

1 1

1 ; 2

1 ! 1

n i n

n

i

n

n i

i g R

i

n R

n

+ − −

− =

 

− + −

 

=

 

− +

 

 

● The density of the concentration ratio R, for 0< <R 1n and for samples of size n, is given by

( )

(

)

2 1

2

0 d 1 ! ;

n

ng R R n

n

= −

 

 

● The density of the concentration ratio R, for

(

n−1

)

n R< <1 and for samples of size n, is given by

( )

(

)

1

1 d 1 2;

1 !

n n

g R R n

− =

 

 

● The jth term of the density of the concentration ratio R, denoted as ai j, , verifies the following symmetry

, ,.

i j j i

a =a

The coefficients of the ai i, terms of the p.d.f. of the concentration ratio R for samples of size n−1 multip-lied by

(

n−1

)

n become the coefficients of the ai+ +1, 1i terms of the same p.d.f. for sample of size n.

These results are valid for every sample size and may allow reducing the heavy calculation to determine the p.d.f. of the concentration ratio R.

9. Concluding Remarks

In the present paper we obtain the distributions of the Gini concentration ratio R for samples of size 6,7,8,9 and 10

(14)

mean and expressed in terms of the concentration ratio R for the values assumed in each of such regions. The calculation of the limits of integration is particularly heavy and requires a very long processing time.

The obtained results show that the p.d.f. of the concentration ratio R is given by hyperbolic splines with de-gree 2 and with nodes in k n

(

−1

)

for k=1,2, ,n−1. Such distributions are unimodal with mean tending to 1 3, which is the value of the concentration ratio R for the population, and have decreasing standard deviation. Moreover, the distributions show a slight positive skewness and platykurtosis that tend to decrease as n increas-es.

Beyond the possibility to obtain similar results for samples of larger size, open problems are the derivation of the exact expression for the mean and the other features of the distribution of the concentration ratio R for ran-dom samples of size n drawn from a uniform population.

References

[1] Gini, C. (1914) L’ammontare e la composizionedellaricchezzadellenazioni. Bocca, Torino.

[2] Hoeffding, W. (1948) A Class of Statistics with Asymptotically Normal Distribution. Annals of Mathematical Statistics,

19, 293-325.

[3] Glasser, G.J. (1962) Variance Formulas for the Mean Difference and the Coefficient of Concentration. Journal of the American Statistical Association, 57, 648-654. http://dx.doi.org/10.1080/01621459.1962.10500553

[4] Cucconi, O. (1965) Sulla distribuzionecampionaria del rapporto R di concentrazione. Statistica, 25, 119.

[5] Dall’Aglio, G. (1965) Comportamentoasintoticodellestimedelladifferenza media e del rapporto di concentrazione. Me-tron, 24, 379-414.

[6] Deltas, G. (2003) The Small-Sample Bias of the Gini Coefficient: Results and Implications for Empirical Research.

Review of Economics and Statistics, 85, 226-234. http://dx.doi.org/10.1162/rest.2003.85.1.226

[7] Barrett, G.F. and Donald, S.G. (2009) Statistical Inference with Generalized Gini Indices of Inequality, Poverty, and Welfare. Journal of Business & Economic Statistics, 27, 1-17. http://dx.doi.org/10.1198/jbes.2009.0001

[8] Davidson, R. (2009) Reliable Inference for the Gini Index. Journal of Econometrics, 150, 30-40.

http://dx.doi.org/10.1016/j.jeconom.2008.11.004

[9] Fakoor, V., Ghalibaf, M.B. and Azarnoosh, H.A. (2011) Asymptotic Behaviors of the Lorenz Curve and Gini Index in Sampling from a Length-Biased Distribution. Statistics and Probability Letters, 81,1425-1435.

http://dx.doi.org/10.1016/j.spl.2011.04.013

[10] Girone, G. (1968) Sulcomportamentocampionariosimulato del rapporto di concentrazione. Annalidella Facoltà di Economia e CommerciodellUniversitàdegli Studi di Bari, 23, 5-11.

[11] Girone, G. (1971) La distribuzione del rapporto di concentrazione per campionicasuali di variabiliesponenziali. Studi di Probabilità, Statistica e Ricercaoperativa in onore di Giuseppe Pompilj, Oderisi, Gubbio.

(15)

Figure

Figure 1. Probability density function of the concentration ratio R for random
Figure 2. Probability density function of the concentration ratio R for random samples of size n = 7 from a uniform population
Figure 3. Probability density function of the concentration ratio R for random samples of size n = 8 from a uniform population
Figure 4. Probability density function of the concentration ratio R for random samples of size n = 9 from a uniform population
+2

References

Related documents

Minors who do not have a valid driver’s license which allows them to operate a motorized vehicle in the state in which they reside will not be permitted to operate a motorized

Marie Laure Suites (Self Catering) Self Catering 14 Mr. Richard Naya Mahe Belombre 2516591 [email protected] 61 Metcalfe Villas Self Catering 6 Ms Loulou Metcalfe

The corona radiata consists of one or more layers of follicular cells that surround the zona pellucida, the polar body, and the secondary oocyte.. The corona radiata is dispersed

The summary resource report prepared by North Atlantic is based on a 43-101 Compliant Resource Report prepared by M. Holter, Consulting Professional Engineer,

organisasjonslæring, arbeidsplasslæring, uformell og formell læring, læring gjennom praksis, sosial praksis og så videre vil derfor være nyttige når man skal foreta en studie

4.1 The Select Committee is asked to consider the proposed development of the Customer Service Function, the recommended service delivery option and the investment required8. It

Using text mining of first-opinion electronic medical records from seven veterinary practices around the UK, Kaplan-Meier and Cox proportional hazard modelling, we were able to

• Follow up with your employer each reporting period to ensure your hours are reported on a regular basis?. • Discuss your progress with