• No results found

Figures and Tables for Chapter 4

In document Three essays in environmental health (Page 100-108)

CHAPTER 4 A GROUPED VARIABLE SELECTION ANALYSIS OF CHILD-

4.7 Figures and Tables for Chapter 4

12 14 16 18 20 22 24 Grouped Procedure Mo de l Si ze

Figure 4.1: Model size under heteroskedasticity across grouped selection methods: Het.1.

iCAP GLASSO 12 14 16 18 20 22 24 Grouped Procedure Mo de l Si ze

Figure 4.2: Model size under heteroskedasticity across grouped selection methods: Het.2.

The figures show box-plots of model sizes over 100 simulations and n = 500 of 2 regression experiments with forms of heteroskedasticity, Het.1 : σ2= (3 + 0.2X

4X5)2 and Het.2 : σ2= 1 + 0.1X42+ 0.2X52. The

iCAP GLASSO OLS 0 50 100 150 200 250 300 Grouped Procedure Mo de l Erro r

Figure 4.3: Model error across grouped and other selection methods, n = 100.

iCAP GLASSO OLS

0.0 0.5 1.0 1.5 2.0 2.5 3.0 Grouped Procedure Mo de l Erro r

Figure 4.4: Model error across grouped and other selection methods, n = 5000.

The figures show box-plots of model errors over 100 simulations for n = 100 and n = 5000 of a regression experiment with heteroskedasticity of the form Het.2 : σ2= 1 + 0.1X2

4+ 0.2X 2 5 .

Table 4.1: Summary statistics for categorical covariates in Ghana 2008 DHS.

Variable Levels n %

Birth order Among first five 1662 82.0

Among next five 364 18.0

Wealth Poorest 674 33.3

Poorer 453 22.4

Middle 329 16.2

Richer 335 16.5

Richest 235 11.6

Water source Bottled Water 95 4.7

Piped Water 630 31.1

River, Dam, Lake etc. 327 16.1

Well/Tubewell 974 48.1

Toilet Flush toilet 129 6.4

Ventillated improved toilet latrine 655 32.3

No Facility/Composting/Bucket/Pan 731 36.1 Other 511 25.2 Fuel Charcoal/Wood/Straw 1887 93.1 Electricity/Biogas 133 6.6 Kerosene 6 0.3 Gender Female 1007 49.7 Male 1019 50.3 Residence Rural 1371 67.7 Urban 655 32.3 Electricity No 1190 58.7 Yes 836 41.3 TV No 1395 68.8 Yes 631 31.1 Fridge No 1656 81.7 Yes 370 18.3 Bicycle No 1177 58.1 Yes 849 41.9 Motorcycle No 1860 91.8 Yes 166 8.2 Car No 1941 95.8 Yes 85 4.2

Table 4.2: Summary statistics for continuous covariates in Ghana 2008 DHS.

Variable n Min Mean Median Max

Child age (months) 2026 0.0 28.9 28.0 59.0

Mother Educ. (years) 2026 0.0 4.5 4.0 16.0

Mother age (years) 2026 16.0 30.5 30.0 49.0

Mother BMI 2026 12.2 23.2 22.3 56.9

Father Educ. (years) 2026 0.0 6.5 9.0 19.0

Table 4.3: Child health by important risk factors.

Wealth Quintiles Maternal BMI

Child size Large Small n Large Small n

Poorest 0.512 0.189 674 BMI < 20 0.486 0.160 405 Poorer 0.559 0.130 453 BMI ∈ [20,25) 0.563 0.146 1090 Middle 0.558 0.116 329 BMI ∈ [25,30) 0.541 0.138 382

Richer 0.544 0.128 335 BMI > 30 0.622 0.126 135 Richest 0.615 0.123 235

Maternal Education Cooking Fuel

Child size Large Small n Large Small n

No Education 0.529 0.164 797 Wood/Straw 0.536 0.156 1352 Educated 0.561 0.134 1215 Other 0.569 0.125 660

Largerefers to proportion of children who are larger than average and very large in their size at birth. Small refers to children who are smaller than average and very small at birth. Wood/Straw are examples of biofuels.

Table 4.4: Performance of selection procedures: 2008 DHS.

i CAP OLS LASSO GLASSO

Prediction error 22.13 27.80 26.78 25.47 (1.92) (2.99) (2.55) (2.34)

Standard error in parenthesis. The prediction error is for an independent test set chosen from the 2008 DHS sample of children younger than 5 years.

Table 4.5: Quantile regression model: categorical variables.

τ = 0.1 τ = 0.5 τ = 0.75 Female -0.384∗∗∗ -0.543∗∗∗ -0.484∗∗∗ (0.048) (0.061) (0.058) Birth order (early) 0.000 0.127∗∗ 0.178∗∗∗

(0.047) (0.060) (0.057) Poorer 0.000 -0.033 0.011 (0.045) (0.059) (0.057) Middle 0.000 0.089 0.037 (0.047) (0.056) (0.056) Richer 0.000 0.000 0.011 (0.041) (0.054) (0.051) Richest 0.152∗∗∗ 0.442∗∗∗ 0.689∗∗∗ (0.041) (0.052) (0.052) Fuel: LPG 0.000 0.014 0.006 (0.045) (0.056) (0.054) Fuel: Natural Gas -0.005 0.000 -0.054

(0.040) (0.068) (0.061) Fuel: Kerosene 0.159∗∗∗ 0.000 0.000 (0.031) (0.054) (0.055) Fuel: Charcoal 0.000 0.000 0.000 (0.039) (0.050) (0.047) Fuel: Wood -0.116∗∗∗ -0.235∗∗∗ -0.033 (0.036) (0.047) (0.043) Aux. Midwife 0.248∗∗∗ 0.108∗∗∗ 0.000 (0.043) (0.057) (0.054) DPT/Hep./Flu Vaccine 0.223∗∗∗ 0.141∗∗ 0.131∗∗∗ (0.045) (0.059) (0.059) TV 0.199∗∗∗ 0.000 0.000 (0.040) (0.054) (0.053) Fridge 0.141∗∗∗ 0.027 0.000 (0.041) (0.056) (0.054)

Dependent variable: child height (in cm). Standard deviation in parenthesis. Significant at∗10%,∗∗5%,∗∗∗ 1%. Aux. Midwife is

an indicator for the mother having had access to midwife prenatal care.

REFERENCES

Abel, E.L. et al. (1980), ‘Smoking during pregnancy: a review of effects on growth and development of offspring’, Human Biology 52(4), 593.

Akaike, H. (1974), ‘A new look at the statistical model identification.’, IEEE transactions on Automatic Control AC-19, 716–723.

Akresh, R., P. Verwimp and T. Bundervoet (2011), ‘Civil war, crop failure, and child stunting in Rwanda’, Economic Development and Cultural Change 59(4), 777–810.

Alderman, H. and J. Behrman (2006), ‘Reducing the incidence of low birth weight in low income countries has substantial economic benefits’, World Bank Research Observer 21(1), 25–48.

Alderman, H., J. Hoddinott and B. Kinsey (2006), ‘Long term consequences of early child- hood malnutrition’, Oxford Economic Papers 58(3), 450–74.

Almond, D. (2006), ‘Is the 1918 influenza pandemic over? Long-term effects of in utero influenza exposure in the post-1940 US population’, Journal of Political Economy 114(4), 672–712.

Almond, D., H.W. Hoynes and D.W. Schanzenbach (2008), ‘Inside the war on poverty: the impact of food stamps on birth outcomes’, National Bureau of Economic Research. Almond, D., L. Edlund, Li. Hongbin and J. Zhang (2007), Long-term effects of the 1959-1961

China Famine: Mainland China and Hong Kong, NBER Working Paper 13384, National Bureau of Economic Research, Cambridge MA.

Almond, D., L. Edlund and M. Palme (2009), ‘Chernobyl’s subclinical legacy: Prenatal exposure to radioactive fallout and school outcomes in sweden’, Quarterly Journal of Economics 124(4), 1729–1772.

Almond, D., Li. Hongbin and L. Meng (2010), ‘Son preference and early childhood invest- ments in China’, manuscript, Tsinghua University.

Barker, D.J.P. (1998), Mothers, babies, and health in later life, Elsevier Health Sciences. Behrman, J.R. (1988), ‘Intrahousehold allocation of nutrients in rural India: Are boys fa-

Bertrand, M., E. Duflo and S. Mullainathan (2004), ‘How much should we trust differences- in-differences estimates?’, Quarterly Journal of Economics 119(9), 249–275.

Bhalotra, S. and S. Rawlings (2010), ‘Intergenerational persistence in health in developing countries: The penalty of gender inequality?’, IZA Discussion Paper No. 5371.

Bharadwaj, P. and L. Nelson (2010), ‘Discrimination begins in the womb: evidence of sex- selective prenatal investments’, Mimeograph, University of San Diego .

Boadi, N.O., S.K. Twumasi and J.H. Ephraim (2009), ‘Impact of cyanide utilization in mining on the environment’, International Journal of Environmental Resources 3(1), 101– 108.

Bozzoli, C. and C. Quintana-Domeque (2010), ‘Weight of the crisis: Evidence from newborns in argentina’, DIW Berlin Discussion Paper No. 1054.

Bundervoet, T., P. Verwimp and R. Akresh (2009), ‘Health and civil war in rural Burundi’, Journal of Human Resources 44(2), 536–563.

Conley, T.G. and C. Taber (2009), ‘Inference with ‘Difference in Differences’ with a small number of policy changes’, NBER Working Paper .

Currie, J. and E. Moretti (2005), Biology as destiny? Short and long-run determinants of intergenerational transmission of birth weight, Technical report, National Bureau of Economic Research Cambridge, Mass., USA.

Currie, J. and R. Hyson (1999), ‘Is the impact of health shocks cushioned by socioeconomic status? The case of low birthweight’, National Bureau of Economic Research Cambridge, Mass., USA.

De Mel, S., D. McKenzie and C. Woodruff (2008), ‘Mental health recovery and economic recovery after the tsunami: High-frequency longitudinal evidence from Sri Lankan small business owners’, Social Science & Medicine 66(3), 582–595.

Desai, S. and S. Alva (1998), ‘Maternal education and child health: Is there a strong causal relationship?’, Demography 35(1).

Duflo, E., M. Greenstone and R. Hanna (2008), ‘Indoor air pollution, health and economic well-being’, SAPI EN. S. Surveys and Perspectives Integrating Environment and Society (1.1).

Efron, B., T. Hastie, I. Johnstone and R. Tibshirani (2004), ‘Least angle regression’, Annals of Statistics 32(2), 407–451.

Fan, J. and R. Li (2001), ‘Variable selection via nonconcave penalized likelihood and its oracle properties’, Journal of the American Statistical Association 96(456), 1348–1360. Fenske, N., T. Kneib and T. Hothorn (2009), Identifying risk factors for severe childhood

malnutrition by boosting additive quantile regression, Technical Report 52, Department of Statistics, University of Munich.

Frankenberg, E., J. Friedman, T. Gillespie, N. Ingwersen, R. Pynoos, I.U. Rifai, B. Sikoki, A. Steinberg, C. Sumantri, W. Suriastini et al. (2008), ‘Mental health in Sumatra after the tsunami’, American journal of public health .

Fullerton, D.G., N. Bruce and S.B. Gordon (2008), ‘Indoor air pollution from biomass fuel smoke is a major health concern in the developing world’, Transactions of the Royal Society of Tropical Medicine and Hygiene 102(9), 843–851.

Godfrey, Keith M. and Barker David J.P. (2000), ‘Fetal nutrition and adult disease’, Amer- ican Journal of Clinical Nutrition 71(5), 1344S–1352S.

Haddad, L., J. Hoddinott and H. Alderman (1997), Intrahousehold resource allocation in developing countries: models, methods, and policy., Johns Hopkins University Press. Huang, J., J.L. Horowitz and S.G. Ma (2008), ‘Asymptotic properties of bridge estimators

in sparse high-dimensional regression models’, Annals of Statistics 36, 587–613.

Kandala, N.B., L. Fahrmeir, S. Klasen and J. Priebe (2009), ‘Geo-additive models of child- hood undernutrition in three sub-Saharan African countries’, Population, Space and Place 15(5), 461–473.

K´ezdi, G´abor (2004), ‘Robust standard error estimation in fixed-effects panel models’, Hun- garian Statistical Review 9, 96–116.

Koenker, R. (2010), Advances in Social Science Research Using R, Lecture Notes in Statis- tics 196, Springer Science+Business Media, LLC, chapter Additive Models for Quantile Regression: An Analysis of Risk Factors for Malnutrition in India.

Koenker, R. and P. Ng (2003), ‘SparseM: A Sparse Linear Algebra Package for R’, J. Stat. Software 8(6), 1–9.

Koenker, R., P. Ng and S. Portnoy (1994), ‘Quantile Smoothing Splines’, Biometrika 81, 673– 680.

Lawrence, M., W.A. Coward, F. Lawrence, T.J. Cole and R.G. Whitehead (1987), ‘Fat gain during pregnancy in rural African women: the effect of season and dietary status’, The American journal of clinical nutrition 45(6), 1442–1450.

Lin, Y. and H. Zhang (2006), ‘Component selection and smoothing in multivariate nonpara- metric regression’, Annals of Statistics 34, 2272–2297.

Maccini, S. and D. Yang (2008), ‘Under the weather: Health, schooling and economic con- sequences of early-life rainfall’, American Economic Review .

Martorell, R. and J. Habicht (1986), Growth in Early Childhood in Developing Countries, Plenum Press.

Meier, L., van der Geer S. and P. B¨ulmann (2006), ‘The group lasso for logistic regression’, Journal of the Royal Statistical Society: Series B (Statistical Methodology) 70(1), 53–7.

National Geographic News, December 2004 (2005). URL: http://news.nationalgeographic.com/

of Ghana, Government (2008), Implementation of the Ghana Poverty Reduction Strategy, Annual progress report, National Development Planning Commission.

Pathak, P.K., A. Singh and S.V. Subramanian (2010), ‘Economic inequalities in maternal health care: Prenatal care and skilled birth attendance in India, 1992–2006’, PloS One 5(10).

Ren, M. and X. Zhang (2008), ‘Gender difference on the long term impact of the famine’, International Food Policy Research Institute Discussion Paper 760. .

Sheth, A., S. Sanyal, A. Jaiswal and P. Gandhi (2006), ‘Effects of the December 2004 Indian Ocean tsunami on the Indian mainland’, Earthquake spectra 22(S435).

St Clair, D., M. Xu, P. Wang, Y. Yu, Y. Fang, F. Zhang, X. Zheng, N. Gu, G. Feng, P. Sham et al. (2005), ‘Rates of adult schizophrenia following prenatal exposure to the Chinese famine of 1959-1961’, JAMA: the journal of the American Medical Association 294(5).

Tibshirani, R. (1996), ‘Regression shrinkage and selection via the lasso’, Journal of the Royal Statistical Society. Series B (Methodological) 58(1), 267–288.

Tschakert, P. and K. Singha (2007), ‘Contaminated identities: Mercury and marginalization in Ghana’s artisanal mining sector’, Geoforum 38(6), 1304–1321.

Van den Berg, G., M. Lindeboom and F. Portrait (2006), ‘Economic conditions early in life and individual mortality’, American Economic Review 96(1), 290–302.

Vieira, V.M., J.E. Hart, T.F. Webster, J. Weinberg, R. Puett, F. Laden, K.H. Costenbader and E.W. Karlson (2010), ‘Association between Residences in US Northern Latitudes and Rheumatoid Arthritis: A Spatial Analysis of the Nurses’ Health Study’, Environmental health perspectives 118(7), 957.

Yuan, M. and Y. Lin (2006), ‘Model selection and estimation in regression with grouped variables’, Journal of the Royal Statistical Society: Series B (Statistical Methodology) 68(1), 49–67.

Zhao, P., G. Rocha and Y. Bin (2009), ‘The composite absolute penalties family for grouped and hierarchical variable selection’, The Annals of Statistics 37(6A), 3468–3497.

Zou, H., T. Hastie and R. Tibshirani (2007), ‘On the “degrees of freedom” of the lasso’, Annals of Statistics 35(5), 2173–2192.

In document Three essays in environmental health (Page 100-108)

Related documents