• No results found

Proof that prior increases posterior probability of including mini-

3.5 Analysis of SEER-Medicare data

3.7.2 Proof that prior increases posterior probability of including mini-

imal confounder set

Here we outline why our prior will be useful in controlling for confounding. Let all dis- tributions with a subscript 1 be distributions associated with using our conditional prior specification, and all distributions with a subscript 2 be associated with a flat prior on the

model space similar to BMA. Distributions without a subscript are those that are unaf- fected by the choice of prior. Then without loss of generality if we assume that the first m covariates are the ones necessary for controlling for confounding then the following holds: P1(αy ∈ αy∗|D) = P1(α y 1 = 1, α y 2 = 1, ..., αym = 1|D) = P1(αy1 = 1|D) P1(α2y = 1|α y 1 = 1, D)...P1(αym = 1|α y 1 = 1, ..., α y m−1 = 1, D) = P1(αy1 = 1|D) ∗ C1 = P (D|α y 1 = 1)P1(α y 1 = 1) P (D) ∗ C1 =X αx 1 P (D|αy1 = 1)P1(αy1 = 1, αx1) P (D) ∗ C1 = P (D|α y 1 = 1)3ω+12ω P (D) ∗ C1 ≥ P (D|α y 1 = 1)12 P (D) ∗ C1 =X αx 1 P (D|αy1 = 1)P2(αy1 = 1, αx1) P (D) ∗ C1 = P2(αy1 = 1|D) ∗ C1 = P2(αy1 = 1|D)P1(αy2 = 1|α y 1 = 1, D)...P1(αym = 1|α y 1 = 1, ..., α y m−1 = 1, D) = P1(αy2 = 1|α y 1 = 1, D) ∗ C2 = P (D|α y 2 = 1, α y 1 = 1)P1(α y 2 = 1|α y 1 = 1) P (D|αy1 = 1) ∗ C2 = P (D|α y 2 = 1, α y 1 = 1)P1(αy2 = 1) P (D|αy1 = 1) ∗ C2 =X αx 2 P (D|αy2 = 1, αy1 = 1)P1(αy2 = 1, αx2) P (D|α1y = 1) ∗ C2 = P (D|α y 2 = 1, α y 1 = 1) 2ω 3ω+1 P (D|αy1 = 1) ∗ C2 ≥ P (D|α y 2 = 1, α y 1 = 1) 1 2 P (D|αy1 = 1) ∗ C2 =X αx 2 P (D|αy2 = 1, αy1 = 1)P2(α y 2 = 1, αx2) P (D|α1y = 1) ∗ C2

= P2(αy2 = 1|α y 1 = 1, D) ∗ C2 = P2(α y 1 = 1|D)P2(α y 2 = 1|α y 1 = 1, D) × P1(αy3 = 1|α y 1 = 1, α y 2 = 1, D)...P1(αym = 1|α y 1 = 1, ..., α y m−1 = 1, D)

And the proof concludes by performing analogous algebraic operations for each of the remaining conditional posteriors from j=3,...,m. We have now shown how our condi- tional prior specification assigns more posterior mass to models that contain the true con- founder set.

References

BREYSSE, P. N., DELFINO, R. J., DOMINICI, F., ELDER, A. C., FRAMPTON, M. W.,

FROINES, J. R., GEYH, A. S., GODLESKI, J. J., GOLD, D. R., HOPKE, P. K.ET AL. (2013).

Us epa particulate matter research centers: summary of research results for 2005–2011. Air Quality, Atmosphere & Health 6 333–355.

BROCHU, P. J., KIOUMOURTZOGLOU, M.-A., COULL, B. A., HOPKE, P. K. and SUH, H. H. (2011). Development of a new method to estimate the regional and local contri- butions to black carbon. Atmospheric Environment 45 7681–7687.

BUHMANN, M. (1995). Multiquadric prewavelets on nonequally spaced knots in one dimension. Mathematics of computation 64 1611–1625.

CARROLL, R. J., RUPPERT, D., STEFANSKI, L. A. and CRAINICEANU, C. M. (2006). Mea-

surement error in nonlinear models: a modern perspective. CRC press.

CHAICHANA, K. L., GARZON-MUVDI, T., PARKER, S., WEINGART, J. D., OLIVI, A., BEN- NETT, R., BREM, H. and QUINONES-HINOJOSA, A. (2011). Supratentorial glioblastoma

multiforme: the role of surgical resection versus biopsy among older patients. Annals of surgical oncology 18 239–245.

CHOW, J. C., ENGELBRECHT, J. P., WATSON, J. G., WILSON, W. E., FRANK, N. H. and ZHU, T. (2002). Designing monitoring networks to represent outdoor human exposure. Chemosphere 49 961–978.

CHUI, C. K., WARD, J., JETTER, K. and STOECKLER, J. (1996). Wavelets for analyzing

scattered data: An unbounded operator approach. Applied and Computational Harmonic Analysis 3 254–267.

DADVAND, P., PARKER, J., BELL, M. L., BONZINI, M., BRAUER, M., DARROW, L. A.,

GEHRING, U., GLINIANAIA, S. V., GOUVEIA, N., HA, E.-H. ET AL. (2013). Maternal

exposure to particulate air pollution and term birth weight: a multi-country evaluation of effect and heterogeneity .

DIGGLE, P. J., MENEZES, R. and SU, T.-L. (2010). Geostatistical inference under pref- erential sampling. Journal of the Royal Statistical Society: Series C (Applied Statistics) 59 191–232.

DOCKERY, D. W., POPE, C. A., XU, X., SPENGLER, J. D., WARE, J. H., FAY, M. E., FER- RISJR, B. G. and SPEIZER, F. E. (1993). An association between air pollution and mor-

tality in six us cities. New England journal of medicine 329 1753–1759.

DOMINICI, F., PENG, R. D., BELL, M. L., PHAM, L., MCDERMOTT, A., ZEGER, S. L. and SAMET, J. M. (2006). Fine particulate air pollution and hospital admission for cardiovascular and respiratory diseases. Jama 295 1127–1134.

DOMINICI, F., SHEPPARD, L. and CLYDE, M. (2003). Health effects of air pollution: a

statistical review. International Statistical Review 71 243–276.

GELFAND, A. E., SAHU, S. K. and HOLLAND, D. M. (2012). On the effect of preferential

sampling in spatial prediction. Environmetrics 23 565–578.

GELMAN, A., CARLIN, J. B., STERN, H. S. and RUBIN, D. B. (2014). Bayesian data analysis, vol. 2. Taylor & Francis.

GLINIANAIA, S. V., RANKIN, J., BELL, R., PLESS-MULLOLI, T. and HOWEL, D. (2004).

Particulate air pollution and fetal health: a systematic review of the epidemiologic evi- dence. Epidemiology 15 36–45.

GRYPARIS, A., PACIOREK, C. J., ZEKA, A., SCHWARTZ, J. and COULL, B. A. (2009). Mea-

surement error caused by spatial misalignment in environmental epidemiology. Bio- statistics 10 258–274.

GUPTA, C., LAKSHMINARAYAN, C., WANG, S. and MEHTA, A. (2010). Non-dyadic haar

wavelets for streaming and sensor data. In Data Engineering (ICDE), 2010 IEEE 26th International Conference on. IEEE.

HOETING, J. A., MADIGAN, D., RAFTERY, A. E. and VOLINSKY, C. T. (1999). Bayesian model averaging: a tutorial. Statistical science 382–401.

HULATA, E., SEGEV, R. and BEN-JACOB, E. (2002). A method for spike sorting and detec-

tion based on wavelet packets and shannon’s mutual information. Journal of neuroscience methods 117 1–12.

KANAROGLOU, P. S., JERRETT, M., MORRISON, J., BECKERMAN, B., ARAIN, M. A.,

GILBERT, N. L. and BROOK, J. R. (2005). Establishing an air pollution monitoring net- work for intra-urban population exposure assessment: A location-allocation approach. Atmospheric Environment 39 2399–2409.

KIM, S.-Y., SHEPPARD, L. and KIM, H. (2009). Health effects of long-term air pollution:

influence of exposure prediction methods. Epidemiology 20 442–450.

KLOOG, I., CHUDNOVSKY, A. A., JUST, A. C., NORDIO, F., KOUTRAKIS, P., COULL, B. A.,

LYAPUSTIN, A., WANG, Y. and SCHWARTZ, J. (2014). A new hybrid spatio-temporal

model for estimating daily multi-year pm 2.5 concentrations across northeastern usa using high resolution aerosol optical depth data. Atmospheric Environment 95 581–590.

KLOOG, I., MELLY, S. J., RIDGWAY, W. L., COULL, B. A., SCHWARTZ, J.ET AL. (2012). Us- ing new satellite based exposure methods to study the association between pregnancy pm2. 5 exposure, premature birth and birth weight in massachusetts. Environmental Health 11 1–8.

LEE, A., SZPIRO, A., KIM, S. and SHEPPARD, L. (2015). Impact of preferential sampling

on exposure prediction and health effect inference in the context of air pollution epi- demiology. Environmetrics .

LEFEBVRE, G., DELANEY, J. A. and MCCLELLAND, R. L. (2014). Extending the bayesian

adjustment for confounding algorithm to binary treatment covariates to estimate the ef- fect of smoking on carotid intima-media thickness: the multi-ethnic study of atheroscle- rosis. Statistics in medicine 33 2797–2813.

LITTLE, R. J. and RUBIN, D. B. (2014). Statistical analysis with missing data. John Wiley & Sons.

MADIGAN, D., RAFTERY, A. E., YORK, J. C., BRADSHAW, J. M. and ALMOND, R. G.

(1994). Strategies for graphical model selection. In Selecting Models from Data. Springer, 91–100.

MADIGAN, D., YORK, J. and ALLARD, D. (1995). Bayesian graphical models for discrete data. International Statistical Review/Revue Internationale de Statistique 215–232.

MADSEN, L., RUPPERT, D. and ALTMAN, N. (2008). Regression with spatially misaligned data. Environmetrics 19 453–467.

MATTE, T. D., ROSS, Z., KHEIRBEK, I., EISL, H., JOHNSON, S., GORCZYNSKI, J. E., KASS,

D., MARKOWITZ, S., PEZESHKI, G. and CLOUGHERTY, J. E. (2013). Monitoring intraur-

ban spatial patterns of multiple combustion air pollutants in new york city: Design and implementation. Journal of Exposure Science and Environmental Epidemiology 23 223–231.

MAYNARD, D., COULL, B. A., GRYPARIS, A. and SCHWARTZ, J. (2007). Mortality risk as- sociated with short-term exposure to traffic particles and sulfates. Environmental Health Perspectives 751–755.

MCCANDLESS, L. C., RICHARDSON, S. and BEST, N. (2012). Adjustment for missing con-

founders using external validation data and propensity scores. Journal of the American Statistical Association 107 40–51.

MENG, X.-L. (1994). Multiple-imputation inferences with uncongenial sources of input. Statistical Science 538–558.

MITRA, R. and DUNSON, D. (2010). Two-level stochastic search variable selection in glms

with missing predictors. The international journal of biostatistics 6.

MORENO, T., QUEROL, X., ALASTUEY, A., VIANA, M. and GIBBONS, W. (2009). Profiling transient daytime peaks in urban air pollutants: city centre traffic hotspot versus urban background concentrations. Journal of Environmental Monitoring 11 1535–1542.

NENADIC, Z. and BURDICK, J. W. (2005). Spike detection using the continuous wavelet

transform. Biomedical Engineering, IEEE Transactions on 52 74–87.

OLIVER, M. A. and WEBSTER, R. (1990). Kriging: a method of interpolation for geo-

graphical information systems. International Journal of Geographical Information System 4 313–332.

PETROSIAN, A. A. and MEYER, F. G. (2013). Wavelets in signal and image analysis: from theory to practice, vol. 19. Springer Science & Business Media.

POLLOCK, S. and CASCIO, I. L. (2007). Non-dyadic wavelet analysis. In Optimisation,

Econometric and Financial Analysis. Springer, 167–203.

POPE III, C. A. (2007). Mortality effects of longer term exposures to fine particulate air

pollution: review of recent epidemiological evidence. Inhalation Toxicology 19 33–38.

RAFTERY, A. E. (1995). Bayesian model selection in social research. Sociological methodol- ogy 25 111–164.

RAFTERY, A. E., MADIGAN, D. and HOETING, J. A. (1997). Bayesian model averaging

for linear regression models. Journal of the American Statistical Association 92 179–191.

RUBIN, D. B. (1996). Multiple imputation after 18+ years. Journal of the American statistical

Association 91 473–489.

SAMET, J. M., DOMINICI, F., CURRIERO, F. C., COURSAC, I. and ZEGER, S. L. (2000). Fine particulate air pollution and mortality in 20 us cities, 1987–1994. New England journal of medicine 343 1742–1749.

SCHENKER, N. and TAYLOR, J. M. (1996). Partially parametric techniques for multiple

imputation. Computational Statistics & Data Analysis 22 425–446.

STURMER¨ , T., SCHNEEWEISS, S., AVORN, J. and GLYNN, R. J. (2005). Adjusting effect

estimates for unmeasured confounding with validation data using propensity score calibration. American journal of epidemiology 162 279–289.

STURMER¨ , T., SCHNEEWEISS, S., ROTHMAN, K. J., AVORN, J. and GLYNN, R. J. (2007). Performance of propensity score calibrationa simulation study. American journal of epi- demiology 165 1110–1118.

SWELDENS, W. (1998). The lifting scheme: A construction of second generation wavelets. SIAM Journal on Mathematical Analysis 29 511–546.

SZPIRO, A. A. and PACIOREK, C. J. (2013). Measurement error in two-stage analyses, with application to air pollution epidemiology. Environmetrics 24 501–517.

SZPIRO, A. A., PACIOREK, C. J. and SHEPPARD, L. (2011a). Does more accurate exposure prediction necessarily improve health effect estimates? Epidemiology (Cambridge, Mass.)

22680.

SZPIRO, A. A., SHEPPARD, L. and LUMLEY, T. (2011b). Efficient measurement error cor- rection with spatially misaligned data. Biostatistics kxq083.

TIBSHIRANI, R. (1996). Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society. Series B (Methodological) 267–288.

WAND, M., ORMEROD, J. T.ET AL. (2011). Penalized wavelets: Embedding wavelets into

semiparametric regression. Electronic Journal of Statistics 5 1654–1717.

WANG, C., DOMINICI, F., PARMIGIANI, G. and ZIGLER, C. M. (2015). Accounting for

uncertainty in confounder and effect modifier selection when estimating average causal effects in generalized linear models. Biometrics .

WANG, C., PARMIGIANI, G. and DOMINICI, F. (2012). Bayesian effect estimation account-

XIONG, R., XU, J. and WU, F. (2006). A lifting-based wavelet transform supporting non-

dyadic spatial scalability. In Image Processing, 2006 IEEE International Conference o=n. IEEE.

Related documents