Title: A latent variable approach to study gene-environment interactions in the presence of multiple correlated exposures.
Authors: Sánchez, Brisa N; Kang, Shan; Mukherjee, Bhramar
Published In Biometrics, (2012 Jun)
Abstract: Many existing cohort studies initially designed to investigate disease risk as a function of environmental exposures have collected genomic data in recent years with the objective of testing for gene-environment interaction (G × E) effects. In environmental epidemiology, interest in G × E arises primarily after a significant effect of the environmental exposure has been documented. Cohort studies often collect rich exposure data; as a result, assessing G × E effects in the presence of multiple exposure markers further increases the burden of multiple testing, an issue already present in both genetic and environment health studies. Latent variable (LV) models have been used in environmental epidemiology to reduce dimensionality of the exposure data, gain power by reducing multiplicity issues via condensing exposure data, and avoid collinearity problems due to presence of multiple correlated exposures. We extend the LV framework to characterize gene-environment interaction in presence of multiple correlated exposures and genotype categories. Further, similar to what has been done in case-control G × E studies, we use the assumption of gene-environment (G-E) independence to boost the power of tests for interaction. The consequences of making this assumption, or the issue of how to explicitly model G-E association has not been previously investigated in LV models. We postulate a hierarchy of assumptions about the LV model regarding the different forms of G-E dependence and show that making such assumptions may influence inferential results on the G, E, and G × E parameters. We implement a class of shrinkage estimators to data adaptively trade-off between the most restrictive to most flexible form of G-E dependence assumption and note that such class of compromise estimators can serve as a benchmark of model adequacy in LV models. We demonstrate the methods with an example from the Early Life Exposures in Mexico City to Neuro-Toxicants Study of lead exposure, iron metabolism genes, and birth weight.
PubMed ID: 21955029
MeSH Terms: Analysis of Variance; Bias; Biometry/methods*; Birth Weight/drug effects; Case-Control Studies; Computer Simulation; Environmental Exposure*; Epidemiologic Factors; Female; Gene-Environment Interaction*; Humans; Infant, Newborn; Iron/metabolism; Lead Poisoning/genetics; Lead Poisoning/pathology; Models, Statistical*; Pregnancy; Prenatal Exposure Delayed Effects; Principal Component Analysis