Advances in statistical methodology and computing have played an important role in allowing researchers to more accurately assess the health effects of ambient air pollution. The methods and software developed in this area are applicable to a wide array of problems in environmental epidemiology. This book provides an overview of the methods used for investigating the health effects of air pollution and gives examples and case studies in R which demonstrate the application of those methods to real data. The book will be useful to statisticians, epidemiologists, and graduate students working in the area of air pollution and health and others analyzing similar data.
The authors describe the different existing approaches to statistical modeling and cover basic aspects of analyzing and understanding air pollution and health data. The case studies in each chapter demonstrate how to use R to apply and interpret different statistical models and to explore the effects of potential confounding factors. A working knowledge of R and regression modeling is assumed. In-depth knowledge of R programming is not required to understand and run the examples.
Researchers in this area will find the book useful as a ``live'' reference. Software for all of the analyses in the book is downloadable from the web and is available under a Free Software license. The reader is free to run the examples in the book and modify the code to suit their needs. In addition to providing the software for developing the statistical models, the authors provide the entire database from the National Morbidity Mortality and Air Pollution Study (NMMAPS) in a convenient R package. With the database, readers can run the examples and experiment with their own methods and ideas.
Roger D. Peng is an Assistant Professor in the Department of Biostatistics at the Johns Hopkins Bloomberg School of Public Health. He is a prominent researcher in the areas of air pollution and health risk assessmentand statistical methods for spatial and temporal data. Dr. Peng is the author of numerous R packages and is a frequent contributor to the R mailing lists.
Francesca Dominici is a Professor in the Department of Biostatistics at the Johns Hopkins Bloomberg School of Public Health. She has published extensively on hierarchical and semiparametric modeling and has been the leader of major national studies of the health effects of air pollution. She has also participated in numerous panels conducted by the National Academy of Science assessing the health effects of environmental exposures and has consulted for the US Environmental Protection Agency's Clean Air Act Advisory Board.
Zusammenfassung
As an area of statistical application, environmental epidemiology and more speci cally, the estimation of health risk associated with the exposure to - vironmental agents, has led to the development of several statistical methods and software that can then be applied to other scienti c areas. The stat- tical analyses aimed at addressing questions in environmental epidemiology have the following characteristics. Often the signal-to-noise ratio in the data is low and the targets of inference are inherently small risks. These constraints typically lead to the development and use of more sophisticated (and pot- tially less transparent) statistical models and the integration of large hi- dimensional databases. New technologies and the widespread availability of powerful computing are also adding to the complexities of scienti c inves- gation by allowing researchers to t large numbers of models and search over many sets of variables. As the number of variables measured increases, so do the degrees of freedom for in uencing the association between a risk factor and an outcome of interest. We have written this book, in part, to describe our experiences developing and applying statistical methods for the estimation for air pollution health e ects. Our experience has convinced us that the application of modern s- tistical methodology in a reproducible manner can bring to bear subst- tial bene ts to policy-makers and scientists in this area. We believe that the methods described in this book are applicable to other areas of environmental epidemiology, particularly those areas involving spatial{temporal exposures.
Inhalt
Studies of Air Pollution and Health.- to R and Air Pollution and Health Data.- Reproducible Research Tools.- Statistical Issues in Estimating the Health Effects of SpatialTemporal Environmental Exposures..- Exploratory Data Analyses.- Statistical Models.- Pooling Risks Across Locations and Quantifying Spatial Heterogeneity.- A Reproducible Seasonal Analysis of Particulate Matter and Mortality in the United States.