Regression Modeling Strategies

Author: Frank Harrell
Publisher: Springer
ISBN: 3319194259
Format: PDF, ePub
Download Now
This highly anticipated second edition features new chapters and sections, 225 new references, and comprehensive R software. In keeping with the previous edition, this book is about the art and science of data analysis and predictive modeling, which entails choosing and using multiple tools. Instead of presenting isolated techniques, this text emphasizes problem solving strategies that address the many issues arising when developing multivariable models using real data and not standard textbook examples. It includes imputation methods for dealing with missing data effectively, methods for fitting nonlinear relationships and for making the estimation of transformations a formal part of the modeling process, methods for dealing with "too many variables to analyze and not enough observations," and powerful model validation techniques based on the bootstrap. The reader will gain a keen understanding of predictive accuracy and the harm of categorizing continuous predictors or outcomes. This text realistically deals with model uncertainty and its effects on inference, to achieve "safe data mining." It also presents many graphical methods for communicating complex regression models to non-statisticians. Regression Modeling Strategies presents full-scale case studies of non-trivial datasets instead of over-simplified illustrations of each method. These case studies use freely available R functions that make the multiple imputation, model building, validation and interpretation tasks described in the book relatively easy to do. Most of the methods in this text apply to all regression models, but special emphasis is given to multiple regression using generalized least squares for longitudinal data, the binary logistic model, models for ordinal responses, parametric survival regression models and the Cox semi parametric survival model. A new emphasis is given to the robust analysis of continuous dependent variables using ordinal regression. As in the first edition, this text is intended for Masters' or Ph.D. level graduate students who have had a general introductory probability and statistics course and who are well versed in ordinary multiple regression and intermediate algebra. The book will also serve as a reference for data analysts and statistical methodologists, as it contains an up-to-date survey and bibliography of modern statistical modeling techniques. Examples used in the text mostly come from biomedical research, but the methods are applicable anywhere predictive models ("analytics") are useful, including economics, epidemiology, sociology, psychology, engineering and marketing.

Regression Modeling Strategies

Author: Frank Harrell
Publisher: Springer Science & Business Media
ISBN: 147573462X
Format: PDF, ePub
Download Now
Many texts are excellent sources of knowledge about individual statistical tools, but the art of data analysis is about choosing and using multiple tools. Instead of presenting isolated techniques, this text emphasizes problem solving strategies that address the many issues arising when developing multivariable models using real data and not standard textbook examples. It includes imputation methods for dealing with missing data effectively, methods for dealing with nonlinear relationships and for making the estimation of transformations a formal part of the modeling process, methods for dealing with "too many variables to analyze and not enough observations," and powerful model validation techniques based on the bootstrap. This text realistically deals with model uncertainty and its effects on inference to achieve "safe data mining".

Regression Modeling Strategies

Author: Frank E. Harrell
Publisher: Springer Science & Business Media
ISBN: 9780387952321
Format: PDF, ePub, Mobi
Download Now
The book will serve as a reference for data analysts and statistical methodologists.

Regression Models as a Tool in Medical Research

Author: Werner Vach
Publisher: CRC Press
ISBN: 1466517492
Format: PDF
Download Now
While regression models have become standard tools in medical research, understanding how to properly apply the models and interpret the results is often challenging for beginners. Regression Models as a Tool in Medical Research presents the fundamental concepts and important aspects of regression models most commonly used in medical research, including the classical regression model for continuous outcomes, the logistic regression model for binary outcomes, and the Cox proportional hazards model for survival data. The text emphasizes adequate use, correct interpretation of results, appropriate presentation of results, and avoidance of potential pitfalls. After reviewing popular models and basic methods, the book focuses on advanced topics and techniques. It considers the comparison of regression coefficients, the selection of covariates, the modeling of nonlinear and nonadditive effects, and the analysis of clustered and longitudinal data, highlighting the impact of selection mechanisms, measurement error, and incomplete covariate data. The text then covers the use of regression models to construct risk scores and predictors. It also gives an overview of more specific regression models and their applications as well as alternatives to regression modeling. The mathematical details underlying the estimation and inference techniques are provided in the appendices.

Applied Survival Analysis Using R

Author: Dirk F. Moore
Publisher: Springer
ISBN: 3319312456
Format: PDF
Download Now
Applied Survival Analysis Using R covers the main principles of survival analysis, gives examples of how it is applied, and teaches how to put those principles to use to analyze data using R as a vehicle. Survival data, where the primary outcome is time to a specific event, arise in many areas of biomedical research, including clinical trials, epidemiological studies, and studies of animals. Many survival methods are extensions of techniques used in linear regression and categorical data, while other aspects of this field are unique to survival data. This text employs numerous actual examples to illustrate survival curve estimation, comparison of survivals of different groups, proper accounting for censoring and truncation, model variable selection, and residual analysis. Because explaining survival analysis requires more advanced mathematics than many other statistical topics, this book is organized with basic concepts and most frequently used procedures covered in earlier chapters, with more advanced topics near the end and in the appendices. A background in basic linear regression and categorical data analysis, as well as a basic knowledge of calculus and the R system, will help the reader to fully appreciate the information presented. Examples are simple and straightforward while still illustrating key points, shedding light on the application of survival analysis in a way that is useful for graduate students, researchers, and practitioners in biostatistics.

Statistical Modeling for Biomedical Researchers

Author: William D. Dupont
Publisher: Cambridge University Press
ISBN: 1139643819
Format: PDF, Docs
Download Now
The second edition of this standard text guides biomedical researchers in the selection and use of advanced statistical methods and the presentation of results to clinical colleagues. It assumes no knowledge of mathematics beyond high school level and is accessible to anyone with an introductory background in statistics. The Stata statistical software package is again used to perform the analyses, this time employing the much improved version 10 with its intuitive point and click as well as character-based commands. Topics covered include linear, logistic and Poisson regression, survival analysis, fixed-effects analysis of variance, and repeated-measure analysis of variance. Restricted cubic splines are used to model non-linear relationships. Each method is introduced in its simplest form and then extended to cover more complex situations. An appendix will help the reader select the most appropriate statistical methods for their data. The text makes extensive use of real data sets available at http://biostat.mc.vanderbilt.edu/dupontwd/wddtext/.

Multivariate Survival Analysis and Competing Risks

Author: Martin J. Crowder
Publisher: CRC Press
ISBN: 1439875227
Format: PDF
Download Now
Multivariate Survival Analysis and Competing Risks introduces univariate survival analysis and extends it to the multivariate case. It covers competing risks and counting processes and provides many real-world examples, exercises, and R code. The text discusses survival data, survival distributions, frailty models, parametric methods, multivariate data and distributions, copulas, continuous failure, parametric likelihood inference, and non- and semi-parametric methods. There are many books covering survival analysis, but very few that cover the multivariate case in any depth. Written for a graduate-level audience in statistics/biostatistics, this book includes practical exercises and R code for the examples. The author is renowned for his clear writing style, and this book continues that trend. It is an excellent reference for graduate students and researchers looking for grounding in this burgeoning field of research.

Modeling Discrete Time to Event Data

Author: Gerhard Tutz
Publisher: Springer
ISBN: 3319281585
Format: PDF, Mobi
Download Now
This book focuses on statistical methods for the analysis of discrete failure times. Failure time analysis is one of the most important fields in statistical research, with applications affecting a wide range of disciplines, in particular, demography, econometrics, epidemiology and clinical research. Although there are a large variety of statistical methods for failure time analysis, many techniques are designed for failure times that are measured on a continuous scale. In empirical studies, however, failure times are often discrete, either because they have been measured in intervals (e.g., quarterly or yearly) or because they have been rounded or grouped. The book covers well-established methods like life-table analysis and discrete hazard regression models, but also introduces state-of-the art techniques for model evaluation, nonparametric estimation and variable selection. Throughout, the methods are illustrated by real life applications, and relationships to survival analysis in continuous time are explained. Each section includes a set of exercises on the respective topics. Various functions and tools for the analysis of discrete survival data are collected in the R package discSurv that accompanies the book.

Applied Predictive Modeling

Author: Max Kuhn
Publisher: Springer Science & Business Media
ISBN: 1461468493
Format: PDF, ePub, Docs
Download Now
Applied Predictive Modeling covers the overall predictive modeling process, beginning with the crucial steps of data preprocessing, data splitting and foundations of model tuning. The text then provides intuitive explanations of numerous common and modern regression and classification techniques, always with an emphasis on illustrating and solving real data problems. The text illustrates all parts of the modeling process through many hands-on, real-life examples, and every chapter contains extensive R code for each step of the process. This multi-purpose text can be used as an introduction to predictive models and the overall modeling process, a practitioner’s reference handbook, or as a text for advanced undergraduate or graduate level predictive modeling courses. To that end, each chapter contains problem sets to help solidify the covered concepts and uses data available in the book’s R package. This text is intended for a broad audience as both an introduction to predictive models as well as a guide to applying them. Non-mathematical readers will appreciate the intuitive explanations of the techniques while an emphasis on problem-solving with real data across a wide variety of applications will aid practitioners who wish to extend their expertise. Readers should have knowledge of basic statistical ideas, such as correlation and linear regression analysis. While the text is biased against complex equations, a mathematical background is needed for advanced topics.

Multivariable Model Building

Author: Patrick Royston
Publisher: John Wiley & Sons
ISBN: 9780470770788
Format: PDF
Download Now
Multivariable regression models are of fundamental importance in all areas of science in which empirical data must be analyzed. This book proposes a systematic approach to building such models based on standard principles of statistical modeling. The main emphasis is on the fractional polynomial method for modeling the influence of continuous variables in a multivariable context, a topic for which there is no standard approach. Existing options range from very simple step functions to highly complex adaptive methods such as multivariate splines with many knots and penalisation. This new approach, developed in part by the authors over the last decade, is a compromise which promotes interpretable, comprehensible and transportable models.