Exploratory Data Analysis Using R - Chapman & Hall/CRC Data Mining and Knowledge Discovery Series

Exploratory Data Analysis Using R

Exploratory Data Analysis Using R - Chapman & Hall/CRC Data Mining and Knowledge Discovery Series

hardcover
Pre-Order Published On: 1 July, 2026
Standard worldwide delivery by Mon, July 13 - Thu, July 16
Order within 0
Condition: NEW
$120.28
RRP $127.32
You save $7.04 (6%)
Price includes shipping
Coming Soon, Awaiting Publication
- +
FREE Returns within 30 days

Description

Exploratory Data Analysis Using R provides a classroom-tested introduction to exploratory data analysis (EDA), and this revised edition is accompanied by the R package ExploreTheData that implements many of the approaches described. As before, the primary focus of the book is on identifying "interesting" features - good, bad, and ugly - in a dataset, why it is important to find them, how to treat them, and more generally, the use of R to explore and explain datasets and the analysis results derived from them.

The book begins with a brief overview of exploratory data analysis using R, followed by a detailed discussion of creating various graphical data summaries in R. Then comes a thorough introduction to exploratory data analysis, and a detailed treatment of 13 data anomalies, why they are important, how to find them, and some options for addressing them. Subsequent chapters introduce the mechanics of working with external data, structured query language (SQL) for interacting with relational databases, linear regression analysis (the simplest and historically most important class of predictive models), and crafting data stories to explain our results to others. These chapters use R as an interactive data analysis platform, while Chapter 9 turns to writing programs in R, focusing on creating custom functions that can greatly simplify repetitive analysis tasks. Further chapters expand the scope to more advanced topics and techniques: special considerations for working with text data, a second look at exploratory data analysis, and more general predictive models.

The book is designed for both advanced undergraduate, entry-level graduate students, and working professionals with little to no prior exposure to data analysis, modeling, statistics, or programming. It keeps the treatment relatively non-mathematical, even though data analysis is an inherently mathematical subject. Exercises are included at the end of most chapters, and an instructor's solution manual is available.

See more

More Details

Type Book
ISBN13 9781032814810
ISBN10 1032814810
Number Of Pages 592
Item Weight 453 g
Publisher / Reseller Taylor & Francis Ltd
Format hardcover
Edition 2nd edition
See More +

Author's Bio

Ronald K. Pearson holds a PhD in Electrical Engineering and Computer Science from the Massachussetts Institute of Technology and has more than 40 years professional experience in exploratory data analysis. Dr. Pearson has held industrial, business, and academic positions in the fields of industrial process control, bioinformatics, drug safety data analysis, software development, and insurance. He has authored or co-authored books including Exploring Data in Engineering, the Sciences, and Medicine (Oxford University Press, 2011) and Mining Imperfect Data with Examples in R and Python (SIAM, 2020).

Show more