
Statistical Inference via Data Science
A ModernDive into R and the Tidyverse
By: Chester Ismay, Albert Y. Kim, Arturo Valdivia
eText | 2 May 2025 | Edition Number 2
$135.35
Available: 2nd May 2025
Preorder. Online access available after release.
Not downloadable to your eReader or an app
Statistical Inference via Data Science: A ModernDive into R and the Tidyverse, Second Edition is a comprehensive guide to learning statistical inference with data science tools widely used in industry, academia, and government. The first part of the book introduces the tidyverse suite of R packages, including ggplot2 for data visualization and dplyr for data wrangling. The second part introduces data modeling via simple and multiple linear regression. The third part presents statistical inference using simulation-based methods within a general framework implemented in R via the infer package, a suitable complement to the tidyverse. By working with these methods, readers can implement effective exploratory data analyses, conduct statistical modeling with data, and carry out statistical inference via confidence intervals and hypothesis testing. All of these tasks are carried out with a strong emphasis on data visualization.
Key Features in the Second Edition:
- Minimal Prerequisites: No prior calculus or coding experience is needed, making the content accessible to a wide audience.
- Real-World Data: Learn with real-world datasets, including all domestic flights leaving New York City in 2023, the Gapminder project, FiveThirtyEight.com data, and new datasets on health, global development, music, coffee quality, and geyser eruptions.
- Simulation-Based Inference: Teaches statistical inference through simulation-based methods.
- Expanded Theoretical Discussions: Includes deeper coverage of theory-based approaches, their connection with simulation-based approaches, and a presentation of intuitive and formal aspects of these methods.
- Enhanced Use of the infer Package: Leverages the infer package for "tidy" and transparent statistical inference, enabling readers to construct confidence intervals and conduct hypothesis tests for multiple linear regression and beyond.
- Dynamic Online Resources: All code and output are embedded in the text, with additional interactive exercises, discussions, and solutions available online.
- Broadened Applications: Suitable for undergraduate and graduate courses, including statistics, data science, and courses emphasizing reproducible research.
The first edition of the book has been used in many different ways: in courses on statistical inference, statistical programming, business analytics, and data science for social policy, and by professionals in many other settings. Ideal for those new to statistics or looking to deepen their knowledge, this edition provides a clear entry point into data science and modern statistical methods.
ISBN: 9781040323410
ISBN-10: 1040323413
Series: Chapman & Hall/CRC The R Series
Format: ePUB
Language: English
Publisher: CRC Press
This product is categorised by
- Non-Fiction > Economics > Econometrics > Economic Statistics
- Non-Fiction > Computing & I.T. > Databases > Data Capture & Analysis
- Non-Fiction > Computing & I.T. > Computer Science > Image Processing
- Non-Fiction > Computing & I.T. > Computer Programming & Software Development > Programming & Scripting Languages
- Non-Fiction > Mathematics > Probability & Statistics