Learning Data Science : Data Wrangling, Exploration, Visualization, and Modeling with Python - Sam Lau

Learning Data Science

Data Wrangling, Exploration, Visualization, and Modeling with Python

By: Sam Lau, Joseph Gonzalez, Deborah Nolan

Paperback | 29 September 2023

At a Glance

Paperback


RRP $171.00

$73.75

57%OFF

or 4 interest-free payments of $18.44 with

 or 

Aims to ship in 15 to 25 business days

As an aspiring data scientist, you appreciate why organizations rely on data for important decisions—whether it's for companies designing websites, cities deciding how to improve services, or scientists discovering how to stop the spread of disease. And you want the skills required to distill a messy pile of data into actionable insights. We call this the data science lifecycle: the process of collecting, wrangling, analyzing, and drawing conclusions from data.

Learning Data Science is the first book to cover foundational skills in both programming and statistics that encompass this entire lifecycle. It's aimed at those who wish to become data scientists or who already work with data scientists, and at data analysts who wish to cross the "technical/nontechnical" divide. If you have a basic knowledge of Python programming, you'll learn how to work with data using industry-standard tools like pandas.
  • Refine a question of interest to one that can be studied with data
  • Pursue data collection that may involve text processing, web scraping, etc.
  • Glean valuable insights about data through data cleaning, exploration, and visualization
  • Learn how to use modeling to describe the data
  • Generalize findings beyond the data
About the Authors

Sam Lau is a PhD candidate at UC San Diego. He designs novel interfaces for learning and teaching data science, and his research has been published in top-tier conferences in human-computer interaction and end-user programming. Sam instructed and helped design flagship data science courses at UC Berkeley. These courses have grown to serve thousands of students every year and their curriculum is used by universities across the world.

Joseph (Joey) Gonzalez is an assistant professor in the EECS department at UC Berkeley and a founding member of the new UC Berkeley RISE Lab. His research interests are at the intersection of machine learning and data systems, including: dynamic deep neural networks for transfer learning, accelerated deep learning for high-resolution computer vision, and software platforms for autonomous vehicles.

Joey is also co-founder of Turi Inc. (formerly GraphLab), which was based on his work on the GraphLab and PowerGraph Systems. Turi was recently acquired by Apple Inc.

Deborah (Deb) Nolan is Professor of Statistics and Associate Dean for Undergraduate Studies in the Division of Computing, Data Science, and Society at the University of California, Berkeley, where she holds the Zaffaroni Family Chair in Undergraduate Education. Her research has involved the empirical process, high-dimensional modeling, and, more recently, technology in education and reproducible research. Her pedagogical approach connects research, practice and education, and she is co-author of 4 textbooks: Stat Labs, Teaching Statistics, Data Science in R, and Communicating with Data.

More in Machine Learning

AI Based Advancements in Biometrics and its Applications - Balasubramaniam S
How We Learn : The New Science of Education and the Brain - Stanislas Dehaene
Scaling Python with Dask : From Data Science to Machine Learning - Holden Karau
AI Machine Learning - Dr. Kyle Allison

Fold-Out Book or Chart

RRP $19.99

$18.90

Practical Data Privacy : Enhancing Privacy and Security in Data - Katharine Jarmul
Implementing MLOps in the Enterprise : A Production-First Approach - Yaron Haviv
Learning Spark : Lightning-Fast Data Analytics - Jules S. Damji

RRP $152.00

$66.25

56%
OFF