Data Science Bookcamp

  • 18h 1m 33s
  • Leonard Apeltsin
  • Manning Publications
  • 2021
Learn data science with Python by building five real-world projects! Experiment with card game predictions, tracking disease outbreaks, and more, as you build a flexible and intuitive understanding of data science.

In Data Science Bookcamp, you will learn:

  • Techniques for computing and plotting probabilities
  • Statistical analysis using Scipy
  • Web scraping
  • How to organize datasets with clustering algorithms
  • How to visualize complex multi-variable datasets
  • How to train a decision tree machine learning algorithm
In Data Science Bookcamp, you’ll test and build your knowledge of Python with the kind of open-ended problems that professional data scientists work on every day. Downloadable data sets and thoroughly-explained solutions help you lock in what you’ve learned, building your confidence and making you ready for an exciting new data science career. About the Technology

A data science project has a lot of moving parts, and it takes practice and skill to get all the code, algorithms, datasets, formats, and visualizations working together harmoniously. This unique audiobook guides you through five realistic projects, including tracking disease outbreaks from news headlines, analyzing social networks, and finding relevant patterns in ad click data.

About the Audiobook

Data Science Bookcamp doesn’t stop with surface-level theory and toy examples. As you work through each project, you’ll learn how to troubleshoot common problems like missing data, messy data, and algorithms that don’t quite fit the model you’re building. You’ll appreciate the detailed setup instructions and the fully explained solutions that highlight common failure points.

For listeners who know the basics of Python. No prior data science or machine learning skills required. About the Author

Leonard Apeltsin is the head of data science at Anomaly, where his team applies advanced analytics to uncover healthcare fraud, waste, and abuse.

In this Audiobook

  • Section 1 - Computing probabilities using Python
  • Section 2 - Plotting probabilities using Matplotlib
  • Section 3 - Running random simulations in NumPy
  • Section 4 - Case study 1 solution
  • Section 5 - Basic probability and statistical analysis using SciPy
  • Section 6 - Making predictions using the central limit theorem and SciPy
  • Section 7 - Statistical hypothesis testing
  • Section 8 - Analyzing tables using Pandas
  • Section 9 - Case study 2 solution
  • Section 10 - Clustering data into groups
  • Section 11 - Geographic location visualization and analysis
  • Section 12 - Case study 3 solution
  • Section 13 - Measuring text similarities
  • Section 14 - Dimension reduction of matrix data
  • Section 15 - NLP analysis of large text datasets
  • Section 16 - Extracting text from web pages
  • Section 17 - Case study 4 solution
  • Section 18 - An introduction to graph theory and network analysis
  • Section 19 - Dynamic graph theory techniques for node ranking and social network analysis
  • Section 20 - Network-driven supervised machine learning
  • Section 21 - Training linear classifiers with logistic regression
  • Section 22 - Training nonlinear classifiers with decision tree techniques
  • Section 23 - Case study 5 solution
SHOW MORE
FREE ACCESS