Practitioner's Guide to Data Science: Streamlining Data Science Solutions using Python, Scikit-Learn, and Azure ML Service Platform

  • 2h 46m
  • Nasir Ali Mirza
  • BPB Publications
  • 2022

"How is the Data Science project to be implemented?" has never been more conceptually sounding, thanks to the work presented in this book. This book provides an in-depth look at the current state of the world's data and how Data Science plays a pivotal role in everything we do.

This book explains and implements the entire Data Science lifecycle using well-known data science processes like CRISP-DM and Microsoft TDSP. The book explains the significance of these processes in connection with the high failure rate of Data Science projects.

The book helps build a solid foundation in Data Science concepts and related frameworks. It teaches how to implement real-world use cases using data from the HMDA dataset. It explains Azure ML Service architecture, its capabilities, and implementation to the DS team, who will then be prepared to implement MLOps. The book also explains how to use Azure DevOps to make the process repeatable while we're at it.

By the end of this book, you will learn strong Python coding skills, gain a firm grasp of concepts such as feature engineering, create insightful visualizations and become acquainted with techniques for building machine learning models.

What you will learn

  • Organize Data Science projects using CRISP-DM and Microsoft TDSP.
  • Learn to acquire and explore data using Python visualizations.
  • Get well versed with the implementation of data pre-processing and Feature Engineering.
  • Understand algorithm selection, model development, and model evaluation.
  • Hands-on with Azure ML Service, its architecture, and capabilities.
  • Learn to use Azure ML SDK and MLOps for implementing real-world use cases.

About the Author

Nasir Ali Mirza is a Data Architect and Data Science Professional with over 20 years of experience in data technologies. He has designed and implemented large-scale data movement pipelines and data transformations for very large global organizations in the private and public sectors like Lehman Brothers, Caudwell Communications, Bell South, Museum of Science, Delaware State, Wells Fargo, Kennametal, and GEICO utilizing big data and analytics platforms. He is currently working as a Data Architect at Applied Information Sciences designing and implementing modern data analytics solutions. Before joining AIS, he served in the Database and BI practice at Microsoft Global Services. In this role, he architected data solutions for customers in the banking, insurance, and telecom industries.

In this Book

  • Foreword
  • Data Science for Business
  • Data Science Project Methodologies and Team Processes
  • Business Understanding and its Data Landscape
  • Acquire, Explore, and Analyze Data
  • Pre-Processing and Preparing Data
  • Developing a Machine Learning Model
  • Lap around Azure ML Service
  • Deploying and Managing Models