SQL for Data Scientists: A Beginner's Guide for Building Data Sets for Analysis

  • 3h 56m
  • Renee M. P. Teate
  • John Wiley & Sons (US)
  • 2021

Jump-start your career as a data scientist―learn to develop datasets for exploration, analysis, and machine learning

SQL for Data Scientists: A Beginner's Guide for Building Datasets for Analysis is a resource that’s dedicated to the Structured Query Language (SQL) and dataset design skills that data scientists use most. Aspiring data scientists will learn how to how to construct datasets for exploration, analysis, and machine learning. You can also discover how to approach query design and develop SQL code to extract data insights while avoiding common pitfalls.

You may be one of many people who are entering the field of Data Science from a range of professions and educational backgrounds, such as business analytics, social science, physics, economics, and computer science. Like many of them, you may have conducted analyses using spreadsheets as data sources, but never retrieved and engineered datasets from a relational database using SQL, which is a programming language designed for managing databases and extracting data.

This guide for data scientists differs from other instructional guides on the subject. It doesn’t cover SQL broadly. Instead, you’ll learn the subset of SQL skills that data analysts and data scientists use frequently. You’ll also gain practical advice and direction on "how to think about constructing your dataset."

  • Gain an understanding of relational database structure, query design, and SQL syntax
  • Develop queries to construct datasets for use in applications like interactive reports and machine learning algorithms
  • Review strategies and approaches so you can design analytical datasets
  • Practice your techniques with the provided database and SQL code

In this book, author Renee Teate shares knowledge gained during a 15-year career working with data, in roles ranging from database developer to data analyst to data scientist. She guides you through SQL code and dataset design concepts from an industry practitioner’s perspective, moving your data scientist career forward!

In this Book

  • Introduction
  • Data Sources
  • The SELECT Statement
  • The WHERE Clause
  • CASE Statements
  • Aggregating Results for Analysis
  • Window Functions and Subqueries
  • Date and Time Functions
  • Exploratory Data Analysis with SQL
  • Building SQL Datasets for Analytical Reporting
  • More Advanced Query Structures
  • Creating Machine Learning Datasets Using SQL
  • Analytical Dataset Development Examples
  • Storing and Modifying Data


Course T-SQL Queries
Rating 4.3 of 218 users Rating 4.3 of 218 users (218)
Rating 4.0 of 72 users Rating 4.0 of 72 users (72)
Rating 4.2 of 319 users Rating 4.2 of 319 users (319)