Fundamentals of Data Engineering: Plan and Build Robust Data Systems

  • 17h 31m 4s
  • Joe Reis, Matt Housley
  • Gildan Media
  • 2022

Data engineering has grown rapidly in the past decade, leaving many software engineers, data scientists, and analysts looking for a comprehensive view of this practice. With this practical book, you'll learn how to plan and build systems to serve the needs of your organization and customers by evaluating the best technologies available through the framework of the data engineering lifecycle.

Authors Joe Reis and Matt Housley walk you through the data engineering lifecycle and show you how to stitch together a variety of cloud technologies to serve the needs of downstream data consumers. You'll understand how to apply the concepts of data generation, ingestion, orchestration, transformation, storage, and governance that are critical in any data environment regardless of the underlying technology.

This book will help you: get a concise overview of the entire data engineering landscape; assess data engineering problems using an end-to-end framework of best practices; cut through marketing hype when choosing data technologies, architecture, and processes; use the data engineering lifecycle to design and build a robust architecture; and incorporate data governance and security across the data engineering lifecycle.

About the Author

Joe Reis is a business-minded data nerd who's worked in the data industry for 20 years, with responsibilities ranging from statistical modeling, forecasting, machine learning, data engineering, data architecture, and almost everything else in between. Joe is the CEO and cofounder of Ternary Data, a data engineering and architecture consulting firm based in Salt Lake City, Utah. In addition, he volunteers with several technology groups and teaches at the University of Utah. In his spare time, Joe likes to rock climb, produce electronic music, and take his kids on crazy adventures.

Matt Housley is a data engineering consultant and cloud specialist. After some early programming experience with Logo, Basic, and 6502 assembly, he completed a PhD in mathematics at the University of Utah. Matt then began working in data science, eventually specializing in cloud-based data engineering. He cofounded Ternary Data with Joe Reis, where he leverages his teaching experience to train future data engineers and advise teams on robust data architecture. Matt and Joe also pontificate on all things data on The Monday Morning Data Chat.

In this Audiobook

  • Chapter 1 - Data Engineering Described
  • Chapter 2 - The Data Engineering Lifecycle
  • Chapter 3 - Designing Good Data Architecture
  • Chapter 4 - Choosing Technologies Across the Data Engineering Lifecycle
  • Chapter 5 - Data Generation in Source Systems
  • Chapter 6 - Storage
  • Chapter 7 - Ingestion
  • Chapter 8 - Queries, Modeling, and Transformation
  • Chapter 9 - Serving Data for Analytics, Machine Learning, and Reverse ETL
  • Chapter 10 - Security and Privacy
  • Chapter 11 - The Future of Data Engineering
  • Appendix A: Serialization and Compression Technical Details
  • Appendix B: Cloud Networking
SHOW MORE
FREE ACCESS