Python Data Analytics: Data Analysis and Science Using Pandas, Matplotlib, and the Python Programming Language
- 4h 33m
- Fabio Nelli
Python Data Analytics will help you tackle the world of data acquisition and analysis using the power of the Python language. At the heart of this book lies the coverage of pandas, an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language.
Author Fabio Nelli expertly shows the strength of the Python programming language when applied to processing, managing and retrieving information. Inside, you will see how intuitive and flexible it is to discover and communicate meaningful patterns of data using Python scripts, reporting systems, and data export. This book examines how to go about obtaining, processing, storing, managing and analyzing data using the Python programming language.
You will use Python and other open source tools to wrangle data and tease out interesting and important trends in that data that will allow you to predict future patterns. Whether you are dealing with sales data, investment data (stocks, bonds, etc.), medical data, web page usage, or any other type of data set, Python can be used to interpret, analyze, and glean information from a pile of numbers and statistics.
This book is an invaluable reference with its examples of storing and accessing data in a database; it walks you through the process of report generation; it provides three real world case studies or examples that you can take with you for your everyday analysis needs.
What you’ll learn
- How to set up your environment and use automation scripts
- Where and how to get your data
- How to analyze your data with Python's data analysis library, called pandas
- How to carry out visualization with matplotlib
- How to analyze text data using Python's natural language processing
Who this book is for
This book is for the mid to experienced level programmer who already knows the basics of Python programming. It is for programmers who want to know how to use database data and reporting tools to manipulate raw data into coherent useful information.
About the Author
Fabio Nelli, is an IT Scientific Application Specialist at IRBM Science Park, a private research center in Pomezia, Roma (Italy). He has beena computer consultant for many years at IBM, EDS, Merck Sharp, and Dohme, along with several banks and insurance companies. He has an Organic Chemistry degree and many years of experience in Information technologies and Automation systems applied to Life Sciences (Tech Specialist at Beckman Coulter Italy and Spain).
He is currently developing Java applications that interface Oracle databases with scientific instrumentations generating data and web server applications providing analysis of the results to researchers in real time.
In this Book
An Introduction to Data Analysis
Introduction to the Python's World
The NumPy Library
The Pandas Library—An Introduction
Pandas: Reading and Writing Data
Pandas in Depth: Data Manipulation
Data Visualization with Matplotlib
Machine Learning with Scikit-Learn
An Example—Meteorological Data
Recognizing Handwritten Digits