Statistics for Big Data for Dummies

  • 5h 1m
  • Alan Anderson, David Semmelroth
  • John Wiley & Sons (US)
  • 2015

Learn to:

  • Collect, clean, and interpret data
  • Effectively communicate data analysis
  • Make good predictions

Big data making you dizzy? Relax—here's what it's all about

Big data figures into everything from weather forecasting to political polling. Don't let it give you a big headache; use this friendly book to learn about it in manageable, bite-size chunks. You'll get a handle on the statistical methods used when working with big data, applications for it, ways to organize and check data, and a whole lot more.

  • Solving the big mystery—find out what big data is, characteristics that define it, how it's used, and what it makes possible
  • How to handle it — explore statistical techniques used with big data, including probability distributions, regression analysis, time series analysis, and forecasting techniques
  • Getting graphical — learn how big data can be analyzed with graphical techniques and how to identify valid, useful, and understandable patterns in data
  • A variable approach — examine key univariate and multivariate statistical techniques for analyzing data
  • Thinking ahead — discover techniques for forecasting the future values of a dataset
  • There's a tool for that — learn about the best software packages and programming tools for analyzing statistical data

Open the book and find:

  • Ways to extract previously unknown information from a database
  • Tips for data collection and cleaning
  • Techniques for analyzing time series data
  • How to check data for missing information
  • What to do with outliers in a dataset
  • Some surprising uses for big data
  • An overview of modeling techniques

About the Authors

Alan Anderson, PhD, is a professor of economics and finance at Fordham University and New York University. He's a veteran economist, risk manager, and fixed income analyst.

David Semmelroth is an experienced data analyst, trainer, and statistics instructor who consults on customer databases and database marketing.

In this Book

  • Introduction
  • What Is Big Data and What Do You Do with It?
  • Characteristics of Big Data—The Three Vs
  • Using Big Data—The Hot Applications
  • Understanding Probabilities
  • Basic Statistical Ideas
  • Dirty Work—Preparing Your Data for Analysis
  • Figuring the Format—Important Computer File Formats
  • Checking Assumptions—Testing for Normality
  • Dealing with Missing or Incomplete Data
  • Sending Out a Posse—Searching for Outliers
  • An Overview of Exploratory Data Analysis (EDA)
  • A Plot to Get Graphical—Graphical Techniques
  • You're the Only Variable for Me—Univariate Statistical Techniques
  • To All the Variables We've Encountered—Multivariate Statistical Techniques
  • Regression Analysis
  • When You've Got the Time—Time Series Analysis
  • Using Your Crystal Ball—Forecasting with Big Data
  • Crunching Numbers—Performing Statistical Analysis on Your Computer
  • Seeking Free Sources of Financial Data
  • Ten (or So) Best Practices in Data Preparation
  • Ten (or So) Questions Answered by Exploratory Data Analysis (EDA)