Mining Social Media: Finding Stories in Internet Data

  • 3h 31m
  • Lam Thuy Vo
  • No Starch Press
  • 2020

BuzzFeed News Senior Reporter Lam Thuy Vo explains how to mine, process, and analyze data from the social web in meaningful ways with the Python programming language.

Did fake Twitter accounts help sway a presidential election? What can Facebook and Reddit archives tell us about human behavior? In Mining Social Media, senior BuzzFeed reporter Lam Thuy Vo shows you how to use Python and key data analysis tools to find the stories buried in social media.

Whether you're a professional journalist, an academic researcher, or a citizen investigator, you'll learn how to use technical tools to collect and analyze data from social media sources to build compelling, data-driven stories.

Learn how to:

  • Write Python scripts and use APIs to gather data from the social web
  • Download data archives and dig through them for insights
  • Inspect HTML downloaded from websites for useful content
  • Format, aggregate, sort, and filter your collected data using Google Sheets
  • Create data visualizations to illustrate your discoveries
  • Perform advanced data analysis using Python, Jupyter Notebooks, and the pandas library
  • Apply what you've learned to research topics on your own

Social media is filled with thousands of hidden stories just waiting to be told. Learn to use the data-sleuthing tools that professionals use to write your own data-driven stories.

About the Author

Lam Thuy Vo is a senior reporter at BuzzFeed News where she focuses on the intersection of technology, society, and social media data. She has reported for The Wall Street Journal, Al Jazeera America, and NPR's Planet Money, telling economic stories across the US and throughout Asia. Vo has also spent over a decade as an educator, training newsrooms and developing courses for the Craig Newmark Graduate School of Journalism at CUNY.

In this Book

  • Introduction
  • The Programming Languages You’ll Need to Know
  • Where to Get Your Data
  • Getting Data with Code
  • Scraping Your Own Facebook Data
  • Scraping a Live Site
  • Introduction to Data Analysis
  • Visualizing Your Data
  • Advanced Tools for Data Analysis
  • Finding Trends in Reddit Data
  • Measuring the Twitter Activity of Political Actors
  • Where to Go from Here
SHOW MORE
FREE ACCESS