Advanced R: Data Programming and the Cloud

  • 5h 8m
  • Joshua F. Wiley, Matt Wiley
  • Apress
  • 2016

Program for data analysis using R and learn practical skills to make your work more efficient. This book covers how to automate running code and the creation of reports to share your results, as well as writing functions and packages. Advanced R is not designed to teach advanced R programming nor to teach the theory behind statistical procedures. Rather, it is designed to be a practical guide moving beyond merely using R to programming in R to automate tasks.

This book will show you how to manipulate data in modern R structures and includes connecting R to data bases such as SQLite, PostgeSQL, and MongoDB. The book closes with a hands-on section to get R running in the cloud. Each chapter also includes a detailed bibliography with references to research articles and other resources that cover relevant conceptual and theoretical topics.

What You Will Learn

  • Write and document R functions
  • Make an R package and share it via GitHub or privately
  • Add tests to R code to insure it works as intended
  • Build packages automatically with GitHub
  • Use R to talk directly to databases and do complex data management
  • Run R in the Amazon cloud
  • Generate presentation-ready tables and reports using R

Who This Book Is For

Working professionals, researchers, or students who are familiar with R and basic statistical techniques such as linear regression and who want to learn how to take their R coding and programming to the next level.

About the Authors

Joshua F. Wiley is a lecturer in the Monash Institute for Cognitive and Clinical Neurosciences and School of Psychological Sciences at Monash University and a senior partner at Elkhart Group Limited, a statistical consultancy. He earned his PhD from the University of California, Los Angeles, and his research focuses on using advanced quantitative methods to understand the complex interplays of psychological, social, and physiological processes in relation to psychological and physical health. In statistics and data science, Joshua focuses on biostatistics and is interested in reproducible research and graphical displays of data and statistical models. Through consulting at Elkhart Group Limited and former work at the UCLA Statistical Consulting Group, he has supported a wide array of clients ranging from graduate students, to experienced researchers, and biotechnology companies. He also develops or co-develops a number of R packages including varian, a package to conduct Bayesian scale-location structural equation models, and MplusAutomation, a popular package that links R to the commercial Mplus software.

Matt Wiley is a tenured, associate professor of mathematics with awards in both mathematics education and honour student engagement. He earned degrees in pure mathematics, computer science, and business administration through the University of California and Texas A&M systems. He serves as director for Victoria College’s quality enhancement plan and managing partner at Elkhart Group Limited, a statistical consultancy. With programming experience in R, C++, Ruby, Fortran, and JavaScript, he has always found ways to meld his passion for writing with his joy of logical problem solving and data science. From the boardroom to the classroom, Matt enjoys finding dynamic ways to partner with interdisciplinary and diverse teams to make complex ideas and projects understandable and solvable.

In this Book

  • Programming Basics
  • Programming Utilities
  • Programming Automation
  • Writing Functions
  • Writing Classes and Methods
  • Writing a Package
  • Introduction to Data Management Using data.table
  • Data Munging with data.table
  • Other Tools for Data Management
  • Reading Big Data(bases)
  • Getting a Cloud
  • Cloud Ubuntu for Windows Users
  • Every Cloud has a Shiny Lining
  • Shiny Dashboard Sampler
  • Dynamic Reports and the Cloud
SHOW MORE
FREE ACCESS