Practical Graph Analytics with Apache Giraph

  • 6h 51m
  • Claudio Martella, Dionysios Logothetis, Roman Shaposhnik
  • Apress
  • 2015

Practical Graph Analytics with Apache Giraph helps you build data mining and machine learning applications using the Apache Software Foundation’s Giraph framework for graph processing. This is the same framework used by Facebook, Google, and other social media analytics operations to derive business value from vast amounts of interconnected data points.

Graphs arise in a wealth of data scenarios and describe the connections that form naturally in both the digital and real worlds. Such connections abound in online social networks such as Facebook and Twitter and among users who rate movies on services like Netflix and Amazon Prime, and they are useful even in biological networks studied in scientific research. Whether in business or science, viewing data as connected adds value by increasing the amount of information that can be drawn from that data and put to use in generating new revenue or scientific opportunities.

Apache Giraph offers a simple yet flexible programming model targeted to graph algorithms and designed to scale easily to accommodate massive amounts of data. Originally developed at Yahoo!, Giraph is now a top-level project at the Apache Software Foundation, and it enlists contributors from companies such as Facebook, LinkedIn, and Twitter. Practical Graph Analytics with Apache Giraph shows you how to harness the power of graph processing for your own data by building sophisticated graph analytics applications with the very same framework relied upon by some of the largest players in the industry today.

About the Author

Roman Shaposhnik is a vice-president and one of the lead developers of Apache Bigtop, a 100% open source and community-driven big data management distribution built on top of Apache Hadoop. He has been working on making Hadoop ecosystem components more accessible and easier to use, and he has contributed to a wide array of Apache projects from Avro to ZooKeeper. In addition to his day job building Data Fabric APIs at Pivotal Inc., Roman currently serves as a vice-president of the Apache Incubator, helping exciting new open source projects join the Apache family.

In this Book

  • Introducing Giraph
  • Modeling Graph Processing Use Cases
  • The Giraph Programming Model
  • Giraph Algorithmic Building Blocks
  • Working with Giraph
  • Giraph Architecture
  • Graph IO Formats
  • Beyond the Basic API
  • Exposing Parallelism in Giraph
  • Advanced IO
  • Tuning Giraph
  • Giraph in the Cloud

