Skip to product information
1 of 1
"This book introduces you to the world of building data-processing applications with the wide variety of tools supported Hadoop 2. Starting with the core components of the frameworkHDFS and YARNthis book will guide you through how to build applications using a variety of approaches.You will learn how YARN completely changes the relationship between MapReduce and Hadoop and allows the latter to support more varied processing approaches and a broader array of applications. These include real-time processing with Apache Samza and iterative computation with Apache Spark. Next up, we discuss Apache Pig and the dataflow data model it provides. You will discover how to use Pig to analyze a Twitter dataset."About the Author
View full details