Spark Operations Cookbook, 1st Edition
The Apache Spark cluster computing system aims to make data analytics fast: both fast to run and fast to write. But as powerful and useful as Spark is for distributed systems, many issues can occur during implementation. This practical cookbook contains recipes that solve the most common problems Spark users face. Author Neelesh Srinivas Salian, a customer operations engineer at Cloudera, has seen everything that can go wrong in the code for Spark applications. Data engineers, system administrators, and architects will learn recipes for debugging common and unexpected problems that occur during key phases of Spark implementation in large distributed system environments. From setting up your cluster to running your first application, submitting to a cluster, understanding storage needs, and handling security and monitoring metrics, this book is your guide to facing any Spark operations issue.
Learn an approach to debugging Spark from the perspective of improving business logic implementation
Understand the nuances of Spark's components, including Spark Core, Spark Streaming, Spark SQL, and MLlib
Get an entire chapter devoted to Spark security, an emerging and vital topic
Edition: 1st edition
Published: 2017
ISBN: 9781491971581
Publisher: O'Reilly Media
Format: Paperback
Language: English
Pages: 200