High Performance Spark is published by O'Reilly Media in June 2016. This book has 175 pages in English, ISBN-13 978-1491943205.
If you’ve successfully used Apache Spark to solve medium sized-problems, but still struggle to realize the “Spark promise” of unparalleled performance on big data, this book is for you. High Performance Spark shows you how take advantage of Spark at scale, so you can grow beyond the novice-level. It’s ideal for software engineers, data engineers, developers, and system administrators working with large-scale data applications.
- Learn how to make Spark jobs run faster
- Productionize exploratory data science with Spark
- Handle even larger data sets with Spark
- Reduce pipeline running times for faster insights