Spark : Big Data Cluster Computing in Production - Ilya Ganelin

By: Ilya Ganelin, Ema Orhian, Kai Sasaki, Brennon York

Paperback | 11 March 2016 | Edition Number 1

Production-targeted Spark guidance with real-world use cases

Spark: Big Data Cluster Computing in Production goes beyond general Spark overviews to provide targeted guidance on running lightning-fast big data cluster computing in production. Written by an expert team well known in the big data community, this book walks you through the challenges in moving from proof-of-concept or demo Spark applications to live Spark in production. Real use cases provide deep insight into common problems, limitations, challenges, and opportunities, while expert tips and tricks help you get the most out of Spark performance. Coverage includes Spark SQL, Tachyon, Kerberos, MLlib, YARN, and Mesos, with clear, actionable guidance on resource scheduling, database connectors, streaming, security, and much more.

Spark has become the tool of choice for many big data problems, with more active contributors than any other Apache Software Foundation project. General introductory books abound, but this book is the first to provide deep insight and real-world advice on using Spark in production. Specific guidance, expert tips, and invaluable foresight make this guide an incredibly useful resource for real production settings.

  • Review Spark hardware requirements and estimate cluster size
  • Gain insight from real-world production use cases
  • Tighten security, schedule resources, and fine-tune performance
  • Overcome common problems encountered using Spark in production

Spark works with other big data tools, including MapReduce and Hadoop, and supports languages you already know, such as Java, Scala, Python, and R. Lightning speed makes Spark too good to pass up, but understanding its limitations and challenges in advance goes a long way toward easing actual production implementation. Spark: Big Data Cluster Computing in Production tells you everything you need to know, with real-world production insight and expert guidance, tips, and tricks.
