Saturday, August 5, 2017

Taming Big Data with Apache Spark and Python - Hands On!

Unknown August 05, 2017 Apache Spark, Bestselling, GraphX, Python, Spark SQL

Dive right in with 15+ hands-on examples of analyzing large data sets with Apache Spark, on your desktop or on Hadoop!

What Will I Learn from the course "Taming Big Data with Apache Spark and Python - Hands On!"?

Frame big data analysis problems as Spark problems
Use Amazon's Elastic MapReduce service to run your job on a cluster with Hadoop YARN
Install and run Apache Spark on a desktop computer or on a cluster
Use Spark's Resilient Distributed Datasets to process and analyze large data sets across many CPUs
Implement iterative algorithms such as breadth-first search using Spark
Use the MLlib machine learning library to answer common data mining questions
Understand how Spark SQL lets you work with structured data
Understand how Spark Streaming lets you process continuous streams of data in real time
Tune and troubleshoot large jobs running on a cluster
Share information between nodes on a Spark cluster using broadcast variables and accumulators
Understand how the GraphX library helps with network analysis problems

Includes:

5 hours of on-demand video
2 supplemental resources
Full lifetime access
Access on mobile and TV
Assignments
Certificate of Completion

