Advanced Apache Spark Training - Sameer Farooqui (Databricks)

Spark Summit
Spark Summit
344.7 هزار بار بازدید - 9 سال پیش - Live Big Data Training from
Live Big Data Training from Spark Summit 2015 in New York City.

"Today I'll cover Spark core in depth and get you prepared to use Spark in your own prototypes. We'll start by learning about the big data ecosystem, then jump into RDDs (Resilient Distributed Datasets). Then we'll talk about integrating Spark with resource managers like YARN and Standalone mode. After a peek into some Spark Internals, we touch base upon Accumulators and Broadcast Variables. Finally, we end with Spark Streaming and a technical explanation of how the 100 TB sort competition was won in 2014." - Sameer

Slides:
https://spark-summit.org/wp-content/u...


Want to learn more about Spark?

Check out my new class, "Exploring Wikipedia with Apache Spark", recorded June 2016:
"Exploring Wikipedia With Apache Spar...


// About the Presenter //
Sameer Farooqui is a Technology Evangelist at Databricks where he helps promote the adoption of Apache Spark. As a founding member of the training team, he created and taught advanced Spark classes at private clients, meetups and conferences globally.

Follow Sameer on -
Twitter: Twitter: blueplastic
LinkedIn: LinkedIn: blueplastic
9 سال پیش در تاریخ 1394/01/21 منتشر شده است.
344,797 بـار بازدید شده
... بیشتر