Spark with HDInsight - Enterprise Ready Machine Learning and Interactive Data Analysis at Scale

Description

Apache Spark is a fast and general-purpose cluster computing system providing unifide solution on Batch processing, Streaming and Machine Learning.  It provides high-level APIs in Java, Scala, Python and R, and an optimized engine that supports general execution graphs.  The Azure HDInsight is a Hadoop implementation on Azure Cloud which also provides Apache Spark instance.  This course is intended to get insight of architecture and functioning of Apache Spark, eco system of Azure Cloud and HDInsight for Apache Spark to harness power from to augment capabilities multifold.  Understanding all these things certainly not possible without developing spark applications in real life scenarios.

Course Duration: 3 Day(s)

Target Audience

Data Scientist

Personas

Data Engineer & Data Analyst