The main focus of this session is exploring a set of functionalities for processing large datasets without having to load them all at once in the memory. MS R offers a rich set of distributed statistical and machine learning algorithms, which get added to over time. Finally, MS R also offers a mechanism by which we can take code that we developed on our laptop and deploy it on a remote server such as SQL Server or Spark (where the infrastructure is very different under the hood), with minimal effort.
Developers
Data Engineer & Data Analyst