Webinar "Introduction to Apache Spark on SHARCNET" - 12:00pm
Date | Wednesday June 08 2016 |
---|---|
Time | 12:00 - 13:00 |
Location | Online |
Contact | syam@sharcnet.ca |
URL | http://vidyo.computecanada.ca/flex.html?roomdirect.html&key=Pr1GiEI51kFi |
You are NOT registered for this event. Register. | |
Topic: “Introduction to Apache Spark on SHARCNET” Speaker: Jose Nandez, SHARCNET Apache Spark is a general purpose light cluster computer platform. Spark extends the MapReduce model to support more computations. Spark is accessible through Python, Scala, Java, R or SQL. Spark can run on Hadoop clusters or in a standalone mode, and access any Hadoop data, including databases from Cassandra, Hive or Hbase. It has been recently paired with MongoDB. In this talk, I will discuss Spark’s main data structure and commands. Some real examples where Spark can be used will be presented. I will also show how to load Spark module on SHARCNET clusters, how to submit a Spark script to the SHARCNET scheduler, and how to use SHARCNET resources for developing your Spark program using Python. Need help attending a webinar? See the SHARCNET Help Wiki. |