Amazon EMR enables fast processing of large structured or unstructured datasets. In this recorded webinar, we show you how to set up an Amazon EMR job flow to analyse application logs and run Hive queries against them. We also review best practices for organising data files on Amazon Simple Storage Service (S3), how to start clusters from the AWS web console and the command line, and how to monitor the status of a MapReduce job.

Finally, we take a look at the Hadoop ecosystem tools you can use with Amazon EMR and at the additional features of the service.

View and download the slides from this webinar on Slideshare here:

Check out the rest of the Masterclass webinars for 2015 here:

See the Journey Through the Cloud webinar series here: