Data Analytics with HPC
Dates: 29-30 June 2017
Location: University of Portsmouth
Please note: these materials are still in draft form and may be subject to change before the course begins, but they will give you an idea of the content to be covered.
Lecture Slides
Unless otherwise indicated all material is Copyright © EPCC, The University of Edinburgh, and is only made available for private study.
Day 1
- 09:00 – 09:30 Arrival/set-up/Welcome
- 09:30 – 10:30 What are data analytics, big data, data science
- 10:30 – 11:00 COFFEE
- 11:00 – 12:00 Data Cleaning
- 12:00 – 13:00 Practical: Data Cleaning
- 13:00 – 14:00 LUNCH
- 14:00 – 14:45 Supervised Learning, feature selection, trees, forests
- 14:45 – 15:30 Naïve Bayes
- 15:30 – 16:00 COFFEE
- 16:00 – 17:00 Naïve Bayes Practical
- 17:00 CLOSE OF DAY
Day 2
- 09:00 – 10:30 MapReduce / Hadoop
- 10:30 – 11:00 COFFEE
- 11:00 – 11:30 Hadoop walkthrough
- 11:30 – 12:30 Unsupervised learning
- 12:30 – 13:30 LUNCH
- 13:30 – 14:15 Spark
- 14:15 – 15:00 Data streaming
- 15:00 – 15:30 COFFEE
- 15:30 – 16:00 Spark, Data streaming demonstrations
- 16:00 – CLOSE OF COURSE
Exercise Material
Unless otherwise indicated all material is Copyright © EPCC, The University of Edinburgh, and is only made available for private study.Data cleaning materials Naïve Bayes materials Hadoop materials Spark demo materials