CSC-343 Course Notes
- Course Introduction (with Docker examples)
- Linux Tutorial
- Introduction to Hadoop (Available on Blackboard)
- Hadoop Architecture (Available on Blackboard)
- HDFS
- Python Basics
- MapReduce
- Relational Databases
- Sqoop (Available on Blackboard)
- Introduction to Impala and Hive (Available on Blackboard)
- Modeling Data with Impala and Hive (Available on Blackboard)
- Introduction to Spark (Available on Blackboard)
- RDDs in Spark (Available on Blackboard)
- Working with RDDs in Spark (Available on Blackboard)
- Aggregating Data with Pair RDDs (Available on Blackboard)
- Spark Parallel Processing (Available on Blackboard)
- Spark RDD Persistence (Available on Blackboard)
- Spark Algorithms (Available on Blackboard)
- Spark SQL Example
- Wrap Up