Syllabus List
- Basics of Functional Programming and Scala
- Data Replication Topology
- Spark Core Processing RDD
- Spark SQL - Processing DataFrames
- Apache Hive
- Characteristics of HDFS
- Data Injection into Big data systems And ETL
- Hadoop Architecture And YARN
- Hadoop Architecture
- Distributed Storage (HDFS) and YARN
- HDFS Architecture And Components
- HDFS Components File system Namespace
- High availability cluster implementation
- Introduction to Big data and Hadoop
- Regular file system vs HDFS
- Spark GraphX
- Stream Processing Frameworks and Spark Streaming
- What is HDFS