Home > Course

Big Data Analytics Beginner



Course Information

Level:Beginner
Modules:4
Duration:1 Month
Category:Big Data Analytics
Language:English
Certificate:Yes

Course Overview

Course Description:Understand data fundamentals, key tools like Excel & SQL, and explore data collection, cleaning, and visualization techniques.

Topics Covered:

  • Fundamentals of big data and its characteristics
  • Introduction to big data technologies (Hadoop, Spark)
  • Basic data storage and processing techniques
  • Overview of data ingestion and ETL processes
  • Simple data analysis and visualization

Syllabus

Module 1: Introduction to Big Data & Analytics

  • What is Big Data? 5 Vs (Volume, Velocity, Variety, Veracity, Value)
  • Big Data Ecosystem & Career Scope
  • Traditional Databases vs Big Data Systems

LAB 1

  • Explore Kaggle Datasets
  • Perform Exploratory Data Analysis using Python (Pandas + Matplotlib)

Module 2: Data Collection & Ingestion

  • Data Sources (Logs, IoT, Social Media, Sensors)
  • Batch vs Streaming Data Ingestion
  • ETL Concepts (Extract, Transform, Load)

LAB 2

  • Use Python Requests + APIs to Collect Data
  • Ingest Streaming Data using Kafka Local Setup

Module 3: Hadoop Ecosystem Basics

  • HDFS (Hadoop Distributed File System)
  • MapReduce Programming Model
  • Hadoop Ecosystem (Hive, Pig, HBase, Sqoop)

LAB 3

  • Install Hadoop Single-Node Cluster (Local VM or Docker)
  • Run HDFS File Operations (Upload, List, Read, Delete)
  • Execute a Simple MapReduce Word Count Job

Module 4: Data Warehousing & Hive

  • Introduction to Hive & HQL (Hive Query Language)
  • Data Warehousing Concepts
  • Partitioning & Bucketing

LAB 4

  • Install Apache Hive
  • Run SQL Queries on Hive Tables (SELECT, GROUP BY, JOIN)

Learning Outcome

Understand core big data concepts, use basic big data technologies, manage data storage and processing, and perform introductory data analysis and visualization.



Related Courses

Big Data Analysis Level 1
Big Data Analytics Level 1

Focuses on data mining, Spark framework, and analytical techniques. Students process large-scale datasets to extract meaningful insights.

Big Data Analysis Level 2
Big Data Analytics Level 2

Covers advanced analytics, data pipelines, and machine learning integration. Learners design real-time big data solutions for business intelligence.

Data Science Beginner
Data Science Beginner

Introduces data collection, cleaning, and visualization techniques. Learners gain an overview of how data drives decisions in various industries.