📚 Free Learning Resources

Data Analyst
Learning Hub

Master Python and SQL from scratch — structured roadmaps, curated resources, and real projects to land your first data analyst role.

START HERE

Your Learning Roadmap

1

Python Basics (Week 1–2)

Variables, loops, functions, lists, dictionaries — the foundation everything else builds on.

2

SQL Fundamentals (Week 3–4)

SELECT, WHERE, GROUP BY, JOINs — query real databases and understand relational data.

3

Pandas & Data Wrangling (Week 5–6)

Load CSVs, clean messy data, filter, merge, reshape — 80% of real analyst work.

4

Advanced SQL (Week 7–8)

Window functions, CTEs, subqueries, performance optimisation — what separates juniors from seniors.

5

Data Visualisation (Week 9–10)

Matplotlib, Seaborn, Plotly — turn numbers into charts that tell a story.

6

Capstone Projects + Portfolio (Week 11–12)

Build 2–3 end-to-end projects, push to GitHub, and start applying.

PYTHON FOR DATA

Python — What to Learn

PYTHON BASICS

🐍 Core Python

Everything you need before touching data science libraries.

  • Variables, data types, type casting
  • if/else, for loops, while loops
  • Functions, *args, **kwargs
  • Lists, tuples, sets, dictionaries
  • List comprehensions
  • File I/O — read/write CSV, JSON
  • Error handling — try/except
  • OOP basics — classes, __init__, methods
PANDAS

🐼 Pandas — The Analyst's Bible

You will use Pandas every single day as a data analyst.

  • pd.read_csv(), pd.read_excel()
  • df.head(), df.info(), df.describe()
  • Filtering: df[df['col'] > value]
  • groupby() + agg() — like SQL GROUP BY
  • merge() and join() — like SQL JOINs
  • Handling nulls: fillna(), dropna()
  • apply() with lambda functions
  • pivot_table() for summary reports
  • df.to_excel() — export clean reports
NUMPY

🔢 NumPy — Fast Math

The engine under Pandas. Learn the basics — don't over-invest here early.

  • np.array(), np.zeros(), np.ones()
  • Array slicing and indexing
  • np.mean(), np.median(), np.std()
  • Broadcasting rules
  • np.where() — conditional logic on arrays
VISUALISATION

📊 Data Visualisation

Numbers don't convince leadership — charts do. Learn to build both.

  • Matplotlib: line, bar, scatter, histogram
  • Seaborn: heatmaps, boxplots, pairplots
  • Plotly: interactive charts for dashboards
  • Subplots — multiple charts one figure
  • Styling: colors, labels, titles, grid
  • Saving: plt.savefig() for reports
SQL FOR DATA

SQL — From Zero to Senior

SQL BASICS

🗃️ Fundamentals

The queries every analyst must know cold.

  • SELECT, FROM, WHERE, ORDER BY, LIMIT
  • AND, OR, NOT, IN, BETWEEN, LIKE
  • GROUP BY + HAVING
  • COUNT(), SUM(), AVG(), MIN(), MAX()
  • DISTINCT — remove duplicates
  • CASE WHEN — conditional columns
  • NULL handling: IS NULL, COALESCE(), NULLIF()
  • Date functions: DATE_DIFF, DATE_TRUNC, EXTRACT
JOINS

🔗 JOINs — Master These

70% of interview SQL questions involve JOINs. Understand each type cold.

  • INNER JOIN — only matching rows
  • LEFT JOIN — all left, matched right
  • RIGHT JOIN — all right, matched left
  • FULL OUTER JOIN — everything
  • SELF JOIN — table joined to itself
  • CROSS JOIN — every combination
  • Multi-table JOINs (3+ tables)
ADVANCED SQL

🚀 Window Functions

What separates a junior analyst from a senior. Non-negotiable for top companies.

  • ROW_NUMBER() — unique rank per row
  • RANK() and DENSE_RANK()
  • LAG() and LEAD() — compare with prev/next row
  • SUM() OVER PARTITION BY — running totals
  • FIRST_VALUE(), LAST_VALUE()
  • NTILE() — percentile buckets
  • PARTITION BY vs ORDER BY in windows
ADVANCED SQL

📐 CTEs & Subqueries

Write clean, readable, multi-step queries that don't break under complexity.

  • WITH cte AS (...) — Common Table Expressions
  • Chained CTEs for multi-step logic
  • Subqueries in WHERE, SELECT, FROM
  • Correlated subqueries
  • EXISTS vs IN — when to use each
  • Recursive CTEs for hierarchical data
PRACTICE PROJECTS

Build These Projects

PYTHON PROJECT

📦 E-Commerce Sales Analysis

Load a sales CSV, clean nulls, calculate monthly revenue, find top products, visualise trends. Classic analyst task.

SQL PROJECT

🏪 Retail Chain Dashboard

Query a retail DB: find top-selling stores, month-over-month growth, customer cohort retention using window functions.

PYTHON PROJECT

📉 Churn Analysis

Identify which customers are about to leave using historical behaviour data. Real-world problem every company faces.

SQL PROJECT

🚗 Ride-Sharing Trip Analysis

Analyse trips, driver performance, surge pricing patterns. Uses CTEs, JOINs, window functions — all in one project.

PYTHON PROJECT

💰 Personal Finance Tracker

Parse bank statement PDFs, categorise spending with regex, build monthly budget report. Useful + impressive.

SQL PROJECT

📱 App User Funnel

Track users from signup → onboarding → purchase → retention. Classic product analytics funnel every startup needs.

FREE RESOURCES

Best Free Resources