Skip to main content
Data Science

Data Science Essentials

Machine learning, predictive modeling and storytelling, with a capstone

150h total6 courses3 stages
Start this roadmap free

What you'll be able to do

  • Run an end-to-end data-science workflow
  • Apply statistics and hypothesis testing
  • Build and evaluate predictive models
  • Communicate results with compelling visualizations

Before you start

  • Python fundamentals
  • Basic statistics and algebra
  • Comfort working with data in spreadsheets or code

Phase 1 · Foundations

Python & Statistics for Data Science

beginner24h

NumPy/Pandas plus the inferential statistics (hypothesis tests, confidence intervals) that ML rests on.

  • Run a t-test and interpret it
  • Bootstrap a confidence interval
  • EDA with summary statistics

Data Wrangling & Feature Engineering

intermediate18h

Reshaping, joining, encoding, scaling, and building reproducible preprocessing pipelines.

  • ColumnTransformer pipeline
  • Target vs. one-hot encoding
  • Leakage-free train/test prep

Phase 2 · Machine Learning

Supervised Learning

intermediate28h

Regression, classification, trees, ensembles, and rigorous model evaluation.

  • Cross-validation + tuning
  • Gradient boosting (XGBoost) model
  • Top 20% on a Kaggle competition

Unsupervised Learning & Intro to Deep Learning

advanced22h

Clustering, dimensionality reduction, and a first neural network with Keras.

  • K-means + silhouette score
  • PCA for visualization
  • Train a small neural net

Phase 3 · Big Data, Storytelling & Capstone

Big Data Tools & Storytelling

advanced16h

Working at scale with Spark basics, plus communicating results that drive decisions.

  • PySpark DataFrame transforms
  • Narrative deck from an analysis
  • Tailor a result for executives

Capstone: Predictive Modeling Project

advanced24h

A complete project: problem framing, data, model, evaluation, and a written report.

  • Baseline + improved model
  • Error analysis writeup
  • Publish notebook + README

Frequently asked

Is the Data Science Essentials roadmap free?+

Yes. The entire Data Science Essentials roadmap and every curated resource is free to follow on Commit. You can track your progress, keep a daily streak, and earn a shareable certificate at no cost — there is no paywall.

How long does the Data Science Essentials roadmap take to complete?+

About 150 hours of focused study across 6 courses and 3 stages. At roughly one hour a day that is about 5 months; you can move faster by studying more each day.

Do I get a certificate for finishing the Data Science Essentials roadmap?+

Yes. When you complete the roadmap on Commit you receive a verifiable certificate of completion that you can add to LinkedIn and your public Commit profile as proof of what you finished.

Make it stick

Copy this roadmap into Commit and turn it into a tracked program with a streak graph, study logging, and a shareable certificate when you finish. Free forever.

Start Data Science Essentials free