Free · documentation-driven · interactive
Learn data engineering, slowly.
Long-form study paths that walk through the documentation, paired with interactive visualizations of what is actually happening inside the system. No videos, no bootcamps, no hype — just careful reading and the kind of widgets that make abstract internals feel concrete.
Available now
Mastering Apache Spark
Four weeks of long-form, source-grounded study material on how Apache Spark actually works. Cluster mode, RDDs, shuffles, partitioning, persistence, and shared variables — each section paired with interactive visualizations you can manipulate to see Spark internals in action.
4 weeks · ~5 hours · 18 widgets · Quiz-gated
Start the course →
Coming next
Microsoft Fabric · DataFrames in depth · Databricks performance
Three more study paths are in the works. Subscribe with your email when you start the Spark course and we'll send you a note when they go live.
In progress