Data Engineering

Airbnb Data

Data quality culture

Focuses on data quality and building a data-driven culture; written by the team that created Airflow.

Analytics Engineering Roundup

Modern Data Stack center

Defines the Analytics Engineer role, applying software engineering rigor to SQL; the cultural center of the Modern Data Stack.

Andreas Kretz

Data plumbing focus

Focuses on the plumbing (Kafka, Hadoop, Spark) and moving data reliably.

Benn Stancil

Data industry strategy

Strategic thinking on data analysis and the data industry.

Blef.fr Data News

Independent data analysis

Independent data news and analysis.

Chartr

Visual data storytelling

Visual data storytelling and charts.

Confessions of a Data Guy

Vendor hype critique

Brutally honest critique of vendor hype, advocating for boring, reliable technology.

Confluent Blog

Kafka & data in motion

The home of Kafka, explaining the shift from Data at Rest to Data in Motion.

Darshil Parmar

E2E portfolio projects

Creates large end-to-end portfolio projects that build complete pipelines.

Data Elixir

Data science news & trends

Data science news, trends, and resources.

Data Engineering Podcast

Tool creator interviews

Deep dives with tool creators exploring the why behind design decisions.

Data Engineering Weekly

Data systems heavy lifting

Covers the heavy lifting of data systems: moving terabytes, managing Kafka streams, and optimizing Snowflake costs.

Data School

Pandas authority

Authority on Python's data stack, particularly Pandas for transformations.

Databricks Blog

Lakehouse paradigm

Details how the Lakehouse approach unifies streaming and batch data and governs data for AI.

Databricks YT

Lakehouse paradigm

Essential for understanding the Lakehouse paradigm.

EcZachly (DataExpert.io)

Technical DE deep dives

Zach Wilson's technical deep dives into data engineering.

Geralt of Rivia (DE)

Airflow & Spark deep dives

Community favorite for deep technical tutorials on Airflow and Spark.

Joe Reis Substack

DE fundamentals insights

Rants and insights on data engineering fundamentals.

Karolina Sowinska

Airflow operations

Focuses on the operational side, with tutorials on Airflow and containers.

Ken Jee

Data project ideation

Focuses on career and project ideation, often involving significant data engineering.

Locally Optimistic

Data team organization

Focuses on the organization of data teams, Analytics Engineering roles, and career ladders.

Metadata Weekly

Data catalogs & governance

Data catalogs, governance, and metadata management.

Modern Data Stack

Data observability focus

Focuses on observability and reliability in data stacks.

Pedram's Data Based

Data career advice

Data career advice and tech insights.

Seattle Data Guy YT

MDS business value

Evaluates the Modern Data Stack based on business value and ROI.

SeattleDataGuy's Newsletter

Data tool selection focus

Focuses on value delivery and tool selection (e.g., Airflow vs. Prefect) for the Data Consultant.

Shashank Mishra

DE entry roadmap

Offers a structured roadmap for entering the field and necessary skills.

Snowflake Blog

Data sharing & governance

Focuses on data sharing, governance, and running Python/AI directly inside the warehouse.

Snowflake YT

Platform deep dives

Provides deep dives into Snowflake's platform capabilities and storage layer.

The Seattle Data Guy

MDS critique & ROI

A consultant's view that critiques the Modern Data Stack and focuses on data ROI.

Zach Wilson

Petabyte-scale reality

Discusses the reality of petabyte-scale data and actual engineering needs.