Data Engineering
Airbnb Data
Data quality culture
Focuses on data quality and building a data-driven culture; written by the team that created Airflow.
Analytics Engineering Roundup
Modern Data Stack center
Defines the Analytics Engineer role, applying software engineering rigor to SQL; the cultural center of the Modern Data Stack.
Andreas Kretz
Data plumbing focus
Focuses on the plumbing (Kafka, Hadoop, Spark) and moving data reliably.
Benn Stancil
Data industry strategy
Strategic thinking on data analysis and the data industry.
Blef.fr Data News
Independent data analysis
Independent data news and analysis.
Chartr
Visual data storytelling
Visual data storytelling and charts.
Confessions of a Data Guy
Vendor hype critique
Brutally honest critique of vendor hype, advocating for boring, reliable technology.
Confluent Blog
Kafka & data in motion
The home of Kafka, explaining the shift from Data at Rest to Data in Motion.
Darshil Parmar
E2E portfolio projects
Creates large end-to-end portfolio projects that build complete pipelines.
Data Elixir
Data science news & trends
Data science news, trends, and resources.
Data Engineering Podcast
Tool creator interviews
Deep dives with tool creators exploring the why behind design decisions.
Data Engineering Weekly
Data systems heavy lifting
Covers the heavy lifting of data systems: moving terabytes, managing Kafka streams, and optimizing Snowflake costs.
Data School
Pandas authority
Authority on Python's data stack, particularly Pandas for transformations.
Databricks Blog
Lakehouse paradigm
The leading voice on the Lakehouse, detailing how to unify streaming and batch data and govern data for AI.
Databricks YT
Lakehouse paradigm
Essential for understanding the Lakehouse paradigm.
EcZachly (DataExpert.io)
Technical DE deep dives
Zach Wilson's technical deep dives into data engineering.
Geralt of Rivia (DE)
Airflow & Spark deep dives
Community favorite for deep technical tutorials on Airflow and Spark.
Joe Reis Substack
DE fundamentals insights
Rants and insights on data engineering fundamentals.
Karolina Sowinska
Airflow operations
Focuses on the operational side, with tutorials on Airflow and containers.
Ken Jee
Data project ideation
Focuses on careers and project ideation; the projects often involve significant data engineering.
Locally Optimistic
Data team organization
Focuses on the organization of data teams, Analytics Engineering roles, and career ladders.
Metadata Weekly
Data catalogs & governance
Data catalogs, governance, and metadata management.
Modern Data Stack
Data observability focus
Focuses on observability and reliability in data stacks.
Pedram's Data Based
Data career advice
Data career advice and tech insights.
Seattle Data Guy YT
MDS business value
Evaluates the Modern Data Stack based on business value and ROI.
SeattleDataGuy's Newsletter
Data tool selection focus
Focuses on value delivery and tool selection (e.g., Airflow vs. Prefect) from a data consultant's perspective.
Shashank Mishra
DE entry roadmap
Offers a structured roadmap for entering the field and the skills it requires.
Snowflake Blog
Data sharing & governance
Focuses on data sharing, governance, and running Python/AI directly inside the warehouse.
Snowflake YT
Platform deep dives
Provides deep dives into Snowflake's platform capabilities and storage layer.
The Seattle Data Guy
MDS critique & ROI
A consultant's view that critiques the Modern Data Stack and focuses on data ROI.
Zach Wilson
Petabyte-scale reality
Discusses the reality of petabyte-scale data and actual engineering needs.