r/dataengineering is a subreddit with 464k members. The most common kinds of discussion are advice requests and solution requests. Frequent topics include data engineering, "looking for" requests, tools, struggles, and Databricks, and the most frequently recommended or reviewed product categories are ETL tools, BI tools, and automation tools.
News & discussion on Data Engineering topics, including but not limited to: data pipelines, databases, data formats, storage, data modeling, data governance, cleansing, NoSQL, distributed systems, streaming, batch, Big Data, and workflow engines.
Popular Themes in r/dataengineering
#1
Advice Requests
: "Less than two years ago I wrote my first line of code. This month I will be interning as a Data Engineer at a big tech company"
15 posts
#2
Solution Requests
: "What kind of ETL pipeline would be helpful when the incoming file is an excel and the structure keeps changing and every piece of info is important and needs to be loaded into the Db?"
7 posts
#3
Pain & Anger
: "My boss is having us use AI way too much"
3 posts
#4
Ideas
: "Need help with ideas for Master’s Capstone Project"
2 posts
Popular Topics in r/dataengineering
#1
Data Engineering
: "Should I switch from Windows to Linux for Data Engineering? Which Distro is best"
63 posts
#2
Looking For
43 posts
#3
Tool
11 posts
#4
Struggling
8 posts
#5
Databricks
: "Databricks vs Snowflake vs Azure/GCP/AWS products"
7 posts
#6
SQL
: "QueryFlux: Multi-engine Sql query router in Rust—with routing, queuing, and Sqlglot dialect translation"
6 posts
#7
dbt
: "SQLBuild - Skip Unnecessary Rebuilds for Your Existing Dbt Project, Free & OSS (No Per-Skip Bill)"
4 posts
#8
Looking For A Job
3 posts
#9
Frustrating Experiences
3 posts
#10
Pipeline
: "My first Pipeline - Thanks to you all"
3 posts
Products Discussed in r/dataengineering
ETL Tool
282 reviews
#1
Fivetran
3.9★ from 20 reviews
#2
Airbyte
4.1★ from 16 reviews
#3
Talend
3.3★ from 15 reviews
BI Tool
12 reviews
#1
Tableau
4.5★ from 4 reviews
#2
Microsoft
4.0★ from 3 reviews
#3
Looker Studio
4.0★ from 1 review
Automation Tools
9 reviews
#1
Power Automate
1.5★ from 2 reviews
#2
Airbyte
4.0★ from 1 review
#3
Telescope AI
4.0★ from 1 review
Flair Used in r/dataengineering
#1
Discussion
: "I feel like I don't know anything. And I am nothing without Claude"
59 posts
#2
Help
: "Moving away from databricks to OLTP"
52 posts
#3
Career
: "Boss keeps throwing me under the bus for using python. Is python a no-go in this sector?"
32 posts
#4
Open Source
: "dbt Core v2 is here: still open source, now rebuilt for what's next"
19 posts
#5
Blog
: "101 concepts every data engineer should know (or some of them :)"
14 posts
#6
Personal Project Showcase
: "Trying to solve the Airflow schedule pain"
10 posts
#7
Meme
: "showed leadership our architecture diagram. forgot to take the last box out."
5 posts
#8
Rant
: "Vibe coded dashboard failing on a Friday"
4 posts
Member Growth in r/dataengineering
Yearly
+105k members (29.3%)
Similar Subreddits to r/dataengineering
r/Backend
63k members
189.6% / yr
r/chemistry
3.9M members
0.9% / yr
r/cybersecurity
1.5M members
19.9% / yr
r/dataanalysis
221k members
29.6% / yr
r/dataengineer
3k members
126.7% / yr
r/DataEngineeringPH
5k members
131.8% / yr
r/DMAcademy
702k members
5.8% / yr
r/FilmIndustryLA
84k members
15.6% / yr
r/musicians
199k members
67.6% / yr
r/naturalbodybuilding
452k members
8.1% / yr
About
GummySearch helps people research Reddit communities by organizing activity, growth, themes, and post-level signals into one place.
This page gives a focused view of r/dataengineering, including current member size, discussion patterns, product reviews, and related communities to explore.
This data is synced periodically so insights stay current and useful for ongoing research.
Last updated: July 1, 2026