r/dataengineering is a subreddit with 459k members. The most common kinds of discussions are advice requests and solution requests, and the community frequently discusses data engineering, looking for, struggling, struggling with, and career, and they frequently recommend/review etl tool, etl tools, and database solution.
News & discussion on Data Engineering topics, including but not limited to: data pipelines, databases, data formats, storage, data modeling, data governance, cleansing, NoSQL, distributed systems, streaming, batch, Big Data, and workflow engines.
Popular Themes in r/dataengineering
#1
Advice Requests
: "Which Udemy course is good for Python for Data Engineering?"
11 posts
#2
Solution Requests
: "Experience with Dataiku, Knime or Alteryx? Which one is better?"
8 posts
#3
Pain & Anger
: "dagster price increase 10x insane , don't ever use them"
7 posts
#4
Opportunities
: "Would you risk vendor lock in for your career? Is it worth it to become take a Pentaho developer job for $130k?"
2 posts
Popular Topics in r/dataengineering
#1
Data Engineering
: "Data Engineering is boring!"
98 posts
#2
Looking For
43 posts
#3
Struggling
: "Fresh Data Analyst Struggling with building a working data pipeline from ground up"
35 posts
#4
Struggling With
18 posts
#5
Career
: "Would you risk vendor lock in for your Career? Is it worth it to become take a Pentaho developer job for $130k?"
16 posts
#6
Job
13 posts
#7
Tool
11 posts
#8
Airflow
: "Dagster vs Airflow? What do we use?"
11 posts
#9
Sql
: "SqlMesh orchestration"
9 posts
#10
Ai
: "Getting Salesforce data ready for Ai analytics?"
8 posts
Products Discussed in r/dataengineering
Etl Tool
282 reviews
#1
Fivetran
3.9★ from 20 reviews
#2
Airbyte
4.1★ from 16 reviews
#3
Talend
3.3★ from 15 reviews
Etl Tools
104 reviews
#1
Airbyte
4.4★ from 10 reviews
#2
Apache Airflow
4.7★ from 10 reviews
#3
dbt
4.8★ from 5 reviews
Database Solution
14 reviews
#1
Postgres
4.0★ from 2 reviews
#2
ClickHouse
4.5★ from 2 reviews
#3
Oracle
5.0★ from 1 review
Flair Used in r/dataengineering
#1
Discussion
: "Twin brothers wipe 96 gov’t databases minutes after being fired"
68 posts
#2
Help
: "How did you guys learn CI/CD and IaC?"
40 posts
#3
Career
: "Boss keeps throwing me under the bus for using python. Is python a no-go in this sector?"
33 posts
#4
Blog
: "101 concepts every data engineer should know (or some of them :)"
21 posts
#5
Personal Project Showcase
: "Pyspark cheat sheet"
12 posts
#6
Open Source
: "dbt Core v2 is here: still open source, now rebuilt for what's next"
9 posts
#7
Rant
: "Maybe I am not cut out to be a DE"
7 posts
#8
Meme
: "when someone asks you what programming language they should learn, don't simply answer the one you prefer"
4 posts
#9
Meta
: "Meta post: Promotion and AI generated text clarifications"
1 post
Member Growth in r/dataengineering
Yearly
+115k members(33.4%)
Similar Subreddits to r/dataengineering
r/cybersecurity
1.5M members
19.7% / yr
r/data
52k members
16.5% / yr
r/dataanalysis
218k members
30.2% / yr
r/deeplearning
238k members
21.6% / yr
r/hubspot
21k members
64.6% / yr
r/Infographics
524k members
10.6% / yr
r/learnpython
1.0M members
10.7% / yr
r/MLQuestions
107k members
38.3% / yr
r/msp
242k members
17.9% / yr
r/Python
1.5M members
9.0% / yr
About
GummySearch helps people research Reddit communities by organizing activity, growth, themes, and post-level signals into one place.
This page gives a focused view of r/dataengineering, including current member size, discussion patterns, product reviews, and related communities to explore.
This data is synced periodically so insights stay current and useful for ongoing research.
Last updated: June 10, 2026