r/dataengineering

459k members
r/dataengineering is a subreddit with 459k members. The most common kinds of discussion are advice requests and solution requests. Frequent topics include data engineering, "looking for", "struggling", "struggling with", and career, and members frequently recommend and review ETL tools and database solutions.
News & discussion on Data Engineering topics, including but not limited to: data pipelines, databases, data formats, storage, data modeling, data governance, cleansing, NoSQL, distributed systems, streaming, batch, Big Data, and workflow engines.

Popular Themes in r/dataengineering

#1
Advice Requests
: "Which Udemy course is good for Python for Data Engineering?"
11 posts
#2
Solution Requests
: "Experience with Dataiku, Knime or Alteryx? Which one is better?"
8 posts
#3
Pain & Anger
: "dagster price increase 10x insane , don't ever use them"
7 posts
#4
Opportunities
: "Would you risk vendor lock in for your career? Is it worth it to become take a Pentaho developer job for $130k?"
2 posts

Popular Topics in r/dataengineering

#1

Data Engineering

: "Data Engineering is boring!"
98 posts
#2

Looking For

43 posts
#3

Struggling

: "Fresh Data Analyst Struggling with building a working data pipeline from ground up"
35 posts
#4

Struggling With

18 posts
#5

Career

: "Would you risk vendor lock in for your career? Is it worth it to become take a Pentaho developer job for $130k?"
16 posts
#6

Job

13 posts
#7

Tool

11 posts
#8

Airflow

: "Dagster vs Airflow? What do we use?"
11 posts
#9

SQL

: "SQLMesh orchestration"
9 posts
#10

AI

: "Getting Salesforce data ready for AI analytics?"
8 posts

Products Discussed in r/dataengineering

ETL Tool

282 reviews
#1
Fivetran
3.9 from 20 reviews
#2
Airbyte
4.1 from 16 reviews
#3
Talend
3.3 from 15 reviews

ETL Tools

104 reviews
#1
Airbyte
4.4 from 10 reviews
#2
Apache Airflow
4.7 from 10 reviews
#3
dbt
4.8 from 5 reviews

Database Solution

#1
Postgres
4.0 from 2 reviews
#2
ClickHouse
4.5 from 2 reviews
#3
Oracle
5.0 from 1 review

Flair Used in r/dataengineering

#1
Discussion
: "Twin brothers wipe 96 gov’t databases minutes after being fired"
68 posts
#2
Help
: "How did you guys learn CI/CD and IaC?"
40 posts
#3
Career
: "Boss keeps throwing me under the bus for using python. Is python a no-go in this sector?"
33 posts
#4
Blog
: "101 concepts every data engineer should know (or some of them :)"
21 posts
#5
Personal Project Showcase
: "Pyspark cheat sheet"
12 posts
#6
Open Source
: "dbt Core v2 is here: still open source, now rebuilt for what's next"
9 posts
#7
Rant
: "Maybe I am not cut out to be a DE"
7 posts
#8
Meme
: "when someone asks you what programming language they should learn, don't simply answer the one you prefer"
4 posts
#9
Meta
: "Meta post: Promotion and AI generated text clarifications"
1 post

Member Growth in r/dataengineering

Yearly
+115k members (33.4%)

Similar Subreddits to r/dataengineering

r/cybersecurity

1.5M members
19.7% / yr

r/data

52k members
16.5% / yr

r/dataanalysis

218k members
30.2% / yr

r/deeplearning

238k members
21.6% / yr

r/hubspot

21k members
64.6% / yr

r/Infographics

524k members
10.6% / yr

r/learnpython

1.0M members
10.7% / yr

r/MLQuestions

107k members
38.3% / yr

r/msp

242k members
17.9% / yr

r/Python

1.5M members
9.0% / yr

About

GummySearch helps people research Reddit communities by organizing activity, growth, themes, and post-level signals into one place.

This page gives a focused view of r/dataengineering, including current member size, discussion patterns, product reviews, and related communities to explore.

This data is synced periodically so insights stay current and useful for ongoing research.

Last updated: June 10, 2026