r/dataengineering is a subreddit with 457k members. The most common discussion types are advice requests and solution requests. Frequent topics include "Data Engineering", "Looking For", "Struggling", "Struggling With", and "Job", and members most often recommend or review products in the "ETL Tool", "ETL Tools", and "Database Solution" categories.
News & discussion on Data Engineering topics, including but not limited to: data pipelines, databases, data formats, storage, data modeling, data governance, cleansing, NoSQL, distributed systems, streaming, batch, Big Data, and workflow engines.
Popular Themes in r/dataengineering
#1 Advice Requests: "How did you guys learn CI/CD and IaC?" (12 posts)
#2 Solution Requests: "Suggest AWS ETL tools" (8 posts)
#3 Pain & Anger: "Data Engineering is boring!" (6 posts)
#4 Opportunities: "Would you risk vendor lock in for your career? Is it worth it to become take a Pentaho developer job for $130k?" (2 posts)
#5 Self-Promotion: "I open-sourced ducklake-sdk: a general SDK for interacting with DuckLake" (1 post)
Popular Topics in r/dataengineering
#1 Data Engineering: "Data Engineering is boring!" (120 posts)
#2 Looking For (43 posts)
#3 Struggling: "Fresh Data Analyst Struggling with building a working data pipeline from ground up" (31 posts)
#4 Struggling With (18 posts)
#5 Job: "Job market for Data engineer" (15 posts)
#6 Career: "Would you risk vendor lock in for your career? Is it worth it to become take a Pentaho developer job for $130k?" (13 posts)
#7 ETL: "Suggest AWS ETL tools" (13 posts)
#8 Tool (11 posts)
#9 Airflow: "Where to see enterprise grade Airflow data pipeline?" (10 posts)
#10 Pipeline: "Where to see enterprise grade Airflow data pipeline?" (9 posts)
Products Discussed in r/dataengineering
ETL Tool (282 reviews)
#1 Fivetran: 3.9★ from 20 reviews
#2 Airbyte: 4.1★ from 16 reviews
#3 Talend: 3.3★ from 15 reviews
ETL Tools (104 reviews)
#1 Airbyte: 4.4★ from 10 reviews
#2 Apache Airflow: 4.7★ from 10 reviews
#3 dbt: 4.8★ from 5 reviews
Database Solution (14 reviews)
#1 Postgres: 4.0★ from 2 reviews
#2 ClickHouse: 4.5★ from 2 reviews
#3 Oracle: 5.0★ from 1 review
Flair Used in r/dataengineering
#1 Discussion: "Is anyone migrating away from Databricks?" (71 posts)
#2 Help: "How did you guys learn CI/CD and IaC?" (40 posts)
#3 Career: "How do I become a better data engineer?" (35 posts)
#4 Blog: "Quack: The DuckDB Client-Server Protocol" (20 posts)
#5 Personal Project Showcase: "Pyspark cheat sheet" (12 posts)
#6 Rant: "Maybe I am not cut out to be a DE" (8 posts)
#7 Open Source: "dbt-colibri v0.3.4 : local column-level lineage for your dbt projects." (7 posts)
#8 Meme: "Well played Dagster" (2 posts)
#9 Meta: "Meta post: Promotion and AI generated text clarifications" (1 post)
Member Growth in r/dataengineering
Yearly: +120k members (35.5%)
Similar Subreddits to r/dataengineering
r/cybersecurity: 1.5M members, 19.6% / yr
r/data: 52k members, 15.8% / yr
r/dataanalysis: 217k members, 30.5% / yr
r/deeplearning: 237k members, 21.6% / yr
r/hubspot: 21k members, 64.5% / yr
r/Infographics: 523k members, 10.6% / yr
r/learnpython: 1.0M members, 10.7% / yr
r/MLQuestions: 106k members, 38.9% / yr
r/msp: 241k members, 18.0% / yr
r/Python: 1.5M members, 9.0% / yr
About
GummySearch helps people research Reddit communities by organizing activity, growth, themes, and post-level signals into one place.
This page gives a focused view of r/dataengineering, including current member size, discussion patterns, product reviews, and related communities to explore.
This data is synced periodically so insights stay current and useful for ongoing research.
Last updated: June 3, 2026