/r/dataengineering/

r/dataengineering

457k members
r/dataengineering is a subreddit with 457k members. The most common kinds of discussions are advice requests and solution requests, and the community frequently discusses data engineering, looking for, struggling, struggling with, and job, and they frequently recommend/review etl tool, etl tools, and database solution.
News & discussion on Data Engineering topics, including but not limited to: data pipelines, databases, data formats, storage, data modeling, data governance, cleansing, NoSQL, distributed systems, streaming, batch, Big Data, and workflow engines.

Popular Themes in r/dataengineering

#1
Advice Requests
: "How did you guys learn CI/CD and IaC?"
12 posts
#2
Solution Requests
: "Suggest AWS ETL tools"
8 posts
#3
Pain & Anger
: "Data Engineering is boring!"
6 posts
#4
Opportunities
: "Would you risk vendor lock in for your career? Is it worth it to become take a Pentaho developer job for $130k?"
2 posts
#5
Self-Promotion
: "I open-sourced ducklake-sdk: a general SDK for interacting with DuckLake"
1 post

Popular Topics in r/dataengineering

#1

Data Engineering

: "Data Engineering is boring!"
120 posts
#2

Looking For

43 posts
#3

Struggling

: "Fresh Data Analyst Struggling with building a working data pipeline from ground up"
31 posts
#4

Struggling With

18 posts
#5

Job

: "Job market for Data engineer"
15 posts
#6

Career

: "Would you risk vendor lock in for your Career? Is it worth it to become take a Pentaho developer job for $130k?"
13 posts
#7

Etl

: "Suggest AWS Etl tools"
13 posts
#8

Tool

11 posts
#9

Airflow

: "Where to see enterprise grade Airflow data pipeline?"
10 posts
#10

Pipeline

: "Where to see enterprise grade Airflow data Pipeline?"
9 posts

Products Discussed in r/dataengineering

Etl Tool

282 reviews
#1
Fivetran
3.9 from 20 reviews
#2
Airbyte
4.1 from 16 reviews
#3
Talend
3.3 from 15 reviews

Etl Tools

104 reviews
#1
Airbyte
4.4 from 10 reviews
#2
Apache Airflow
4.7 from 10 reviews
#3
dbt
4.8 from 5 reviews
#1
Postgres
4.0 from 2 reviews
#2
ClickHouse
4.5 from 2 reviews
#3
Oracle
5.0 from 1 review

Flair Used in r/dataengineering

#1
Discussion
: "Is anyone migrating away from Databricks?"
71 posts
#2
Help
: "How did you guys learn CI/CD and IaC?"
40 posts
#3
Career
: "How do I become a better data engineer?"
35 posts
#4
Blog
: "Quack: The DuckDB Client-Server Protocol"
20 posts
#5
Personal Project Showcase
: "Pyspark cheat sheet"
12 posts
#6
Rant
: "Maybe I am not cut out to be a DE"
8 posts
#7
Open Source
: "dbt-colibri v0.3.4 : local column-level lineage for your dbt projects."
7 posts
#8
Meme
: "Well played Dagster"
2 posts
#9
Meta
: "Meta post: Promotion and AI generated text clarifications"
1 post

Member Growth in r/dataengineering

Yearly
+120k members(35.5%)

Similar Subreddits to r/dataengineering

/r/cybersecurity

r/cybersecurity

1.5M members
19.6% / yr
/r/data

r/data

52k members
15.8% / yr

r/dataanalysis

217k members
30.5% / yr

r/deeplearning

237k members
21.6% / yr
/r/hubspot

r/hubspot

21k members
64.5% / yr

r/Infographics

523k members
10.6% / yr
/r/learnpython

r/learnpython

1.0M members
10.7% / yr
/r/MLQuestions

r/MLQuestions

106k members
38.9% / yr
/r/msp

r/msp

241k members
18.0% / yr
/r/Python

r/Python

1.5M members
9.0% / yr

About

GummySearch helps people research Reddit communities by organizing activity, growth, themes, and post-level signals into one place.

This page gives a focused view of r/dataengineering, including current member size, discussion patterns, product reviews, and related communities to explore.

This data is synced periodically so insights stay current and useful for ongoing research.

Last updated: June 3, 2026