Etl Tools reviews from Reddit
Summary
We analyzed 178 Reddit reviews across 10 subreddits and 23 posts to rank the best Etl Tools brands recommended by redditors, including communities like r/dataengineering, r/ETL, r/tableau, r/snowflake, r/bigdata. Top-rated brands include Airbyte (4.5/5), Apache Airflow (4.7/5), Talend (4.0/5).
Stats
Reviews178
Subreddits10
Posts23
Brands83
Products21
178 reviews from
and
By Brand
/By Product
#1
Airbyte
4.5
(11)
"We currently use Airbyte + dbt and built custom connectors for ERP’s."
"I would recommend to look into Airbyte"
"Airbyte for E and L"
"We use airbyte for extraction + load"
"Airbyte"
"If you are working to a budget then I can wholly recommend airbyte. Open source out of the box."
"Did you try Airbyte? I heard it’s cheaper than Fivetran"
"We have been analyzing Hevo, Talend, Pentaho, Airbyte, etc. They suit very well for SMEs."
"Airbyte definitely felt like a very simple, clear design for moving data from various external sources to various other targets on a regular schedule."
"Airbyte connectors are written in Python, so you are very close and it shouldn't be a problem to write your own."
#2
Apache Airflow
4.7
(10)
"Go for open-source tools like Apache Airflow for data extraction"
"Consider Apache Airflow for orchestration"
"Airflow (orchestration)"
"For open-source ETL tools compatible with GCP, consider Airflow"
"Airflow is a good alternative to things like SSIS"
"Surprised to not have seen Airflow mentioned for scheduling yet"
"Airflow for orchestration"
"You can self host etl tools with docker like Mage or Airflow"
"I would have said airflow , which I was using all the time myself"
"Some of these modern tools like Airflow, Prefect and Dagster work way better cause, well, you just write Python."
#3
Talend
4.0
(11)
"Talend, all the way on this one"
"Talend Data Integration works great with MySQL."
"We have been analyzing Hevo, Talend, Pentaho, Airbyte, etc. They suit very well for SMEs."
"Talend is very similar to NiFi. The free version is pretty good, although it has the same maintenance problem as NiFi."
"For pure ETL I'd say Talend is the richest toolset though you are likely to end up with Talend commercial for production jobs."
"Have a look at Talend Open Studio."
"If you are on a budget, I would recommend Talend. It is free to use and get started with."
"Talend offers a good alternative for data integration."
"Talend Open Studio might be worth a look. Lots of connectors for in/outbound data. MySQL is covered IIRC, as are a lot of other DBs."
"Talend is also available as an Open source option, coming with a good connector library but you need good Java skills to customize from time to time."
#4
Alteryx
4.0
(10)
"Alteryx is worth it. It works great with R,Python and Tableau."
"I can build a workflow in Alteryx in a matter of minutes that it reusable -AND- other people can understand it and support it later."
"Alteryx is always woth because of its ridiculous productivity, and geo tools, and stats tools, and deploy to tableau server."
"Alteryx."
"Alteryx"
"Ive heard good things about Alteryx."
"Alteryx is like $50,000/year for licensing. It's a super productive, easy to use tool with tons of out of the box capabilities."
"Alteryx, is the logical modern successor, but I've had problems in their server environment"
"Alteryx is not really an ETL tool though in the classic Data Warehouse sense."
"I've seen alytex and knime recommended. Running either of those in a server is expensive."
#5
Pentaho
3.6
(8)
"Pentaho"
"We have been analyzing Hevo, Talend, Pentaho, Airbyte, etc. They suit very well for SMEs."
"Pentaho Data Integration (Kettle) is good, and FOSS. It's written in Java."
"Pentaho is a great tool to learn if you want to explore new options."
"Kettle itself is a fairly solid ETL tool."
"My team used Pentaho for years, but as you said at a certain point the complexity can get a bit difficult to deal with."
"Are you using Pentaho for data integrations (aka syncing data from 3rd party APIs)?"
"I'd also add Pentaho to the list."
#6
dbt
4.5
(6)
"DBT for modeling"
"Dbt for transformations"
"Dbt for transformation"
"DBT (Core) for transformation"
"Check out dbt and sqlmesh"
"For Transformation, you got DBT(code intensive) or Coalesce (specifically built for Snowflake)"
#7
Fivetran
3.8
(6)
"For extraction and loading , tools like Fivetran are considered best in the market"
"Fivetran is a good SaaS for data replication of cloud sources but it's not cheap"
"Fivetran is a very popular ETLaaS solution that works with Snowflake and supports AWS Gov Cloud"
"My team is using fivetran to easily house all our different connections"
"Technically ELT, fivetran worked well for my team"
"I’ve never heard of Pentaho, but there are SaaS tools like Fivetran that can help."
#8
Python
4.6
(5)
"Yes, Python is the go-to language in the DE world for interacting with REST APIs."
"Python my preferred method even better if they have an SDK for the API otherwise requests library is really good."
"Moved all the ETLs to Python PETL module."
"Personally, I'd probably do it using vanilla python instead of Pandas."
"Even if budget was no object I would still stick with python."
#9
KNIME
4.4
(5)
"Knime"
"I've been using free desktop KNIME in my organization and it is a great ETL and analytical tool."
"KNIME all the way."
"If open-source is appealing, be sure to check out KNIME"
"Knime is a good alternative to Alteryx and its more customizable and integrates with r python and Java."
#10
Apache Hop
4.5
(4)
"Apache Hop (ETL pipeline)"
"Currently migrating it to Apache Hop"
"Is a tool that our team is exploring since we typically perform one-off data migrations"
"Check out Apache Hop!"
#11
Apache
4.3
(4)
"Great for large-scale data processing and complex transf"
"I dont hear a lot of people talk about apache seatunnel"
"I have managed and deployed NiFi clusters for Enterprise. They are extremely useful for streaming and batch ETL."
"I like Apache Camel it has libraries for every type of connector in existence. It scales into the hundreds of millions of objects per hour."
#12
Microsoft
3.8
(4)
"Power automate might be the way to go."
"Have you considered Azure Data Factory?"
"Having come from a Microsoft background, I like it quite a bit, and like Talend, I believe it represents the classical ETL tool."
"SSIS Performance totally depends on the hardware you install it on."
#13
Informatica
3.3
(4)
"Informatica is great as well."
"We use Python, Informatica and IBM ACE for REST API integrations across my org."
"Informatica or other big old players still exist, but that's pricey and well-hated."
"You can install informatica for free and practice on it."
#14
SSIS
4.0
(3)
"I use SSIS / visual studio with the SQL server data tools plugin as a completely no code tool."
"I use SSIS and Dell Boomi for my ETL needs."
"Focus on your skills of understanding warehousing best practices and how to do things manually with SQL."
#15
Apache Kafka
4.0
(3)
"If you are looking for an open-source solution with good performance, try Kafka and Flink."
"Apache Kafka/Confluent is the big name that comes to mind for streaming."
"Kafka with Logstash"
#16
Hevo
4.0
(3)
"Has a free trial you can always check them out!"
"Have you tried Hevo Data (www.hevodata.com). They seem to have pretty good G2 Ratings."
"We have been analyzing Hevo, Talend, Pentaho, Airbyte, etc. They suit very well for SMEs."
#17
Apache NiFi
4.0
(3)
"For open-source ETL tools compatible with GCP, consider Apache NiFi"
"Apache NiFi could be a good option for you."
"Nifi, Kafka and Flink"
#18
Estuary Flow
4.0
(3)
"[Estuary Flow] has a free tier if you're looking for unified (batch + real-time) ETL platforms 🙂"
"If you're looking for a real-time ETL tool, Estuary Flow (estuary. dev) is a strong contender."
"Estuary.dev Predictable pricing and easy to setup"
#19
Talend Open Studio
5.0
(2)
"For open-source ETL tools compatible with GCP, consider Talend Open Studio"
"Talend Open Studio is what my team uses, it has a GUI and I like it"
#20
Pentaho Data Integration
5.0
(2)
"My engineer built a very complex and fully automated ETL solution with PDI"
"Pentaho Data Integration."
#21
Keboola
5.0
(2)
"I'd recommend checking out Keboola. There's over 200+ connectors available out of the box."
"Try Keboola.com, we have both extractor and writer and generous free tier."
#22
DuckDB
5.0
(2)
"DuckDB for processing"
"Use DuckDB or Postgres for your data warehouse"
#23
Meltano
4.5
(2)
"I'd highly recommend [https://sdk.meltano.com/en/latest/](https://sdk.meltano.com/en/latest/) (open source, MIT, free)."
"Meltano for ingestion"
#24
PostgreSQL
4.0
(2)
"PostgreSQL/MySQL for storage"
"PostgreSQL (database)"
#25
Quix.io
4.0
(2)
"If you're thinking of using Apache Kafka with Python, check out Quix Streams."
"I've been using Quix.io lately. It's billed at a Stream processing platform but can totally be used for micro batch / data engineering workloads too."
#26
Prefect
4.0
(2)
"In a small company I setup prefect with python for Pipelines info a basic on prem SQL server"
"I use prefect to decorate that and it orchestrates jobs and handles failures"
#27
Mage
4.0
(2)
"You can self host etl tools with docker like Mage or Airflow"
"If you want an open-source solution, check out Mage."
#28
Precog
3.5
(2)
"Check out https://precog.com their api experience is quit smooth."
"Some of the smaller players like Precog or Portable could probably be purchased without compete as small businesses."
#29
Stitch Fix
5.0
(1)
"We've found a lot of value using this at Stitch Fix and are looking to expand to new users."
#30
Safe Software
5.0
(1)
"FME by Safe Software meets the criteria of working with cloud and on-premise databases and SaaS."
#31
StackWizard
5.0
(1)
"Super easy and it takes about 30 seconds."
#32
Estuary
5.0
(1)
"Thers a useful free tier, too! And once you capture the data, we can do streaming materializations of the data into any number of destinations."
#33
Sprinkle Data
5.0
(1)
"I have been using Sprinkle Data for more than 2 years now and am able to pull in data from 10s of databases and perform ETL on top of the data."
#34
Dagster
5.0
(1)
"Dagster for orchestration"
#35
Data Streams
4.0
(1)
"A completely free alternative would be Data Streams! It’s simple to use and it’s free so you have nothing to lose."
#36
Maestro
4.0
(1)
"I've been using the beta version of Maestro and that is really cool and fast."
#37
Epitech Integrator
4.0
(1)
"If you are looking for an easy to use data integration tool with a short learning curve, I suggest you go to Epitech Integrator."
#38
Stitch
4.0
(1)
"Stitch is one possibility. But there are SOOO many others."
#39
Zapier
4.0
(1)
"I would suggest adding point-to-point tools like Zapier or Tray."
#40
Census
4.0
(1)
"Reverse ETL like Census to that list."
#41
DBAmp
4.0
(1)
"If you use MSSQL and Salesforce I like DBAmp if you like working with native SQL."
#42
Kafka
4.0
(1)
"You can also use Kafka-delta-inject (which is written in Rust) to do this."
#43
Azure Data Factory
4.0
(1)
"If you already use ADF, but focus it on EL only."
#44
Microsoft Power Automate
4.0
(1)
"Power automate or using the ms graph API."
#45
Microsoft Graph API
4.0
(1)
"The former is the more intuitive GUI option, another has given good advice re service account etc."
#46
AskOnData
4.0
(1)
"AskOnData - a chat based GenAI powered data engineering tool."
#47
Google Cloud Functions
4.0
(1)
"We have been using an ELT architecture wherein API pulls are done to Cloud Storage by running python code in Google Cloud Functions."
#48
EasyMorph
4.0
(1)
"For visual ETL, EasyMorph."
#49
Confluent Kafka
4.0
(1)
"You can look into hosting your own confluent Kafka with Kafka connect for streaming."
#50
pdpipe
4.0
(1)
"Try pdpipe – Easy pipelines for pandas dataframes"
#51
ETLWorks
4.0
(1)
"Check etlworks.com."
#52
FME
4.0
(1)
"I recommend FME ETL tool for querying APIs."
#53
Azure
4.0
(1)
"Just saw this post and wanted to suggest that you try out Azure."
#54
MySQL
4.0
(1)
"Using SSIS with MySQL is straightforward and effective."
#55
Precisely
4.0
(1)
"Data360 Analyze has connectors to many applications and offers a free download."
#56
CloverDX
4.0
(1)
"If you're considering Data Integration and ETL tools then I'd add CloverDX to the list - formerly CloverETL."
#57
Dataiku
4.0
(1)
"Have a look at https://www.dataiku.com/ , it’s pretty cool if you need to do proper data engineering with workflows i.e. a lot of T in ETL."
#58
etlworks
4.0
(1)
"Try Integrator by etlworks. https://etlworks.com It is easy to use cloud data integration service, flexible, works with diverse data sources."
#59
Matillion
4.0
(1)
"You also have got enterprise tools like Matillion and Informatica IDMC, which give you an end to end ETL functionality"
#60
Databricks
4.0
(1)
"I would checkout delta live tables by databricks"
#61
AWS
4.0
(1)
"Have you looked at AWS AppFlow which competes with Fivetran and the like but is an AWS service?"
#62
Preswald
4.0
(1)
"Preswald is a solid choice for cleaning, enriching, and visualizing your data without breaking the bank"
#63
TabsData
4.0
(1)
"You can use tabsdata to model, transform and export data of interest within the system"
#64
Power BI
4.0
(1)
"Power BI / Excel (visualization)"
#65
Metabase
4.0
(1)
"Metabase"
#66
Apache Superset
4.0
(1)
"Apache Superset"
#67
Skyvia
4.0
(1)
"Skyvia provides easy integration with GCP, suitable for those new to ETL processes"
#68
Apache SeaTunnel
4.0
(1)
"Apache SeaTunnel"
#69
QuickTable
4.0
(1)
"We are building QuickTable which is low code. It can generate SQL and udf functions for different data warehouses automatically."
#70
TimeXtender
4.0
(1)
"For a GUI application I found timextender to be quite nice."
#71
Ascend.io
4.0
(1)
"Ascend.io is awesome but cloud only."
#72
Palantir
4.0
(1)
"Finally, Palantir is the gold stand"
#73
Datacoves
4.0
(1)
"If you want to use dbt and don't want to figure out all the platform bits, check out Datacoves"
#74
Omnata Sync Engine
4.0
(1)
"Omnata Sync Engine is a Snowflake native app, so you get your own private instance 100% on Snowflake"
#75
DatErica
4.0
(1)
"I recommend looking into DatErica.com for a budget-friendly ETL tool that handles real-time data processing effectively."
#76
Pathway
4.0
(1)
"Pathway is a Python ETL framework for real-time data processing."
#77
FolioProjects
4.0
(1)
"[FolioProjects](https://folioprojects.com) for project management ETLs"
#78
Bento
4.0
(1)
"You can use Bento, which is 100% open source, so it's free."
#79
IICS
3.0
(1)
"IICS, GCP Datafusion , azure data Factory"
#80
Coalesce
3.0
(1)
"For Transformation, you got DBT(code intensive) or Coalesce (specifically built for Snowflake)"
#81
Portable
3.0
(1)
"Some of the smaller players like Precog or Portable could probably be purchased without compete as small businesses."
#82
OpenRefine
3.0
(1)
"Depending on the complexity of your ETL, Open Refine might be an option."
#83
IBM DataStage
1.0
(1)
"You aren’t wrong. DataStage is terrible with it."
Discover your audience
GummySearch is an audience research toolkit for 130,000 unique communities on Reddit.
If you are looking for startup problems to solve, want to validate your idea or find your customers online, GummySearch is for you.
Sign up for free, get community insights in minutes.
Tell me more
Get started
