Best etl tool on Reddit

227 reviews from r/dataengineering, r/ETL, r/salesforce and 4 more subreddits

227 reviews from
and
By Brand
/
By Product
#1

Talend

3.9
(20)
"Talend also offers a free version."
·
"I loved using Talend Open Studio and orchestrating with Rundeck back in the day"
·
"Pretty terrific"
·
"Best bets"
·
"Top free tool right now"
·
"Talend is easy to use, have lot of integrations inbuilt."
·
"If you are a small team I can suggest Talend, you can quickly build up ETL with minimum infrastructure setup."
·
"Talend offers an intuitive interface and strong data transformation features."
·
"Cost effective"
·
"Talend, Mulesoft, Informatica"
·
#2

dbt

4.6
(12)
"Dbt simplifies the data transformation process."
·
"DBT is just a build/modeling tool run on straight sql"
·
"We actually like our work because of them."
·
"Dbt is a good option."
·
"Run snapshots on raw or staged tables"
·
"A lot of buzz about dbt lately, it’s focused on SQL, look into it."
·
"Dbt is a great tool for modeling data, all SQL driven."
·
"Dbt has been around longer and is a solid choice for data transformation."
·
"Works well with Airbyte."
·
"Dagster and dbt."
·
#3

Fivetran

4.0
(13)
"Fivetran + dbt will solve it for most use cases."
·
"Fivetran is very popular - they are an industry leader and have a ton of rebuilt connectors. Fivetran just works..."
·
"Handles data loading from various services"
·
"Fivetran/ Qliks are not ETL per say, but great ELT enablers."
·
"It's good"
·
"Fivetran is the easiest to use with the most available sources and targets."
·
"If you want a GUI centric tool then Fivetran is good."
·
"Great"
·
"Segment, Stitch, and Fivetran are all different but affordable"
·
"Fivetran offers a cloud-based ETL solution that is easy to use."
·
#4

Matillion

4.0
(11)
"Great program"
·
"I would suggest giving Matillion Productivity Cloud a go (Cloud SaaS). I'm yet to come across an equivalent ELT Cloud hosted tool that covers both integration and transformation."
·
"Fantastic"
·
"GUI based tooling"
·
"If the datasets are average and transformation are not complex, Matillion works and goes above a simple EL solution."
·
"Matillion is a top choice for its user-friendly design."
·
"Matillion Cloud is a scalable solution for ETL."
·
"GUI-based and powerful"
·
"Good performance and reasonable cost"
·
"What is wrong with Matillion?"
·
#5

Pentaho

4.3
(8)
"Best bets"
·
"Free edition is available and very full featured"
·
"I'm quite fond of Pentaho personally - free edition is available and very full featured."
·
"Big fan of Pentaho. Open source and free goes a long way, if you don't mind the occasional bug."
·
"Pentaho or ApacheHop"
·
"Pentaho is also a strong contender in the free ETL tools space."
·
"More user-friendly and easier to use"
·
"It is free and has lots of connectors."
#6

Airflow

4.4
(7)
"Using airflow for orchestration"
·
"Airflow and Python"
·
"Python-based"
·
"Consider running your scripts as Airflow tasks."
·
"Good for orchestration."
·
"Airflow for ingress with DBT for transformations."
·
"If, however, you're going to deal with multiple independent data sources, you might want to look into more complex solutions like Airflow or Nifi."
#7

Airbyte

4.4
(7)
"Just started exploring airbyte."
·
"The pricing is fair and per use"
·
"Both an open-source version and a cloud one"
·
"**Open-source**: sling-cli, airbyte, peerdb"
·
"Airbyte is a powerful tool for data ingestion."
·
"The pricing is fair and per use."
·
"Airbyte + AwsLambda"
#8

Informatica

3.3
(9)
"Depends on your budget, timeframe and your skillset."
·
"Informatica provides robust features for data transformation."
·
"Cost effective"
·
"Informatica is a good option for file-based integrations."
·
"Talend, Mulesoft, Informatica"
·
"Cost effective"
·
"Informatica have modernized for cloud, but, it's for large enterprise."
·
"Pretty antiquated"
·
"Informatica should be avoided unless your boss forces you to use it."
#9

Apache

4.8
(6)
"Open source. Visual low code / no code. Metadata driven. It is a fork of Pentaho Kettle. It is easy to learn but with a lot of features."
·
"Pyspark is an excellent tool for big data processing."
·
"Apache Beam makes this pretty easy with the documentation."
·
"Lots of good choices mentioned here already, but one more is Apache Hop."
·
"Apache Airflow is scalable, extensible, and dynamic with configuration-as-code in Python."
·
"You'll still want to get to know Apache Spark in general."
#10

PySpark

4.0
(6)
"PySpark has become a pretty standard part of many DE stacks, and it's pretty battle-tested"
·
"Widely applicable and a more generalized skillset in addition to allowing ETL"
·
"For pure ELT big things PySpark."
·
"Learning PySpark is valuable, but most ETL work can be automated."
·
"Pyspark is fine if you have large datasets to process using spark/databrick clusters."
·
"Haven't used pyspark but we don't have big data requirements (yet)."
#11

Hevo

4.8
(5)
"Really like it"
·
"Simple ETL that I find easier to use than Stitcher or Fivetran."
·
"We use Hevo for E&L for easy built-in integrations and webhooks."
·
"Easy built-in integrations and webhooks"
·
"If you want a no-code kind of thing, go for Hevo."
#12

Meltano

4.4
(5)
"Meltano + dbt is a great option."
·
"Use airbyte or meltano, or the best tool ever python, but if you wanna give a good platform, use meltano + python and dbt with airflow or dagster as orchestrator."
·
"Using meltano for staging/replication"
·
"Can probably make things work"
·
"Meltano is a handy ETL tool, I’ve used it for E&L before following up with dbt for T&L within the warehouse."
#13

Snowflake

4.2
(5)
"One of the best etl tools"
·
"Snowflake is a lot easier to use compared to other solutions."
·
"Snowflake is among the top technologies to consider."
·
"Snowflake also has some integrations that work directly with Azure Blob storage."
·
"Snowflake is one of the best ETL tools."
#14

MuleSoft

4.2
(5)
"Integrations specifically built for Salesforce applications and other major apps like SAP"
·
"No compromises"
·
"Depends on your budget, timeframe and your skillset."
·
"Talend, Mulesoft, Informatica"
·
"Mulesoft is overkill and extremely expensive for this integration."
#15

Integrate.io

4.7
(3)
"Integrate.io has done the job for my team for 3 years. No complaints."
·
"We did a pilot with Integrate.io three months ago and found they make data pipelines work faster than anyone else's."
·
"Gave Integrate.io a try and was straightforward overall, which is a win when I have a billion-and-one-things on our plate."
#16

Alteryx

4.7
(3)
"Huge wealth of information online in their forums"
·
"Alteryx is excellent for data engineering and analytics, allowing for seamless ETL processes."
·
"We use a tool called Alteryx for this I think."
#17

Etlworks

4.7
(3)
"Look at Etlworks. Linux, Windows, Docker, cloud, self-hosted. Hundreds of connectors."
·
"It does what you need."
·
"Not very well known but allows us to do almost everything ETL/ELT related."
#18

AWS

3.0
(4)
"You could use etl blocks in a high level or You can use a language program like python."
·
"More like a set of in-cloud tools, it has a database, but has not yet built supply chain (EDI and ERP integration or ETL) tools"
·
"AWS solutions like Redshift require a lot of maintenance."
·
"AWS has not yet built supply chain tools, which is a gap."
#19

Microsoft

5.0
(2)
"If you're already using Azure I'd recommend checking Azure Data Factory."
·
"Cheap, effective and battle tested."
#20

Databricks

4.5
(2)
"Databricks is the best for all kind of ETL operations."
·
"Databricks is a popular technology in data engineering."
#21

Stitch

4.5
(2)
"They now support more than 65 integrations, the most of any ETL vendor we’re aware of."
·
"Segment, Stitch, and Fivetran are all different but affordable"
#22

Dagster

4.5
(2)
"Great too"
·
"Dagster and dbt."
#23

Prefect

4.5
(2)
"Highly recommend it"
·
"A solid alternative for orchestration."
#24

Ab Initio

4.5
(2)
"Can do such things if it makes sense price wise. Will run on your hardware and it's quite easy to parallelize graphs. Also supports metaprogramming so you might find a smart way to handle different file formats."
·
"Abinitio has very good parallel processing when your source data is huge."
#25

Python

4.5
(2)
"A great library for boilerplate loading!"
·
"For pure ELT small things Python."
#26

KNIME

4.5
(2)
"I like KNIME"
·
"I'm using KNIME Analytics for dinner kinds of things."
#27

Safe Software

4.5
(2)
"Specialize in Geospatial ETL but can read/write/transform just about anything"
·
"Pretty robust and give you the ability to create custom transformers"
#28

n8n

4.0
(2)
"N8n"
·
"A bit more visual, but still quite simple would be n8n."
#29

Jitterbit

4.0
(2)
"Jitterbit has a free version available."
·
"Cost effective"
#30

Estuary

4.0
(2)
"Estuary's CDC integration tool is cost-effective and easy to set up."
·
"Estuary is an up-and-coming option for affordable data extraction and loading."
#31

Mulesoft

3.0
(2)
"Mulesoft has integrations specifically built for Salesforce applications."
·
"Mulesoft is the 'no compromises' Middleware, but there are more cost-effective offerings."
#32

Singer

5.0
(1)
"We use Singer taps running in docker containers for the extract/load and DBT for the transform."
#33

dagster

5.0
(1)
"We use it for our data warehouse and are very happy with it."
#34

AirByte

5.0
(1)
"AirByte offers an open source version that can be easily set up with docker-compose."
#35

Ask On Data

5.0
(1)
"Ask On Data is a cutting-edge NLP-based ETL tool designed to streamline and optimize data processing workflows."
#36

Megalada

5.0
(1)
"Low-code + Visual Design + Performance is the best bunch for ETL."
#37

BigQuery

5.0
(1)
"BigQuery is recommended for data transformation."
#38

Data Studio

5.0
(1)
"Data Studio is free and excellent for dashboards and reports."
#39

Skyvia

5.0
(1)
"Skyvia is a standout option for simplifying data ingestion into Amazon Redshift."
#40

Google

5.0
(1)
"It's extremely easy to run on GCP with Dataflow."
#41

ClickHouse

5.0
(1)
"ClickHouse is great for analytical queries."
#42

Spark

5.0
(1)
"Spark is highly scalable and works well with iceberg tables."
#43

dlt

5.0
(1)
"Has a small learning curve compared to a gui tool, but absolutely was worth the investment for me."
#44

prophecy.io

5.0
(1)
"It supports both ETL and ELT; allows for visual development; gives you clean & editable Dbt Core and Airflow code."
#45

Metabase

5.0
(1)
"Personally quite fond of Metabase since it's easy to put in existing infrastructure."
#46

GetDBT.com

5.0
(1)
"Great tool for data transformation and integration with delta lake and airflow."
#47

Sprinkle Data

5.0
(1)
"I can recommend a tool which has a free plan and it might suit your use case perfectly. Personally, I have been using it for more than a couple of years now."
#48

Domo

5.0
(1)
"For sure"
#49

Easy Data Transform

5.0
(1)
"Might be worth a look at Easy Data Transform for serious Excel munging. Only $100 (one time fee) and does what you want, except it can't write directly to MySQL (it can generate the SQL to insert into a database though)."
#50

Kubernetes

5.0
(1)
"It's all running in kubernetes"
#51

Zapier

5.0
(1)
"Cost effective"
#52

SFDmu

5.0
(1)
"If you are good to code look at JS library jsforce and/or SFDMU."
#53

JSforce

5.0
(1)
"If you are good to code look at JS library jsforce and/or SFDMU."
#54

Azure

5.0
(1)
"Cheap, effective and battle tested"
#55

Datatlas

5.0
(1)
"Drag & drop data management platform"
#56

Sprinkle

5.0
(1)
"Been using Sprinkle Pipelines for quite some time now"
#57

Estuary Flow

4.0
(1)
"It's a unified (real-time + batch) data integration platform."
#58

CloverDX

4.0
(1)
"My team is currently using CloverDX."
#59

Sequel

4.0
(1)
"Check out the ruby based sequel"
#60

Mage

4.0
(1)
"I’ve been pleasantly surprised with Mage after a few days of playing around with it."
#61

SymmetricDS

4.0
(1)
"Symmetric DS works well and has an open source version."
#62

Wormhole

4.0
(1)
"Check out "Wormhole", way more affordable than Informatica."
#63

Funnel.io

4.0
(1)
"You could also take a look at funnel.io and Stitch."
#64

Apache SeaTunnel

4.0
(1)
"I have had my eyes on Apache SeaTunnel for a while."
#65

Weld

4.0
(1)
"You could look into Weld. They offer “automated” ELT/reverse-ELT to snowflake afaik."
#66

Apache Airflow

4.0
(1)
"Look at Apache Airflow."
#67

Celigo

4.0
(1)
"Check out Seekwell.io and Celigo integrator.io"
#68

Oracle Data Integrator

4.0
(1)
"Oracle Data Integrator 12c has delivered connectors to the Oracle cloud."
#69

Sling CLI

4.0
(1)
"**Open-source**: sling-cli, airbyte, peerdb"
#70

PeerDB

4.0
(1)
"**Open-source**: sling-cli, airbyte, peerdb"
#71

Seekwell

4.0
(1)
"Check out Seekwell.io and Celigo integrator.io"
#72

StreamSets

4.0
(1)
"Check out Streamsets as well."
#73

Segment

4.0
(1)
"Segment, Stitch, and Fivetran are all different but affordable"
#74

Prophecy

4.0
(1)
"I came across prophecy.io and have to say that they are fun 🤩"
#75

Dremio

4.0
(1)
"Looks like you need dremio"
#76

AWS Glue

4.0
(1)
"You could potentially replace Informatica with AWS Glue."
#77

IBM

4.0
(1)
"IBM Sterling is also a strong choice for file-based integrations."
#78

GCP

4.0
(1)
"For GCP Pub/Sub."
#79

MinIO

4.0
(1)
"MinIO provides excellent object storage capabilities."
#81

Postgres

4.0
(1)
"Postgres is a reliable choice for data storage."
#82

sqlmesh

4.0
(1)
"Sqlmesh seems like a more sanely thought-out tool set."
#83

portable.io

4.0
(1)
"Portable.io is a lower cost option that can extract and load data easily."
#84

Rivery

4.0
(1)
"Rivery is another affordable option for data extraction and loading."
#85

Coalesce

4.0
(1)
"Coalesce is great for transformation requirements, especially on Snowflake."
#86

Mage.ai

4.0
(1)
"Great for coding in Python."
#87

Rundeck

4.0
(1)
"Orchestrating with Rundeck back in the day was great."
#88

Coalesce.io

4.0
(1)
"Coalesce.io is a great transformation tool but requires an integration tool as well thus more vendors to deal with."
#89

Lambda

4.0
(1)
"Can build what you want in AWS"
#90

Glue

4.0
(1)
"Can build what you want in AWS"
#91

DMS

4.0
(1)
"Can build what you want in AWS"
#92

Apache NiFi

4.0
(1)
"Apache Nifi"
#93

NiFi

4.0
(1)
"If, however, you're going to deal with multiple independent data sources, you might want to look into more complex solutions like Airflow or Nifi."
#94

Workato

4.0
(1)
"Pretty good features"
#95

Boomi

4.0
(1)
"Cost effective"
#96

Xplenty

3.0
(1)
"Decent performance but slightly expensive"
#97

Stitcher

3.0
(1)
"Trailed Stitcher"
#98

Dell Boomi

3.0
(1)
"Cost effective"
#99

CloverETL

2.0
(1)
"CloverETL is not free but nothing really is."
#100

VM

1.0
(1)
"Why it isn't effective?"

Discover your audience

GummySearch is an audience research toolkit for 130,000 unique communities on Reddit.

If you are looking for startup problems to solve, want to validate your idea or find your first customers online, GummySearch is for you.

Sign up for free, get community insights in minutes.

Tell me more
Get started
Audience Research