Best ETL tool on Reddit

89 reviews from r/dataengineering, r/ETL, r/salesforce and 3 more subreddits

#1

Talend

4.1
(9)
"I loved using Talend Open Studio and orchestrating with Rundeck back in the day"
·
"Pretty terrific"
·
"Best bets"
·
"Top free tool right now"
·
"Cost effective"
·
"Ok"
·
"Will not recommend Talend/ Informatica in current data scenarios. They are running on fumes..and don't see any future for them."
#2

Fivetran

4.1
(8)
"Fivetran + dbt will solve it for most use cases."
·
"Handles data loading from various services"
·
"Fivetran/ Qliks are not ETL per say, but great ELT enablers."
·
"It's good"
·
"If you want a GUI centric tool then Fivetran is good."
·
"Great"
·
"Native history mode"
·
"Trailed Fivetran"
#3

Matillion

4.2
(6)
"Great program"
·
"Fantastic"
·
"GUI based tooling"
·
"GUI-based and powerful"
·
"Good performance and reasonable cost"
·
"What is wrong with Matillion?"
#4

Airbyte

4.8
(4)
"Just started exploring airbyte."
·
"The pricing is fair and per use"
·
"Both an open-source version and a cloud one"
·
"Airbyte + AwsLambda"
#5

Airflow

4.8
(4)
"Using airflow for orchestration"
·
"Airflow and Python"
·
"Python-based"
·
"If, however, you're going to deal with multiple independent data sources, you might want to look into more complex solutions like Airflow or Nifi."
#6

Pentaho

4.5
(4)
"Best bets"
·
"Free edition is available and very full featured"
·
"Pentaho or ApacheHop"
·
"More user-friendly and easier to use"
#7

Informatica

3.4
(5)
"Depends on your budget, timeframe and your skillset."
·
"Cost effective"
·
"Cost effective"
·
"Informatica have modernized for cloud, but, it's for large enterprise."
·
"Pretty antiquated"
#8

MuleSoft

5.0
(3)
"Integrations specifically built for Salesforce applications and other major apps like SAP"
·
"No compromises"
·
"Depends on your budget, timeframe and your skillset."
#9

PySpark

4.7
(3)
"PySpark has become a pretty standard part of many DE stacks, and it's pretty battle-tested"
·
"Widely applicable and a more generalized skillset in addition to allowing ETL"
·
"Pyspark is fine if you have large datasets to process using spark/databrick clusters."
#10

dbt

4.7
(3)
"DBT is just a build/modeling tool run on straight sql"
·
"Run snapshots on raw or staged tables"
·
"Can probably make things work"
#11

Hevo

5.0
(2)
"Really like it"
·
"Easy built-in integrations and webhooks"
#12

Safe Software

4.5
(2)
"Specialize in Geospatial ETL but can read/write/transform just about anything"
·
"Pretty robust and give you the ability to create custom transformers"
#13

n8n

4.0
(2)
"N8n"
·
"A bit more visual, but still quite simple would be n8n."
#14

Meltano

4.0
(2)
"Using meltano for staging/replication"
·
"Can probably make things work"
#15

Jitterbit

4.0
(2)
"Cost effective"
·
#16

AWS

4.0
(2)
"You could use etl blocks in a high level or You can use a language program like python."
·
"More like a set of in-cloud tools, it has a database, but has not yet built supply chain (EDI and ERP integration or ETL) tools"
#17

Stitch

5.0
(1)
"They now support more than 65 integrations, the most of any ETL vendor we’re aware of."
#18

Domo

5.0
(1)
"For sure"
#19

KNIME

5.0
(1)
"I like KNIME"
#20

Snowflake

5.0
(1)
"One of the best etl tools"
#21

Dagster

5.0
(1)
"Great too"
#22

Prefect

5.0
(1)
"Highly recommend it"
#23

Ab Initio

5.0
(1)
"Can do such things if it makes sense price wise. Will run on your hardware and it's quite easy to parallelize graphs. Also supports metaprogramming so you might find a smart way to handle different file formats."
#24

Easy Data Transform

5.0
(1)
"Might be worth a look at Easy Data Transform for serious Excel munging. Only $100 (one time fee) and does what you want, except it can't write directly to MySQL (it can generate the SQL to insert into a database though)."
#25

Alteryx

5.0
(1)
"Huge wealth of information online in their forums"
#26

Kubernetes

5.0
(1)
"It's all running in kubernetes"
#27

Zapier

5.0
(1)
"Cost effective"
#28

SFDMU

5.0
(1)
"If you are good to code look at JS library jsforce and/or SFDMU."
#29

JSforce

5.0
(1)
"If you are good to code look at JS library jsforce and/or SFDMU."
#30

Azure

5.0
(1)
"Cheap, effective and battle tested"
#31

Datatlas

5.0
(1)
"Drag & drop data management platform"
#32

Apache

5.0
(1)
"Open source. Visual low code / no code. Metadata driven. It is a fork of Pentaho Kettle. It is easy to learn but with a lot of features."
#33

Sprinkle

5.0
(1)
"Been using Sprinkle Pipelines for quite some time now"
#34

Lambda

4.0
(1)
"Can build what you want in AWS"
#35

Glue

4.0
(1)
"Can build what you want in AWS"
#36

DMS

4.0
(1)
"Can build what you want in AWS"
#37

Apache NiFi

4.0
(1)
"Apache Nifi"
#38

NiFi

4.0
(1)
"If, however, you're going to deal with multiple independent data sources, you might want to look into more complex solutions like Airflow or Nifi."
#39

Workato

4.0
(1)
"Pretty good features"
#40

Boomi

4.0
(1)
"Cost effective"
#41

Xplenty

3.0
(1)
"Decent performance but slightly expensive"
#42

Stitcher

3.0
(1)
"Trailed Stitcher"
#43

Dell Boomi

3.0
(1)
"Cost effective"
#44

VM

1.0
(1)
"Why it isn't effective?"
