r/datacleaning
5k members
r/datacleaning is a medium-sized subreddit with 5k members.
Data scientists can spend up to 80 percent of their time correcting data errors before extracting value from the data.
We at /r/datacleaning are interested in data cleaning as a preprocessing step to data mining. This subreddit is focused on advances in data cleaning research, data cleaning algorithms, and data cleaning tools. Related topics that we are interested in include: databases, statistics, machine learning, data mining, AI, visualization, etc.
Popular Themes in r/datacleaning
#1 Advice Requests (31 posts): "How to Engineer and Cleanse your data prior to Machine Learning | Analytics | Data Science"
#2 Solution Requests (28 posts): "Data extraction from scanned documents"
#3 Self-Promotion (8 posts): "End-To-End Data Preparation with my new open source project: https://github.com/kuwala-io/kuwala"
#4 Pain & Anger (7 posts): "Bad data guide : problems seen in real-world data along with suggestions on how to resolve them."
#5 Money Talk (1 post): "Data Quality Analysts: Talk to us about data quality issues, get a $50 Amazon gift card!"
#6 News (1 post): "Why scraping public pages is legal in the US"
Popular Topics in r/datacleaning
#1 Data Cleaning (71 posts): "Data Cleaning is one of the basic and important technique used in data preprocessing. Following article explains about the different Data Cleaning methods"
#2 Office Cleaning (15 posts)
#3 Data Cleansing (13 posts): "Best Practices for Effective Data Cleansing: A Guide for Businesses"
#4 Machine Learning (11 posts): "How to Engineer and Cleanse your data prior to Machine Learning | Analytics | Data Science"
#5 Python (10 posts): "Data Science for Sports Injuries Using R, Python, and Weka"
#6 R (9 posts): "Data Science for Sports Injuries Using R, Python, and Weka"
#7 Data Science (9 posts): "The Rise of Data Science"
#8 Data Quality (7 posts): ""Data Quality problems cost U.S. businesses more than $600 billion a year" - a report from 2002."
#9 Data Preparation (5 posts): "End-To-End Data Preparation with my new open source project: https://github.com/kuwala-io/kuwala"
#10 Excel (5 posts): "Standardizing Excel workbooks with different formats in R. Any tips on automating?"
Member Growth in r/datacleaning
Daily: -1 member (-0.0%)
Monthly: +1 member (0.0%)
Yearly: +49 members (1.1%)