Result: Multi-approaches on scrubbing data for medium-sized enterprises

Title:
Multi-approaches on scrubbing data for medium-sized enterprises
Authors:
Publication Year:
2019
Document Type:
Conference conference object
File Description:
text
Language:
English
Relation:
https://research.skylineuniversity.ac.ae/id/eprint/55/1/53.pdf; Faiz, T (2019) Multi-approaches on scrubbing data for medium-sized enterprises. In: 2019 International Conference on Digitization (ICD), 18-19 November 2019, Sharjah, United Arab Emirates.
DOI:
10.1109/ICD47981.2019.9105739
Accession Number:
edsbas.6B04A2F3
Database:
BASE

Further Information

Tidy and fit for purpose data are the prerequisite for analyzing data and for guaranteeing good business decisions. Data Scrubbing or data cleaning is the process of identifying errors and inconsistencies in the data and fixing these errors before analyzing the data. Organization's decisions rely on Data Quality which makes data scrubbing a very important step towards their productivity. Untidy data includes; importing data from multiple sources, missing values or corrupt records, data types mismatch, special character removal or discarding duplicates. Current research is lacking the latest data scrubbing techniques practiced by the medium sized enterprises. This article highlights possible data errors, literature review, and data science project life cycle. The document explains how to clean data using Python libraries for exploratory data analysis such as Pandas, NumPy, Scikit- Learn and libraries for data visualization for example matplotlib, Seaborn, and Plotly.