A Comparative Study of Data Cleaning Tools

dc.contributor.author: Oni, Samson
dc.contributor.author: Chen, Zhiyuan
dc.contributor.author: Hoban, Susan
dc.contributor.author: Jademi, Onimi Precious
dc.date.accessioned: 2025-06-05T14:03:29Z
dc.date.available: 2025-06-05T14:03:29Z
dc.date.issued: 2019-10-01
dc.description.abstract: In the information era, data is crucial in decision making. Most data sets contain impurities that need to be weeded out before any meaningful decision can be made from the data. Hence, data cleaning is essential and often takes more than 80 percent of the data analyst's time and resources. Adequate tools and techniques must be used for data cleaning. Many data cleaning tools exist, but it is unclear how to choose among them in various situations. This research aims at helping researchers and organizations choose the right tools for data cleaning. This article conducts a comparative study of four commonly used data cleaning tools on two real data sets and answers the research question of which tool is useful in different scenarios.
dc.description.uri: https://www.igi-global.com/gateway/article/237137
dc.format.extent: 18 pages
dc.genre: journal articles
dc.identifier: doi:10.13016/m2jd1u-omvi
dc.identifier.citation: Oni, Samson, Zhiyuan Chen, Susan Hoban, and Onimi Jademi. “A Comparative Study of Data Cleaning Tools.” International Journal of Data Warehousing and Mining (IJDWM) 15, no. 4 (October 1, 2019): 48–65. https://doi.org/10.4018/IJDWM.2019100103.
dc.identifier.uri: https://doi.org/10.4018/IJDWM.2019100103
dc.identifier.uri: http://hdl.handle.net/11603/38714
dc.language.iso: en_US
dc.publisher: IGI Global
dc.relation.isAvailableAt: The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof: UMBC Faculty Collection
dc.relation.ispartof: UMBC College of Engineering and Information Technology Dean's Office
dc.relation.ispartof: UMBC Information Systems Department
dc.relation.ispartof: UMBC Student Collection
dc.relation.ispartof: UMBC Joint Center for Earth Systems Technology (JCET)
dc.relation.ispartof: UMBC Mathematics and Statistics Department
dc.rights: This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.subject: UMBC Mobile, Pervasive and Sensor Computing Lab (MPSC Lab)
dc.subject: UMBC Cybersecurity Institute
dc.subject: UMBC Accelerated Cognitive Cybersecurity Laboratory
dc.subject: UMBC High Performance Computing Facility (HPCF)
dc.title: A Comparative Study of Data Cleaning Tools
dc.type: Text
dcterms.creator: https://orcid.org/0000-0002-6984-7248
dcterms.creator: https://orcid.org/0000-0001-9206-1297

Files

Original bundle

Name: AComparativeStudyofDataCleaningTools.pdf
Size: 745.79 KB
Format: Adobe Portable Document Format