Evaluation of Traditional and Deep Clustering Algorithms for Multivariate Spatio-Temporal Data

dc.contributor.author: Nji, Francis Ndikum
dc.contributor.author: Salvi, Rohan Mandar
dc.contributor.author: Tirumala, Sai Sri Kuram
dc.contributor.author: Wang, Jianwu
dc.contributor.author: Zheng, Xue
dc.date.accessioned: 2025-10-29T19:14:54Z
dc.date.issued: 2024-10-28
dc.description: Seventh IEEE International Workshop on Benchmarking, Performance Tuning and Optimization for Big Data Applications (BPOD 2024), December 15-18, 2024, Washington DC, United States
dc.description.abstract: Spatiotemporal data is commonly available in many disciplines such as atmospheric science, Earth science, and environmental science, and is generated by monitoring a certain area over a period of time. Analyzing such high-dimensional data is critical for uncovering hidden patterns, and one important approach is to categorize it along the temporal dimension into smaller groups. While classical methods like K-means and Gaussian Mixture Models (GMM) are favored for their simplicity and interpretability, they encounter challenges in modeling the complex, high-dimensional relationships inherent in nonlinear spatiotemporal data. In contrast, deep clustering algorithms that combine neural networks with unsupervised learning objectives excel by learning latent representations that better capture nonlinear spatiotemporal dependencies. This study provides a rigorous evaluation of both traditional and deep clustering algorithms on high-dimensional multivariate spatiotemporal climate datasets. Our comparative study examines the performance of these techniques across synthetic and real-world datasets, assessing clustering accuracy and stability. We emphasize the advantages of deep clustering, particularly in applications such as climate data analysis and traffic flow prediction, where mining and understanding nonlinear high-dimensional correlations are critical. The results demonstrate that while traditional clustering algorithms are effective for basic tasks, deep learning-based approaches outperform them in managing the complex nonlinear patterns present in high-dimensional multivariate spatiotemporal data.
dc.description.sponsorship: This work is supported by the DOE Office of Science Early Career Research Program. This work was performed under the auspices of the U.S. Department of Energy (DOE) by LLNL under contract DE-AC52-07NA27344. LLNL-CONF-870933.
dc.description.uri: https://www.osti.gov/servlets/purl/2519314
dc.format.extent: 12 pages
dc.genre: conference papers and proceedings
dc.identifier: doi:10.13016/m2kkwt-w731
dc.identifier.citation: Nji, Ndikum Francis, Rohan Mandar Salvi, Sai Sri Ram Kuram Tirumala, Jianwu Wang, and Xue Zheng. “Evaluation of Traditional and Deep Clustering Algorithms for Multivariate Spatio-Temporal Data.” Lawrence Livermore National Laboratory, October 28, 2024.
dc.identifier.uri: http://hdl.handle.net/11603/40686
dc.language.iso: en
dc.publisher: Lawrence Livermore National Laboratory
dc.relation.isAvailableAt: The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof: UMBC Student Collection
dc.relation.ispartof: UMBC Joint Center for Earth Systems Technology (JCET)
dc.relation.ispartof: UMBC Center for Accelerated Real Time Analysis
dc.relation.ispartof: UMBC Faculty Collection
dc.relation.ispartof: UMBC Information Systems Department
dc.relation.ispartof: UMBC Center for Real-time Distributed Sensing and Autonomy
dc.relation.ispartof: UMBC Computer Science and Electrical Engineering Department
dc.relation.ispartof: UMBC GESTAR II
dc.relation.ispartof: iHARP NSF HDR Institute for Harnessing Data and Model Revolution in the Polar Regions
dc.rights: This work was written as part of one of the author's official duties as an Employee of the United States Government and is therefore a work of the United States Government. In accordance with 17 U.S.C. 105, no copyright protection is available for such works under U.S. Law.
dc.rights: Public Domain
dc.rights.uri: https://creativecommons.org/publicdomain/mark/1.0/
dc.subject: UMBC Big Data Analytics Lab
dc.title: Evaluation of Traditional and Deep Clustering Algorithms for Multivariate Spatio-Temporal Data
dc.type: Text
dcterms.creator: https://orcid.org/0009-0009-6559-4659
dcterms.creator: https://orcid.org/0000-0002-9933-1170

Files

Original bundle

Name: 2519314.pdf
Size: 3.33 MB
Format: Adobe Portable Document Format