Evaluation of Traditional and Deep Clustering Algorithms for Multivariate Spatio-Temporal Data
| dc.contributor.author | Nji, Francis Ndikum | |
| dc.contributor.author | Salvi, Rohan Mandar | |
| dc.contributor.author | Tirumala, Sai Sri Kuram | |
| dc.contributor.author | Wang, Jianwu | |
| dc.contributor.author | Zheng, Xue | |
| dc.date.accessioned | 2025-10-29T19:14:54Z | |
| dc.date.issued | 2024-10-28 | |
| dc.description | Seventh IEEE International Workshop on Benchmarking, Performance Tuning and Optimization for Big Data Applications (BPOD 2024), December 15-18, 2024, Washington DC, United States | |
| dc.description.abstract | Spatiotemporal data is commonly available in many disciplines such as atmospheric science, Earth sciences and environmental science, and is generated by monitoring a certain area over a period of time. Analyzing such high-dimensional data is critical for uncovering hidden patterns, and one important approach is to categorize it along the temporal dimension into smaller groups. While classical methods like K-means and Gaussian Mixture Models (GMM) are favored for their simplicity and interpretability, they encounter challenges in modeling complex, high-dimensional relationships inherent in nonlinear spatiotemporal data. In contrast, deep clustering algorithms that combine neural networks with unsupervised learning objectives excel by learning latent representations that better capture nonlinear spatiotemporal dependencies. This study provides a rigorous evaluation of both traditional and deep clustering algorithms on high-dimensional multivariate spatiotemporal climate datasets. Our comparative study examines the performance of these techniques across synthetic and real-world datasets, assessing clustering accuracy and stability. We emphasize the advantages of deep clustering, particularly in applications such as climate data analysis and traffic flow prediction, where mining and understanding nonlinear high-dimensional correlations are critical. The results demonstrate that while traditional clustering algorithms are effective for basic tasks, deep learning-based approaches outperform them in managing complex nonlinear patterns present in high-dimensional multivariate spatiotemporal data. | |
| dc.description.sponsorship | This work is supported by the DOE Office of Science Early Career Research Program. This work was performed under the auspices of the U.S. Department of Energy (DOE) by LLNL under contract DE-AC52-07NA27344. LLNL-CONF-870933. | |
| dc.description.uri | https://www.osti.gov/servlets/purl/2519314 | |
| dc.format.extent | 12 pages | |
| dc.genre | conference papers and proceedings | |
| dc.identifier | doi:10.13016/m2kkwt-w731 | |
| dc.identifier.citation | Nji, Ndikum Francis, Rohan Mandar Salvi, Sai Sri Ram Kuram Tirumala, Jianwu Wang, and Xue Zheng. “Evaluation of Traditional and Deep Clustering Algorithms for Multivariate Spatio-Temporal Data.” Lawrence Livermore National Laboratory, October 28, 2024. | |
| dc.identifier.uri | http://hdl.handle.net/11603/40686 | |
| dc.language.iso | en | |
| dc.publisher | Lawrence Livermore National Laboratory | |
| dc.relation.isAvailableAt | The University of Maryland, Baltimore County (UMBC) | |
| dc.relation.ispartof | UMBC Student Collection | |
| dc.relation.ispartof | UMBC Joint Center for Earth Systems Technology (JCET) | |
| dc.relation.ispartof | UMBC Center for Accelerated Real Time Analysis | |
| dc.relation.ispartof | UMBC Faculty Collection | |
| dc.relation.ispartof | UMBC Information Systems Department | |
| dc.relation.ispartof | UMBC Center for Real-time Distributed Sensing and Autonomy | |
| dc.relation.ispartof | UMBC Computer Science and Electrical Engineering Department | |
| dc.relation.ispartof | UMBC GESTAR II | |
| dc.relation.ispartof | iHARP NSF HDR Institute for Harnessing Data and Model Revolution in the Polar Regions | |
| dc.rights | This work was written as part of one of the author's official duties as an Employee of the United States Government and is therefore a work of the United States Government. In accordance with 17 U.S.C. 105, no copyright protection is available for such works under U.S. Law. | |
| dc.rights | Public Domain | |
| dc.rights.uri | https://creativecommons.org/publicdomain/mark/1.0/ | |
| dc.subject | UMBC Big Data Analytics Lab | |
| dc.title | Evaluation of Traditional and Deep Clustering Algorithms for Multivariate Spatio-Temporal Data | |
| dc.type | Text | |
| dcterms.creator | https://orcid.org/0009-0009-6559-4659 | |
| dcterms.creator | https://orcid.org/0000-0002-9933-1170 | |
Collections
UMBC Student Collection
iHARP: NSF HDR Institute for Harnessing Data and Model Revolution in the Polar Regions
UMBC Center for Accelerated Real Time Analysis
UMBC Center for Real-time Distributed Sensing and Autonomy
UMBC Computer Science and Electrical Engineering Department
UMBC Faculty Collection
