Spatio-Temporal Multivariate Weather Data Clustering Using DBSCAN And K-MEDOIDS Methods

Author/Creator

Author/Creator ORCID

Date

2023-01-01

Department

Information Systems

Program

Information Systems

Citation of Original Publication

Rights

This item may be protected under Title 17 of the U.S. Copyright Law. It is made available by UMBC for non-commercial research and education. For permission to publish or reproduce, please see http://aok.lib.umbc.edu/specoll/repro.php or contact Special Collections at speccoll(at)umbc.edu
Distribution Rights granted to UMBC by the author.
Access limited to the UMBC community. Item may possibly be obtained via Interlibrary Loan through a local library, pending author/copyright holder's permission.

Abstract

This thesis examines the efficacy of two well-known data clustering techniques, DBSCAN and K-Medoids, in categorizing spatio-temporal multivariate weather data obtained from disciplines such as atmospheric science, Earth sciences, and environmental science. The data, which is generated by monitoring specific regions over a period of time, typically consists of four dimensions: time, longitude, latitude, and variables such as temperature and wind speed. The temporal dimension is used as the basis for clustering the data. The study proposes new quantitative metrics to evaluate the results of the clustering process. The findings indicate that while popular clustering algorithms are effective in handling simple synthetic data, they face challenges when applied to complex real-world data. Furthermore, the results show that as the number of variables in the dataset increases, the performance of the clustering methods worsens.
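
To illustrate what clustering along the temporal dimension can look like, the following is a minimal sketch and not the thesis's implementation. It assumes the data is held in a NumPy array of shape (time, longitude, latitude, variable), flattens each time step into one feature vector, and groups the time steps with a simple alternating K-Medoids procedure. The function names, the farthest-first initialization, the Euclidean distance, and the synthetic two-regime data are all assumptions made for this example.

```python
import numpy as np


def flatten_time_steps(data):
    """Reshape (time, lon, lat, var) into (time, lon * lat * var)."""
    return data.reshape(data.shape[0], -1)


def k_medoids(X, k, n_iter=50):
    """Simple alternating K-Medoids (hypothetical helper, not the thesis code).

    Returns one cluster label per row of X and the row indices of the medoids.
    """
    # Pairwise Euclidean distances between time steps.
    dist = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)

    # Deterministic farthest-first initialization (an assumption of this sketch):
    # start from the most central point, then add the point farthest from
    # the medoids chosen so far.
    medoids = [int(np.argmin(dist.sum(axis=1)))]
    while len(medoids) < k:
        nearest = dist[:, medoids].min(axis=1)
        medoids.append(int(np.argmax(nearest)))
    medoids = np.array(medoids)

    for _ in range(n_iter):
        # Assignment step: each time step joins its nearest medoid.
        labels = np.argmin(dist[:, medoids], axis=1)
        new_medoids = medoids.copy()
        for c in range(k):
            members = np.flatnonzero(labels == c)
            if members.size == 0:
                continue  # keep the old medoid if a cluster becomes empty
            # Update step: pick the member with the smallest total distance
            # to the other members of its cluster.
            costs = dist[np.ix_(members, members)].sum(axis=1)
            new_medoids[c] = members[np.argmin(costs)]
        if np.array_equal(new_medoids, medoids):
            break
        medoids = new_medoids

    labels = np.argmin(dist[:, medoids], axis=1)
    return labels, medoids


# Synthetic example: 20 time steps on a 4 x 3 grid with 2 variables,
# drawn from two clearly separated regimes of 10 time steps each.
rng = np.random.default_rng(0)
regime_a = rng.normal(0.0, 0.1, size=(10, 4, 3, 2))
regime_b = rng.normal(10.0, 0.1, size=(10, 4, 3, 2))
data = np.concatenate([regime_a, regime_b], axis=0)

X = flatten_time_steps(data)
labels, medoids = k_medoids(X, k=2)
```

In this sketch each time step becomes a single point in a space whose dimension grows with the grid size and the number of variables. With real data the variables would usually be put on a comparable scale before computing distances, a step this example omits.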