Reducing False Discoveries in Statistically-Significant Regional-Colocation Mining: A Summary of Results

dc.contributor.authorGhosh, Subhankar
dc.contributor.authorGupta, Jayant
dc.contributor.authorSharma, Arun
dc.contributor.authorAn, Shuai
dc.contributor.authorShekhar, Shashi
dc.date.accessioned2025-10-29T19:15:00Z
dc.date.issued2023-09-07
dc.descriptionInternational Conference on Geographic Information Science (GIScience) 2023, September 12-15, 2023, Leeds, UK
dc.description.abstractGiven a set S of spatial feature types, its feature instances, a study area, and a neighbor relationship, the goal is to find pairs <a region (r*), a subset C of S> such that C is a statistically significant regional-colocation pattern in r*. This problem is important for applications in various domains including ecology, economics, and sociology. The problem is computationally challenging due to the exponential number of regional colocation patterns and candidate regions. Previously, we proposed a miner [Subhankar et. al, 2022] that finds statistically significant regional colocation patterns. However, the numerous simultaneous statistical inferences raise the risk of false discoveries (also known as the multiple comparisons problem) and carry a high computational cost. We propose a novel algorithm, namely, multiple comparisons regional colocation miner (MultComp-RCM) which uses a Bonferroni correction. Theoretical analysis, experimental evaluation, and case study results show that the proposed method reduces both the false discovery rate and computational cost. *abstract=g
dc.description.sponsorshipThis material is based upon work supported by the National Science Foundation under Grants No. 2118285, 2040459, 1901099, and 1916518
dc.description.urihttps://drops.dagstuhl.de/entities/document/10.4230/LIPIcs.GIScience.2023.3
dc.format.extent18 pages
dc.genreconference papers and proceedings
dc.identifierdoi:10.13016/m27hop-1vem
dc.identifier.citationGhosh, Subhankar, Jayant Gupta, Arun Sharma, Shuai An, and Shashi Shekhar. “Reducing False Discoveries in Statistically-Significant Regional-Colocation Mining: A Summary of Results.” 12th International Conference on Geographic Information Science (GIScience 2023), Leibniz International Proceedings in Informatics (LIPIcs), vol. 277 (2023): 3:1-3:18. https://doi.org/10.4230/LIPIcs.GIScience.2023.3.
dc.identifier.urihttps://doi.org/10.4230/LIPIcs.GIScience.2023.3
dc.identifier.urihttp://hdl.handle.net/11603/40705
dc.language.isoen
dc.publisherDagstuhl Publishing
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofiHARP NSF HDR Institute for Harnessing Data and Model Revolution in the Polar Regions
dc.rightsAttribution 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subjectParticipation index
dc.subjectColocation pattern
dc.subjectMultiple comparisons problem
dc.subjectSpatial heterogeneity
dc.subjectStatistical significance
dc.titleReducing False Discoveries in Statistically-Significant Regional-Colocation Mining: A Summary of Results
dc.typeText

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
LIPIcsGIScience20233.pdf
Size:
3.74 MB
Format:
Adobe Portable Document Format