Double truncation method for controlling local false discovery rate in case of spiky null

dc.contributor.authorKim, Shinjune
dc.contributor.authorOh, Youngjae
dc.contributor.authorLim, Johan
dc.contributor.authorPark, DoHwan
dc.contributor.authorGreen, Erin
dc.contributor.authorRamos, Mark L.
dc.contributor.authorJeong, Jaesik
dc.date.accessioned2026-01-06T20:51:39Z
dc.date.issued2025-02-01
dc.description.abstractMany multiple test procedures, which control the false discovery rate, have been developed to identify some cases (e.g. genes) showing statistically significant difference between two different groups. However, a common issue encountered in some practical data sets is the presence of highly spiky null distributions. Existing methods struggle to control type I error in such cases due to the “inflated false positives," but this problem has not been addressed in previous literature. Our team recently encountered this issue while analyzing SET4 gene deletion data and proposed modeling the null distribution using a scale mixture normal distribution. However, the use of this approach is limited due to strong assumptions on the spiky peak. In this paper, we present a novel multiple test procedure that can be applied to any type of spiky peak data, including situations with no spiky peak or with one or two spiky peaks. Our approach involves truncating the central statistics around 0, which primarily contribute to the null spike, as well as the two tails that may be contaminated by alternative distributions. We refer to this method as the “double truncation method." After applying double truncation, we estimate the null density using the doubly truncated maximum likelihood estimator. We demonstrate numerically that our proposed method effectively controls the false discovery rate at the desired level using simulated data. Furthermore, we apply our method to two real data sets, namely the SET protein data and peony data.
dc.description.sponsorshipThis research was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (NRF2018R1D1A1B0704237214) and Brain Pool Program by National Research Foundation of Korea funded by the Ministry of Education (NRF-2022H1D3A2A01063793).
dc.description.urihttps://link.springer.com/article/10.1007/s00180-024-01510-4
dc.format.extent22 pages
dc.genrejournal articles
dc.identifierdoi:10.13016/m2bym0-qx5x
dc.identifier.citationKim, Shinjune, Youngjae Oh, Johan Lim, et al. “Double Truncation Method for Controlling Local False Discovery Rate in Case of Spiky Null.” Computational Statistics 40, no. 2 (2025): 745–66. https://doi.org/10.1007/s00180-024-01510-4.
dc.identifier.urihttps://doi.org/10.1007/s00180-024-01510-4
dc.identifier.urihttp://hdl.handle.net/11603/41345
dc.language.isoen
dc.publisherSpringer Nature
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Mathematics and Statistics Department
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Biological Sciences Department
dc.rightsThis work was written as part of one of the author's official duties as an Employee of the United States Government and is therefore a work of the United States Government. In accordance with 17 U.S.C. 105, no copyright protection is available for such works under U.S. Law.
dc.rightsPublic Domain
dc.rights.urihttps://creativecommons.org/publicdomain/mark/1.0/
dc.subjectDoubly truncated maximum likelihood estimator
dc.subjectMultiple testing
dc.subjectTail-area FDR
dc.subjectSET protein data
dc.subjectSpiky null
dc.subjectLocal false discovery rate (FDR)
dc.titleDouble truncation method for controlling local false discovery rate in case of spiky null
dc.typeText

Files

Original bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
s00180024015104.pdf
Size:
3.62 MB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
18020241510MOESM1ESM.pdf
Size:
726.08 KB
Format:
Adobe Portable Document Format