Double truncation method for controlling local false discovery rate in case of spiky null

Kim, Shinjune; Oh, Youngjae; Lim, Johan; Park, DoHwan; Green, Erin; Ramos, Mark L.; Jeong, Jaesik

Double truncation method for controlling local false discovery rate in case of spiky null

dc.contributor.author	Kim, Shinjune
dc.contributor.author	Oh, Youngjae
dc.contributor.author	Lim, Johan
dc.contributor.author	Park, DoHwan
dc.contributor.author	Green, Erin
dc.contributor.author	Ramos, Mark L.
dc.contributor.author	Jeong, Jaesik
dc.date.accessioned	2026-01-06T20:51:39Z
dc.date.issued	2025-02-01
dc.description.abstract	Many multiple test procedures, which control the false discovery rate, have been developed to identify some cases (e.g. genes) showing statistically significant difference between two different groups. However, a common issue encountered in some practical data sets is the presence of highly spiky null distributions. Existing methods struggle to control type I error in such cases due to the “inflated false positives," but this problem has not been addressed in previous literature. Our team recently encountered this issue while analyzing SET4 gene deletion data and proposed modeling the null distribution using a scale mixture normal distribution. However, the use of this approach is limited due to strong assumptions on the spiky peak. In this paper, we present a novel multiple test procedure that can be applied to any type of spiky peak data, including situations with no spiky peak or with one or two spiky peaks. Our approach involves truncating the central statistics around 0, which primarily contribute to the null spike, as well as the two tails that may be contaminated by alternative distributions. We refer to this method as the “double truncation method." After applying double truncation, we estimate the null density using the doubly truncated maximum likelihood estimator. We demonstrate numerically that our proposed method effectively controls the false discovery rate at the desired level using simulated data. Furthermore, we apply our method to two real data sets, namely the SET protein data and peony data.
dc.description.sponsorship	This research was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (NRF2018R1D1A1B0704237214) and Brain Pool Program by National Research Foundation of Korea funded by the Ministry of Education (NRF-2022H1D3A2A01063793).
dc.description.uri	https://link.springer.com/article/10.1007/s00180-024-01510-4
dc.format.extent	22 pages
dc.genre	journal articles
dc.identifier	doi:10.13016/m2bym0-qx5x
dc.identifier.citation	Kim, Shinjune, Youngjae Oh, Johan Lim, et al. “Double Truncation Method for Controlling Local False Discovery Rate in Case of Spiky Null.” Computational Statistics 40, no. 2 (2025): 745–66. https://doi.org/10.1007/s00180-024-01510-4.
dc.identifier.uri	https://doi.org/10.1007/s00180-024-01510-4
dc.identifier.uri	http://hdl.handle.net/11603/41345
dc.language.iso	en
dc.publisher	Springer Nature
dc.relation.isAvailableAt	The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof	UMBC Mathematics and Statistics Department
dc.relation.ispartof	UMBC Faculty Collection
dc.relation.ispartof	UMBC Biological Sciences Department
dc.rights	This work was written as part of one of the author's official duties as an Employee of the United States Government and is therefore a work of the United States Government. In accordance with 17 U.S.C. 105, no copyright protection is available for such works under U.S. Law.
dc.rights	Public Domain
dc.rights.uri	https://creativecommons.org/publicdomain/mark/1.0/
dc.subject	Doubly truncated maximum likelihood estimator
dc.subject	Multiple testing
dc.subject	Tail-area FDR
dc.subject	SET protein data
dc.subject	Spiky null
dc.subject	Local false discovery rate (FDR)
dc.title	Double truncation method for controlling local false discovery rate in case of spiky null
dc.type	Text

Files

Original bundle

Now showing 1 - 2 of 2

Name:: s00180024015104.pdf
Size:: 3.62 MB
Format:: Adobe Portable Document Format

Download

Name:: 18020241510MOESM1ESM.pdf
Size:: 726.08 KB
Format:: Adobe Portable Document Format

Download

Collections

UMBC Mathematics and Statistics Department
UMBC Biological Sciences Department
UMBC Faculty Collection