METHODS IN LARGE SCALE MULTIPLE TESTING: MIXTURE NULL, SMALL SAMPLE REPLICATES, AND POWER BOOSTING

Ramos, Mark Louie Frac

METHODS IN LARGE SCALE MULTIPLE TESTING: MIXTURE NULL, SMALL SAMPLE REPLICATES, AND POWER BOOSTING

dc.contributor.advisor	Park, DoHwan
dc.contributor.author	Ramos, Mark Louie Frac
dc.contributor.department	Mathematics and Statistics
dc.contributor.program	Statistics
dc.date.accessioned	2022-09-29T15:38:19Z
dc.date.available	2022-09-29T15:38:19Z
dc.date.issued	2021-01-01
dc.description.abstract	In this dissertations, we study some methods in multiple testing. In the first topic, we consider the setting of gene expression experiments that use logfold change statistics where the null distribution is assumed to be a mixture of two normal distributions. An important issue in this setting is choosing the optimal interval of statistic values with which to estimate the null distribution. A modified cumulative sum changepoint detection criterion is constructed for this purpose and incorporated in three different methods for estimating local false discovery rate. In simulation studies, it is shown that two of those three methods successfully control false discovery rate (FDR). Both methods that controlled FDR produced better power than a baseline method. In the second topic, the problem of small sample replicates in logfold change-based experiments is addressed. A 2-stage method was constructed that addressed the magnitude of the signal and the variability of the signal separately. It is shown that the method controls false discovery rate, and that it performs competitively compared to a baseline method when there is considerable variability in the weighted counts of replicates coming from the alternative distribution. In the third topic, a new decision rule is proposed under some structural assumptions. When it can be assumed that the p-values of true nulls are uncorrelated, it is shown that this decision rule controls family-wise error rate (FWER) in the weak sense. Furthermore, under some conditions, simulation studies are presented to show that it controls false discovery rate in the strong sense. Most importantly, it is demonstrated using genome-wide association studies data how this method can be used as an ``add-on'' to existing FDR controlling methods in order to ``boost'' overall power.
dc.format	application:pdf
dc.genre	dissertations
dc.identifier	doi:10.13016/m2ez1s-k0j2
dc.identifier.other	12455
dc.identifier.uri	http://hdl.handle.net/11603/26033
dc.language	en
dc.relation.isAvailableAt	The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof	UMBC Mathematics and Statistics Department Collection
dc.relation.ispartof	UMBC Theses and Dissertations Collection
dc.relation.ispartof	UMBC Graduate School Collection
dc.relation.ispartof	UMBC Student Collection
dc.rights	This item may be protected under Title 17 of the U.S. Copyright Law. It is made available by UMBC for non-commercial research and education. For permission to publish or reproduce, please see http://aok.lib.umbc.edu/specoll/repro.php or contact Special Collections at speccoll(at)umbc.edu
dc.source	Original File Name: Ramos_umbc_0434D_12455.pdf
dc.title	METHODS IN LARGE SCALE MULTIPLE TESTING: MIXTURE NULL, SMALL SAMPLE REPLICATES, AND POWER BOOSTING
dc.type	Text
dcterms.accessRights	Distribution Rights granted to UMBC by the author.
dcterms.accessRights	Access limited to the UMBC community. Item may possibly be obtained via Interlibrary Loan thorugh a local library, pending author/copyright holder's permission.

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Ramos_umbc_0434D_12455.pdf
Size:: 1.56 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: Ramos-Mark Louie_Open.pdf
Size:: 257.28 KB
Format:: Adobe Portable Document Format
Description:

Download

Collections

UMBC Theses and Dissertations
UMBC Graduate School
UMBC Mathematics and Statistics Department
UMBC Student Collection