METHODS IN LARGE SCALE MULTIPLE TESTING: MIXTURE NULL, SMALL SAMPLE REPLICATES, AND POWER BOOSTING

dc.contributor.advisorPark, DoHwan
dc.contributor.authorRamos, Mark Louie Frac
dc.contributor.departmentMathematics and Statistics
dc.contributor.programStatistics
dc.date.accessioned2022-09-29T15:38:19Z
dc.date.available2022-09-29T15:38:19Z
dc.date.issued2021-01-01
dc.description.abstractIn this dissertations, we study some methods in multiple testing. In the first topic, we consider the setting of gene expression experiments that use logfold change statistics where the null distribution is assumed to be a mixture of two normal distributions. An important issue in this setting is choosing the optimal interval of statistic values with which to estimate the null distribution. A modified cumulative sum changepoint detection criterion is constructed for this purpose and incorporated in three different methods for estimating local false discovery rate. In simulation studies, it is shown that two of those three methods successfully control false discovery rate (FDR). Both methods that controlled FDR produced better power than a baseline method. In the second topic, the problem of small sample replicates in logfold change-based experiments is addressed. A 2-stage method was constructed that addressed the magnitude of the signal and the variability of the signal separately. It is shown that the method controls false discovery rate, and that it performs competitively compared to a baseline method when there is considerable variability in the weighted counts of replicates coming from the alternative distribution. In the third topic, a new decision rule is proposed under some structural assumptions. When it can be assumed that the p-values of true nulls are uncorrelated, it is shown that this decision rule controls family-wise error rate (FWER) in the weak sense. Furthermore, under some conditions, simulation studies are presented to show that it controls false discovery rate in the strong sense. Most importantly, it is demonstrated using genome-wide association studies data how this method can be used as an ``add-on'' to existing FDR controlling methods in order to ``boost'' overall power.
dc.formatapplication:pdf
dc.genredissertations
dc.identifierdoi:10.13016/m2ez1s-k0j2
dc.identifier.other12455
dc.identifier.urihttp://hdl.handle.net/11603/26033
dc.languageen
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Mathematics and Statistics Department Collection
dc.relation.ispartofUMBC Theses and Dissertations Collection
dc.relation.ispartofUMBC Graduate School Collection
dc.relation.ispartofUMBC Student Collection
dc.rightsThis item may be protected under Title 17 of the U.S. Copyright Law. It is made available by UMBC for non-commercial research and education. For permission to publish or reproduce, please see http://aok.lib.umbc.edu/specoll/repro.php or contact Special Collections at speccoll(at)umbc.edu
dc.sourceOriginal File Name: Ramos_umbc_0434D_12455.pdf
dc.titleMETHODS IN LARGE SCALE MULTIPLE TESTING: MIXTURE NULL, SMALL SAMPLE REPLICATES, AND POWER BOOSTING
dc.typeText
dcterms.accessRightsDistribution Rights granted to UMBC by the author.
dcterms.accessRightsAccess limited to the UMBC community. Item may possibly be obtained via Interlibrary Loan thorugh a local library, pending author/copyright holder's permission.

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Ramos_umbc_0434D_12455.pdf
Size:
1.56 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Ramos-Mark Louie_Open.pdf
Size:
257.28 KB
Format:
Adobe Portable Document Format
Description: