The efficient design of Nested Group Testing algorithms for disease identification in clustered data

dc.contributor.authorBest, Ana F.
dc.contributor.authorMalinovsky, Yaakov
dc.contributor.authorAlbert, Paul S.
dc.date.accessioned2023-10-30T14:36:52Z
dc.date.available2023-10-30T14:36:52Z
dc.date.issued2022-05-09
dc.description.abstractGroup testing study designs have been used since the 1940s to reduce screening costs for uncommon diseases; for rare diseases, all cases are identifiable with substantially fewer tests than the population size. Substantial research has identified efficient designs under this paradigm. However, little work has focused on the important problem of disease screening among clustered data, such as geographic heterogeneity in HIV prevalence. We evaluated designs where we first estimate disease prevalence and then apply efficient group testing algorithms using these estimates. Specifically, we evaluate prevalence using individual testing on a fixed-size subset of each cluster and use these prevalence estimates to choose group sizes that minimize the corresponding estimated average number of tests per subject. We compare designs where we estimate cluster-specific prevalences as well as a common prevalence across clusters, use different group testing algorithms, construct groups from individuals within and in different clusters, and consider misclassification. For diseases with low prevalence, our results suggest that accounting for clustering is unnecessary. However, for diseases with higher prevalence and sizeable between-cluster heterogeneity, accounting for clustering in study design and implementation improves efficiency. We consider the practical aspects of our design recommendations with two examples with strong clustering effects: (1) Identification of HIV carriers in the US population and (2) Laboratory screening of anti-cancer compounds using cell lines.en_US
dc.description.sponsorshipThis work was supported by the Intramural Research Program of the National Cancer Institute. The views presented in this article are those of the authors and should not be viewed as official opinions or positions of the National Cancer Institute, National Institutes of Health, or US Department of Health and Human Services. The research of YM was supported by grant number 2020063 from the United States-Israel Binational Science Foundation (BSF), Jerusalem, Israel.en_US
dc.description.urihttps://www.tandfonline.com/doi/abs/10.1080/02664763.2022.2071419en_US
dc.format.extent18 pagesen_US
dc.genrejournal articlesen_US
dc.identifierdoi:10.13016/m2ypxv-hj2o
dc.identifier.citationBest, Ana F., Yaakov Malinovsky, and Paul S. Albert. “The Efficient Design of Nested Group Testing Algorithms for Disease Identification in Clustered Data.” Journal of Applied Statistics 50, no. 10 (July 27, 2023): 2228–45. https://doi.org/10.1080/02664763.2022.2071419.en_US
dc.identifier.urihttps://doi.org/10.1080/02664763.2022.2071419
dc.identifier.urihttp://hdl.handle.net/11603/30444
dc.language.isoen_USen_US
dc.publisherTaylor & Francisen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Mathematics Department Collection
dc.relation.ispartofUMBC Faculty Collection
dc.rightsThis work was written as part of one of the author's official duties as an Employee of the United States Government and is therefore a work of the United States Government. In accordance with 17 U.S.C. 105, no copyright protection is available for such works under U.S. Law.en_US
dc.rightsPublic Domain Mark 1.0*
dc.rights.urihttp://creativecommons.org/publicdomain/mark/1.0/*
dc.titleThe efficient design of Nested Group Testing algorithms for disease identification in clustered dataen_US
dc.typeTexten_US
dcterms.creatorhttps://orcid.org/0000-0003-2888-674Xen_US

Files

Original bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
CJAS_50_2071419.pdf
Size:
1.86 MB
Format:
Adobe Portable Document Format
Description:
Loading...
Thumbnail Image
Name:
Supplementary Materials.pdf
Size:
2.77 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: