Clustering of Multidimensional Data Sets with Applications to Spatial Distributions of Ribosomal Proteins

dc.contributor.authorMistry, Nil
dc.contributor.authorRamsey, Jordan
dc.contributor.authorWiley, Benjamin
dc.contributor.authorYanchuck, Jackie
dc.contributor.authorHuang, Xuan
dc.contributor.authorRaim, Andrew
dc.contributor.authorGobbert, Matthias K.
dc.contributor.authorNeerchal, Nagaraj K.
dc.contributor.authorFarabaugh, Philip J.
dc.date.accessioned2018-10-01T13:52:51Z
dc.date.available2018-10-01T13:52:51Z
dc.date.issued2013
dc.description.abstractConsider ribosomal proteins, each with a three-dimensional spatial location. Proteins related to the cofactor phenotype may be randomly or non-randomly distributed within the ribosome. To investigate this question, the Mahalanobis distance is computed between each pair of protein locations, and the optimal pairing is determined by minimizing the sum of the within-pair distances. Since no single code exists that allows for the computation of Mahalanobis distances, determining the optimal pairing, and determining whether the two groups are statistically different, we created a code that allows a user to do just this. The user can also compute an exact p-value for this distribution rather than rely on an approximation.en_US
dc.description.sponsorshipThese results were obtained as part of the REU Site: Interdisciplinary Program in High Performance Computing (www.umbc.edu/hpcreu) in the Department of Mathematics and Statistics at the University of Maryland, Baltimore County (UMBC) in Summer 2013. This program is funded jointly by the National Science Foundation and the National Security Agency (NSF grant no. DMS–1156976), with additional support from UMBC, the Department of Mathematics and Statistics, the Center for Interdisciplinary Research and Consulting (CIRC), and the UMBC High Performance Computing Facility (HPCF). HPCF (www.umbc.edu/hpcf) is supported by the National Science Foundation through the MRI program (grant nos. CNS–0821258 and CNS–1228778) and the SCREMS program (grant no. DMS–0821311), with additional substantial support from UMBC. Co-author Jordan Ramsey was supported, in part, by the UMBC National Security Agency (NSA) Scholars Program though a contract with the NSA. Graduate RAs Xuan Huang and Andrew Raim were supported by UMBC as HPCF RAs.en_US
dc.description.urihttps://userpages.umbc.edu/~gobbert/papers/REU2013Team3Bio.pdfen_US
dc.format.extent10 pagesen_US
dc.genretechical reporten_US
dc.identifierdoi:10.13016/M23X83Q0K
dc.identifier.urihttp://hdl.handle.net/11603/11410
dc.language.isoen_USen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Mathematics Department Collection
dc.relation.ispartofUMBC Biological Sciences Department
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Student Collection
dc.relation.ispartofseriesHPCF Technical Report;HPCF-2013-10
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.subjectribosomal proteinsen_US
dc.subjectMahalanobis distanceen_US
dc.subjectUMBC High Performance Computing Facility (HPCF)en_US
dc.subjectProteins related to the cofactor phenotype
dc.subjectcomputation of Mahalanobis distances
dc.subjectdetermining the optimal pairing
dc.titleClustering of Multidimensional Data Sets with Applications to Spatial Distributions of Ribosomal Proteinsen_US
dc.typeTexten_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
REU2013Team3Bio.pdf
Size:
427.24 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.68 KB
Format:
Item-specific license agreed upon to submission
Description: