Utilizing Latent Space Representation for Disease Phenotyping and Patient Risk Stratification

dc.contributor.advisorDr. Aijuan Dong
dc.contributor.authorFerrer, Sophia Isabel
dc.contributor.departmentHood College Computer Science and Information Technology
dc.contributor.programHood College Departmental Honors
dc.date.accessioned2025-04-25T20:15:12Z
dc.date.available2025-04-25T20:15:12Z
dc.date.issued2025-04-25
dc.description.abstractObstructive sleep apnea (OSA) is a common sleep-related disorder characterized by intermittent breathing pauses during sleep, which can significantly increase the risk of cardiovascular and metabolic diseases. The often undiagnosed nature of OSA, coupled with the difficulty in identifying patients most at risk for associated comorbidities, has led to sub-optimal personalized patient care. While previous studies have established a correlation between OSA and various comorbidities, the complexity and inconsistency of clinical data in electronic health records (EHR) pose challenges in deriving reliable results in healthcare studies. In this paper, we extracted and compared learned latent spaces-- a compressed representation of input data used to uncover hidden patterns-- using methods such as such as Autoencoders, Uniform Manifold Approximation and Projection (UMAP) and Principal Component Analysis (PCA) to filter out the noise and irrelevant details from the EHR data. We then deep phenotyped OSA patients through unsupervised clustering using the latent representation, identified patient subgroups and uncover potential risk factors that drive subgroup differentiation, and developed a clinical tool to predict patient group assignment via supervised learning. These findings enhance the understanding of OSA deep phenotyping and improve patient comorbidity risk assessment.
dc.genreDepartmental Honors Paper
dc.identifierdoi:10.13016/m2rn4h-pydy
dc.identifier.urihttp://hdl.handle.net/11603/38125
dc.language.isoen_US
dc.subjectmachine learning
dc.subjectautoencoders
dc.subjectlatent space
dc.subjectDimensionality reduction
dc.subjectdeep phenotyping
dc.subjectclustering algorithm
dc.subjectlearned latent space
dc.subjecthealthcare
dc.subjectmimic-iv
dc.titleUtilizing Latent Space Representation for Disease Phenotyping and Patient Risk Stratification
dc.typeText
dcterms.creatorhttps://orcid.org/0009-0004-2775-7000

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Utilizing_Latent_Space_Representation_for_Disease_Phenotyping_and_Patient_Risk_Stratification_SFerrer.pdf
Size:
556.63 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.65 KB
Format:
Item-specific license agreed upon to submission
Description: