Utilizing Latent Space Representation for Disease Phenotyping and Patient Risk Stratification
dc.contributor.advisor | Dr. Aijuan Dong | |
dc.contributor.author | Ferrer, Sophia Isabel | |
dc.contributor.department | Hood College Computer Science and Information Technology | |
dc.contributor.program | Hood College Departmental Honors | |
dc.date.accessioned | 2025-04-25T20:15:12Z | |
dc.date.available | 2025-04-25T20:15:12Z | |
dc.date.issued | 2025-04-25 | |
dc.description.abstract | Obstructive sleep apnea (OSA) is a common sleep-related disorder characterized by intermittent breathing pauses during sleep, which can significantly increase the risk of cardiovascular and metabolic diseases. The often undiagnosed nature of OSA, coupled with the difficulty in identifying patients most at risk for associated comorbidities, has led to sub-optimal personalized patient care. While previous studies have established a correlation between OSA and various comorbidities, the complexity and inconsistency of clinical data in electronic health records (EHR) pose challenges in deriving reliable results in healthcare studies. In this paper, we extracted and compared learned latent spaces-- a compressed representation of input data used to uncover hidden patterns-- using methods such as such as Autoencoders, Uniform Manifold Approximation and Projection (UMAP) and Principal Component Analysis (PCA) to filter out the noise and irrelevant details from the EHR data. We then deep phenotyped OSA patients through unsupervised clustering using the latent representation, identified patient subgroups and uncover potential risk factors that drive subgroup differentiation, and developed a clinical tool to predict patient group assignment via supervised learning. These findings enhance the understanding of OSA deep phenotyping and improve patient comorbidity risk assessment. | |
dc.genre | Departmental Honors Paper | |
dc.identifier | doi:10.13016/m2rn4h-pydy | |
dc.identifier.uri | http://hdl.handle.net/11603/38125 | |
dc.language.iso | en_US | |
dc.subject | machine learning | |
dc.subject | autoencoders | |
dc.subject | latent space | |
dc.subject | Dimensionality reduction | |
dc.subject | deep phenotyping | |
dc.subject | clustering algorithm | |
dc.subject | learned latent space | |
dc.subject | healthcare | |
dc.subject | mimic-iv | |
dc.title | Utilizing Latent Space Representation for Disease Phenotyping and Patient Risk Stratification | |
dc.type | Text | |
dcterms.creator | https://orcid.org/0009-0004-2775-7000 |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Utilizing_Latent_Space_Representation_for_Disease_Phenotyping_and_Patient_Risk_Stratification_SFerrer.pdf
- Size:
- 556.63 KB
- Format:
- Adobe Portable Document Format
License bundle
1 - 1 of 1
Loading...
- Name:
- license.txt
- Size:
- 1.65 KB
- Format:
- Item-specific license agreed upon to submission
- Description: