LOST: A Mental Health Dataset of Low Self-esteem in Reddit Posts

dc.contributor.authorGarg, Muskan
dc.contributor.authorGaur, Manas
dc.contributor.authorGoswami, Raxit
dc.contributor.authorSohn, Sunghwan
dc.date.accessioned2023-07-18T19:43:18Z
dc.date.available2023-07-18T19:43:18Z
dc.date.issued2023-06-08
dc.description.abstractLow self-esteem and interpersonal needs (i.e., thwarted belongingness (TB) and perceived burdensomeness (PB)) have a major impact on depression and suicide attempts. Individuals seek social connectedness on social media to boost and alleviate their loneliness. Social media platforms allow people to express their thoughts, experiences, beliefs, and emotions. Prior studies on mental health from social media have focused on symptoms, causes, and disorders. Whereas an initial screening of social media content for interpersonal risk factors and low self-esteem may raise early alerts and assign therapists to at-risk users of mental disturbance. Standardized scales measure self-esteem and interpersonal needs from questions created using psychological theories. In the current research, we introduce a psychology-grounded and expertly annotated dataset, LoST: Low Self esTeem, to study and detect low self-esteem on Reddit. Through an annotation approach involving checks on coherence, correctness, consistency, and reliability, we ensure gold-standard for supervised learning. We present results from different deep language models tested using two data augmentation techniques. Our findings suggest developing a class of language models that infuses psychological and clinical knowledge.en_US
dc.description.sponsorshipWe extend our sincere acknowledgement to the postgraduate student annotators, Ritika Bhardwaj, Astha Jain, and Amrit Chadha, for their diligent efforts in the annotation process. We express our gratitude to Veena Krishnan, a senior clinical psychologist, and Ruchi Joshi, a rehabilitation counselor, for their unwavering support throughout the project. This project was partially supported by NIH R01 AG068007. We thank Surjodeep Sarkar for proofreading this work.en_US
dc.description.urihttps://arxiv.org/abs/2306.05596en_US
dc.format.extent6 pagesen_US
dc.genrejournal articlesen_US
dc.genrepreprintsen_US
dc.identifierdoi:10.13016/m2etir-jn38
dc.identifier.urihttps://doi.org/10.48550/arXiv.2306.05596
dc.identifier.urihttp://hdl.handle.net/11603/28744
dc.language.isoen_USen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Faculty Collection
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.en_US
dc.titleLOST: A Mental Health Dataset of Low Self-esteem in Reddit Postsen_US
dc.typeTexten_US
dcterms.creatorhttps://orcid.org/0000-0002-5411-2230en_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2306.05596.pdf
Size:
344.07 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: