A Domain-Agnostic Neurosymbolic Approach for Big Social Data Analysis: Evaluating Mental Health Sentiment on Social Media during COVID-19

Khandelwal, Vedant; Gaur, Manas; Kursuncu, Ugur; Shalin, Valerie; Sheth, Amit

A Domain-Agnostic Neurosymbolic Approach for Big Social Data Analysis: Evaluating Mental Health Sentiment on Social Media during COVID-19

dc.contributor.author	Khandelwal, Vedant
dc.contributor.author	Gaur, Manas
dc.contributor.author	Kursuncu, Ugur
dc.contributor.author	Shalin, Valerie
dc.contributor.author	Sheth, Amit
dc.date.accessioned	2024-12-11T17:02:47Z
dc.date.available	2024-12-11T17:02:47Z
dc.date.issued	2024-11-11
dc.description.abstract	Monitoring public sentiment via social media is potentially helpful during health crises such as the COVID-19 pandemic. However, traditional frequency-based, data-driven neural network-based approaches can miss newly relevant content due to the evolving nature of language in a dynamically evolving environment. Human-curated symbolic knowledge sources, such as lexicons for standard language and slang terms, can potentially elevate social media signals in evolving language. We introduce a neurosymbolic method that integrates neural networks with symbolic knowledge sources, enhancing the detection and interpretation of mental health-related tweets relevant to COVID-19. Our method was evaluated using a corpus of large datasets (approximately 12 billion tweets, 2.5 million subreddit data, and 700k news articles) and multiple knowledge graphs. This method dynamically adapts to evolving language, outperforming purely data-driven models with an F1 score exceeding 92\%. This approach also showed faster adaptation to new data and lower computational demands than fine-tuning pre-trained large language models (LLMs). This study demonstrates the benefit of neurosymbolic methods in interpreting text in a dynamic environment for tasks such as health surveillance.
dc.description.sponsorship	Research is funded as part of NSF Awards: ”Spokes: MEDIUM: MIDWEST: Collaborative: Community-Driven Data Engineering for Substance Abuse Prevention in the Rural Midwest” (Award#: 1956009), ”EAGER: Advancing Neurosymbolic AI with Deep Knowledge-infused Learning” (Award#: 2133842), and ”EAGER: Knowledge-guided neurosymbolic AI with guardrails for safe virtual health assistants” (Award#: 2335967)
dc.description.uri	http://arxiv.org/abs/2411.07163
dc.format.extent	13 pages
dc.genre	journal articles
dc.genre	preprints
dc.identifier	doi:10.13016/m2lgbd-hp2e
dc.identifier.uri	https://doi.org/10.48550/arXiv.2411.07163
dc.identifier.uri	http://hdl.handle.net/11603/37106
dc.language.iso	en
dc.relation.isAvailableAt	The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof	UMBC Faculty Collection
dc.relation.ispartof	UMBC Computer Science and Electrical Engineering Department
dc.rights	Attribution 4.0 International CC BY 4.0
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/
dc.subject	UMBC Ebiquity Research Group
dc.subject	Computer Science - Artificial Intelligence
dc.title	A Domain-Agnostic Neurosymbolic Approach for Big Social Data Analysis: Evaluating Mental Health Sentiment on Social Media during COVID-19
dc.type	Text
dcterms.creator	https://orcid.org/0000-0002-5411-2230

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 2411.07163v1.pdf
Size:: 1.33 MB
Format:: Adobe Portable Document Format

Download

Collections

UMBC Faculty Collection
UMBC Computer Science and Electrical Engineering Department