A Domain-Agnostic Neurosymbolic Approach for Big Social Data Analysis: Evaluating Mental Health Sentiment on Social Media during COVID-19

dc.contributor.authorKhandelwal, Vedant
dc.contributor.authorGaur, Manas
dc.contributor.authorKursuncu, Ugur
dc.contributor.authorShalin, Valerie
dc.contributor.authorSheth, Amit
dc.date.accessioned2024-12-11T17:02:47Z
dc.date.available2024-12-11T17:02:47Z
dc.date.issued2024-11-11
dc.description.abstractMonitoring public sentiment via social media is potentially helpful during health crises such as the COVID-19 pandemic. However, traditional frequency-based, data-driven neural network-based approaches can miss newly relevant content due to the evolving nature of language in a dynamically evolving environment. Human-curated symbolic knowledge sources, such as lexicons for standard language and slang terms, can potentially elevate social media signals in evolving language. We introduce a neurosymbolic method that integrates neural networks with symbolic knowledge sources, enhancing the detection and interpretation of mental health-related tweets relevant to COVID-19. Our method was evaluated using a corpus of large datasets (approximately 12 billion tweets, 2.5 million subreddit data, and 700k news articles) and multiple knowledge graphs. This method dynamically adapts to evolving language, outperforming purely data-driven models with an F1 score exceeding 92\%. This approach also showed faster adaptation to new data and lower computational demands than fine-tuning pre-trained large language models (LLMs). This study demonstrates the benefit of neurosymbolic methods in interpreting text in a dynamic environment for tasks such as health surveillance.
dc.description.sponsorshipResearch is funded as part of NSF Awards: ”Spokes: MEDIUM: MIDWEST: Collaborative: Community-Driven Data Engineering for Substance Abuse Prevention in the Rural Midwest” (Award#: 1956009), ”EAGER: Advancing Neurosymbolic AI with Deep Knowledge-infused Learning” (Award#: 2133842), and ”EAGER: Knowledge-guided neurosymbolic AI with guardrails for safe virtual health assistants” (Award#: 2335967)
dc.description.urihttp://arxiv.org/abs/2411.07163
dc.format.extent13 pages
dc.genrejournal articles
dc.genrepreprints
dc.identifierdoi:10.13016/m2lgbd-hp2e
dc.identifier.urihttps://doi.org/10.48550/arXiv.2411.07163
dc.identifier.urihttp://hdl.handle.net/11603/37106
dc.language.isoen
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department
dc.rightsAttribution 4.0 International CC BY 4.0
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectUMBC Ebiquity Research Group
dc.subjectComputer Science - Artificial Intelligence
dc.titleA Domain-Agnostic Neurosymbolic Approach for Big Social Data Analysis: Evaluating Mental Health Sentiment on Social Media during COVID-19
dc.typeText
dcterms.creatorhttps://orcid.org/0000-0002-5411-2230

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2411.07163v1.pdf
Size:
1.33 MB
Format:
Adobe Portable Document Format