A Domain-Agnostic Neurosymbolic Approach for Big Social Data Analysis: Evaluating Mental Health Sentiment on Social Media during COVID-19
| dc.contributor.author | Khandelwal, Vedant | |
| dc.contributor.author | Gaur, Manas | |
| dc.contributor.author | Kursuncu, Ugur | |
| dc.contributor.author | Shalin, Valerie | |
| dc.contributor.author | Sheth, Amit | |
| dc.date.accessioned | 2024-12-11T17:02:47Z | |
| dc.date.available | 2024-12-11T17:02:47Z | |
| dc.date.issued | 2024-11-11 | |
| dc.description.abstract | Monitoring public sentiment via social media is potentially helpful during health crises such as the COVID-19 pandemic. However, traditional frequency-based, data-driven neural network-based approaches can miss newly relevant content due to the evolving nature of language in a dynamically evolving environment. Human-curated symbolic knowledge sources, such as lexicons for standard language and slang terms, can potentially elevate social media signals in evolving language. We introduce a neurosymbolic method that integrates neural networks with symbolic knowledge sources, enhancing the detection and interpretation of mental health-related tweets relevant to COVID-19. Our method was evaluated using a corpus of large datasets (approximately 12 billion tweets, 2.5 million subreddit data, and 700k news articles) and multiple knowledge graphs. This method dynamically adapts to evolving language, outperforming purely data-driven models with an F1 score exceeding 92\%. This approach also showed faster adaptation to new data and lower computational demands than fine-tuning pre-trained large language models (LLMs). This study demonstrates the benefit of neurosymbolic methods in interpreting text in a dynamic environment for tasks such as health surveillance. | |
| dc.description.sponsorship | Research is funded as part of NSF Awards: ”Spokes: MEDIUM: MIDWEST: Collaborative: Community-Driven Data Engineering for Substance Abuse Prevention in the Rural Midwest” (Award#: 1956009), ”EAGER: Advancing Neurosymbolic AI with Deep Knowledge-infused Learning” (Award#: 2133842), and ”EAGER: Knowledge-guided neurosymbolic AI with guardrails for safe virtual health assistants” (Award#: 2335967) | |
| dc.description.uri | http://arxiv.org/abs/2411.07163 | |
| dc.format.extent | 13 pages | |
| dc.genre | journal articles | |
| dc.genre | preprints | |
| dc.identifier | doi:10.13016/m2lgbd-hp2e | |
| dc.identifier.uri | https://doi.org/10.48550/arXiv.2411.07163 | |
| dc.identifier.uri | http://hdl.handle.net/11603/37106 | |
| dc.language.iso | en | |
| dc.relation.isAvailableAt | The University of Maryland, Baltimore County (UMBC) | |
| dc.relation.ispartof | UMBC Faculty Collection | |
| dc.relation.ispartof | UMBC Computer Science and Electrical Engineering Department | |
| dc.rights | Attribution 4.0 International CC BY 4.0 | |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
| dc.subject | UMBC Ebiquity Research Group | |
| dc.subject | Computer Science - Artificial Intelligence | |
| dc.title | A Domain-Agnostic Neurosymbolic Approach for Big Social Data Analysis: Evaluating Mental Health Sentiment on Social Media during COVID-19 | |
| dc.type | Text | |
| dcterms.creator | https://orcid.org/0000-0002-5411-2230 |
Files
Original bundle
1 - 1 of 1
