A Modular Unsupervised Framework for Attribute Recognition from Unstructured Text

dc.contributor.authorSolaiman, K. M. A.
dc.date.accessioned2025-07-30T19:21:50Z
dc.date.issued2025-07-05
dc.description.abstractWe propose POSID, a modular, lightweight and on-demand framework for extracting structured attribute-based properties from unstructured text without task-specific fine-tuning. While the method is designed to be adaptable across domains, in this work, we evaluate it on human attribute recognition in incident reports. POSID combines lexical and semantic similarity techniques to identify relevant sentences and extract attributes. We demonstrate its effectiveness on a missing person use case using the InciText dataset, achieving effective attribute extraction without supervised training.
dc.description.sponsorshipThis work began while the author was at Purdue University under the supervision of Bharat Bhargava. It was initially supported by the Northrop Grumman Mission Systems’ Research in Applications for Learning Machines (REALM) Program during the early stages, with further development carried out independently by the author
dc.description.urihttp://arxiv.org/abs/2507.03949
dc.format.extent5 pages
dc.genrejournal articles
dc.genrepreprints
dc.identifierdoi:10.13016/m2zeup-gc5m
dc.identifier.urihttps://doi.org/10.48550/arXiv.2507.03949
dc.identifier.urihttp://hdl.handle.net/11603/39461
dc.language.isoen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department
dc.rightsAttribution-NonCommercial-ShareAlike 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by-nc-sa/4.0/
dc.subjectComputer Science - Computation and Language
dc.titleA Modular Unsupervised Framework for Attribute Recognition from Unstructured Text
dc.typeText

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2507.03949v1.pdf
Size:
334.47 KB
Format:
Adobe Portable Document Format