PASTA: A Dataset for Modeling Participant States in Narratives

dc.contributor.authorGhosh, Sayontan
dc.contributor.authorKoupaee, Mahnaz
dc.contributor.authorChen, Isabella
dc.contributor.authorFerraro, Francis
dc.contributor.authorChambers, Nathanael
dc.contributor.authorBalasubramanian, Niranjan
dc.date.accessioned2022-08-23T14:44:35Z
dc.date.available2022-08-23T14:44:35Z
dc.date.issued2023-11-02
dc.description.abstractThe events in a narrative are understood as a coherent whole via the underlying states of their participants. Often, these participant states are not explicitly mentioned, instead left to be inferred by the reader. A model that understands narratives should likewise infer these implicit states, and even reason about the impact of changes to these states on the narrative. To facilitate this goal, we introduce a new crowdsourced English-language, Participant States dataset, PASTA. This dataset contains inferable participant states; a counterfactual perturbation to each state; and the changes to the story that would be necessary if the counterfactual were true. We introduce three state-based reasoning tasks that test for the ability to infer when a state is entailed by a story, to revise a story conditioned on a counterfactual state, and to explain the most likely state change given a revised story. Experiments show that today’s LLMs can reason about states to some degree, but there is large room for improvement, especially in problems requiring access and ability to reason with diverse types of knowledge (e.g., physical, numerical, factual).1en_US
dc.description.sponsorshipWe would like to thank the anonymous reviewers for their comments, questions, and suggestions. This material is also based on research that is in part supported by the NSF, Grant No. 2007290, Army Research Laboratory, Grant No. W911NF2120076, and by the Air Force Research Laboratory (AFRL), DARPA, for the KAIROS program under agreement number FA8750-19-2-1003. The U.S. Government is authorized to reproduce and distribute reprints for Governmental purposes notwithstanding any copyright notation thereon. The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either express or implied, of the Air Force Research Laboratory (AFRL), DARPA, or the U.S. Government. This material is based in part upon work supported by the National Science Foundation under grant no. IIS-2024878.
dc.description.urihttps://direct.mit.edu/tacl/article/doi/10.1162/tacl_a_00600/118075/PASTA-A-Dataset-for-Modeling-PArticipant-STAtes-inen_US
dc.format.extent18 pagesen_US
dc.genrejournal articlesen_US
dc.identifierdoi:10.13016/m2cpgf-x1l7
dc.identifier.citationGhosh, Sayontan, Mahnaz Koupaee, Isabella Chen, Francis Ferraro, Nathanael Chambers, and Niranjan Balasubramanian. “PASTA: A Dataset for Modeling PArticipant STAtes in Narratives.” Transactions of the Association for Computational Linguistics 11 (November 2, 2023): 1283–1300. https://doi.org/10.1162/tacl_a_00600.
dc.identifier.urihttps://doi.org/10.1162/tacl_a_00600
dc.identifier.urihttp://hdl.handle.net/11603/25546
dc.language.isoen_USen_US
dc.publisherMIT Press
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Faculty Collection
dc.rightsThis work was written as part of one of the author's official duties as an Employee of the United States Government and is therefore a work of the United States Government. In accordance with 17 U.S.C. 105, no copyright protection is available for such works under U.S. Law.en_US
dc.rightsPublic Domain Mark 1.0*
dc.rights.urihttp://creativecommons.org/publicdomain/mark/1.0/*
dc.subjectUMBC Ebiquity Research Group
dc.titlePASTA: A Dataset for Modeling Participant States in Narrativesen_US
dc.typeTexten_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
tacl_a_00600.pdf
Size:
1.72 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: