Transferring Semantic Knowledge Into Language Encoders

dc.contributor.advisorFerraro, Francis
dc.contributor.authorUMAIR, MOHAMMAD
dc.contributor.departmentComputer Science and Electrical Engineering
dc.contributor.programComputer Science
dc.date.accessioned2022-09-29T15:37:54Z
dc.date.available2022-09-29T15:37:54Z
dc.date.issued2021-01-01
dc.description.abstractWe introduce semantic form mid-tuning, an approach for transferring semantic knowledge from semantic meaning representations into transformer-based language encoders. In mid-tuning, we learn to align the text of general sentences---not tied to any particular inference task---and semantic representations of those sentences that were automatically generated by FrameNet and PropBank Semantic Role parsers. We show that this alignment can be learned implicitly via classification or directly via triplet loss. Our method yields language encoders that demonstrate improved predictive performance across inference, reading comprehension, textual similarity, and other semantic tasks drawn from the GLUE, SuperGLUE, and SentEval benchmarks. We evaluate our approach on three popular baseline models, where our experimental results and analysis concludes that current pre-trained language models can further benefit from structured semantic frames with the proposed mid-tuning method, as they inject additional task-agnostic knowledge to the encoder, improving the generated embeddings as well as the linguistic properties of the given model, as evident from improvements on a popular sentence embedding toolkit and a variety of probing tasks.
dc.formatapplication:pdf
dc.genretheses
dc.identifierdoi:10.13016/m2ztl9-dlsv
dc.identifier.other12405
dc.identifier.urihttp://hdl.handle.net/11603/25979
dc.languageen
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Theses and Dissertations Collection
dc.relation.ispartofUMBC Graduate School Collection
dc.relation.ispartofUMBC Student Collection
dc.rightsThis item may be protected under Title 17 of the U.S. Copyright Law. It is made available by UMBC for non-commercial research and education. For permission to publish or reproduce, please see http://aok.lib.umbc.edu/specoll/repro.php or contact Special Collections at speccoll(at)umbc.edu
dc.sourceOriginal File Name: UMAIR_umbc_0434M_12405.pdf
dc.subjectLanguage Modeling
dc.subjectLinguistics
dc.subjectMachine Learning
dc.subjectNatural Language Processing
dc.subjectSemantic Knowledge
dc.subjectSemantic Representations
dc.titleTransferring Semantic Knowledge Into Language Encoders
dc.typeText
dcterms.accessRightsDistribution Rights granted to UMBC by the author.
dcterms.accessRightsAccess limited to the UMBC community. Item may possibly be obtained via Interlibrary Loan thorugh a local library, pending author/copyright holder's permission.

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
UMAIR_umbc_0434M_12405.pdf
Size:
1.29 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
UMAIR-MOHAMMAD_Open.pdf
Size:
190.31 KB
Format:
Adobe Portable Document Format
Description: