INFORMATION RETRIEVAL IN LARGE LANGUAGE MODELS

dc.contributor.advisorDey, Sanorita
dc.contributor.authorLeta, Edosa
dc.contributor.departmentComputer Science and Electrical Engineering
dc.contributor.programComputer Science
dc.date.accessioned2023-07-31T20:00:03Z
dc.date.available2023-07-31T20:00:03Z
dc.date.issued2023-01-01
dc.description.abstractLarge Language Models (LLMs) have been used to retrieve information. While this is an exciting opportunity to reduce costs in performing user studies and speeding up social science research (which is often bottlenecked by user study costs), this also opens up opportunities for harm. In particular, LLMs have been shown to generate random outputs. In this project, we aim to investigate how skewed LLMs are in their demographic predictions of the US population by comparing them to PEW research surveys. For this research, we are using one of the biggest and most commonly used LLMs, GPT 3.5 (text-DaVinci-003). I hope that this result sheds light on the gap that exists in information retrieval with LLMs and shows some room for more future work as we try to improve the use of LLMs, in this case, ChatGPT.
dc.formatapplication:pdf
dc.genrethesis
dc.identifierdoi:10.13016/m2lvnx-r6nr
dc.identifier.other12742
dc.identifier.urihttp://hdl.handle.net/11603/28958
dc.languageen
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Theses and Dissertations Collection
dc.relation.ispartofUMBC Graduate School Collection
dc.relation.ispartofUMBC Student Collection
dc.rightsThis item may be protected under Title 17 of the U.S. Copyright Law. It is made available by UMBC for non-commercial research and education. For permission to publish or reproduce, please see http://aok.lib.umbc.edu/specoll/repro.php or contact Special Collections at speccoll(at)umbc.edu
dc.sourceOriginal File Name: Leta_umbc_0434M_12742.pdf
dc.titleINFORMATION RETRIEVAL IN LARGE LANGUAGE MODELS
dc.typeText
dcterms.accessRightsDistribution Rights granted to UMBC by the author.
dcterms.accessRightsAccess limited to the UMBC community. Item may possibly be obtained via Interlibrary Loan thorugh a local library, pending author/copyright holder's permission.

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Leta_umbc_0434M_12742.pdf
Size:
137.1 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Leta-Edosa_988243_UMBC Publication Form_Open.pdf
Size:
201.86 KB
Format:
Adobe Portable Document Format
Description: