The Keyword Explorer Suite: A Toolkit for Understanding Online Populations

dc.contributor.author: Feldman, Philip
dc.contributor.author: Pan, Shimei
dc.contributor.author: Foulds, James
dc.date.accessioned: 2024-02-27T17:45:10Z
dc.date.available: 2024-02-27T17:45:10Z
dc.date.issued: 2023-03-27
dc.description: IUI '23 Companion: Companion Proceedings of the 28th International Conference on Intelligent User Interfaces (March 2023)
dc.description.abstract: We have developed a set of Python applications that use large language models to identify and analyze data from social media platforms relevant to a population of interest. Our pipeline begins with using OpenAI’s GPT-3 to generate potential keywords for identifying relevant text content from the target population. The keywords are then validated, and the content downloaded and analyzed using GPT-3 embedding and manifold reduction. Corpora are then created to fine-tune GPT-2 models to explore latent information via prompt-based queries. These tools allow researchers and practitioners to gain valuable insights into population subgroups online.
dc.description.uri: https://dl.acm.org/doi/abs/10.1145/3581754.3584122
dc.format.extent: 6 pages
dc.genre: conference papers and proceedings
dc.genre: preprints
dc.identifier: doi:10.13016/m2qeaf-dhst
dc.identifier.citation: Philip G Feldman, Shimei Pan, and James Foulds. 2023. The Keyword Explorer Suite: A Toolkit for Understanding Online Populations. In Companion Proceedings of the 28th International Conference on Intelligent User Interfaces (IUI '23 Companion). Association for Computing Machinery, New York, NY, USA, 21–24. https://doi.org/10.1145/3581754.3584122
dc.identifier.uri: https://doi.org/10.1145/3581754.3584122
dc.identifier.uri: http://hdl.handle.net/11603/31711
dc.publisher: ACM
dc.relation.isAvailableAt: The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof: UMBC Information Systems Department Collection
dc.relation.ispartof: UMBC Faculty Collection
dc.relation.ispartof: UMBC Student Collection
dc.rights: This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.title: The Keyword Explorer Suite: A Toolkit for Understanding Online Populations
dc.type: Text
dcterms.creator: https://orcid.org/0000-0001-6164-6620
dcterms.creator: https://orcid.org/0000-0002-5989-8543
dcterms.creator: https://orcid.org/0000-0003-0935-4182

Files

Original bundle

Name: 2301.05198.pdf
Size: 311.91 KB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 2.56 KB
Format: Item-specific license agreed upon to submission