LDAExplore: Visualizing Topic Models Generated Using Latent Dirichlet Allocation

dc.contributor.authorGanesan, Ashwinkumar
dc.contributor.authorBrantley, Kiante
dc.contributor.authorPan, Shimei
dc.contributor.authorChen, Jian
dc.date.accessioned2025-01-08T15:08:58Z
dc.date.available2025-01-08T15:08:58Z
dc.date.issued2015-07-23
dc.description.abstractWe present LDAExplore, a tool to visualize topic distributions in a given document corpus that are generated using Topic Modeling methods. Latent Dirichlet Allocation (LDA) is one of the basic methods that is predominantly used to generate topics. One of the problems with methods like LDA is that users who apply them may not understand the topics that are generated. Also, users may find it difficult to search correlated topics and correlated documents. LDAExplore, tries to alleviate these problems by visualizing topic and word distributions generated from the document corpus and allowing the user to interact with them. The system is designed for users, who have minimal knowledge of LDA or Topic Modelling methods. To evaluate our design, we run a pilot study which uses the abstracts of 322 Information Visualization papers, where every abstract is considered a document. The topics generated are then explored by users. The results show that users are able to find correlated documents and group them based on topics that are similar.
dc.description.urihttp://arxiv.org/abs/1507.06593
dc.format.extent7 pages
dc.genrejournal articles
dc.genrepreprints
dc.identifierdoi:10.13016/m2uoty-omcu
dc.identifier.urihttps://doi.org/10.48550/arXiv.1507.06593
dc.identifier.urihttp://hdl.handle.net/11603/37213
dc.language.isoen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department
dc.relation.ispartofUMBC Student Collection
dc.relation.ispartofUMBC Information Systems Department
dc.relation.ispartofUMBC Faculty Collection
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.subjectComputer Science - Human-Computer Interaction
dc.subjectComputer Science - Information Retrieval
dc.titleLDAExplore: Visualizing Topic Models Generated Using Latent Dirichlet Allocation
dc.typeText
dcterms.creatorhttps://orcid.org/0000-0002-5989-8543

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
1507.06593v1.pdf
Size:
760.49 KB
Format:
Adobe Portable Document Format