On Mining Web Access Logs

dc.contributor.authorJoshi, Anupam
dc.contributor.authorJoshi, Karuna
dc.contributor.authorKrishnapuram, Raghu
dc.date.accessioned2019-02-12T17:53:25Z
dc.date.available2019-02-12T17:53:25Z
dc.date.issued1999-10-24
dc.description.abstractThe proliferation of information on the world wide web has made the personalization of this information space a necessity. One possible approach to web personalization is to mine typical user profiles from the vast amount of historical data stored in access logs. In the absence of any a priori knowledge, unsupervised classification or clustering methods seem to be ideally suited to analyze the semi-structured log data of user accesses. In this paper, we define the notion of a “user session”, as well as a dissimilarity measure between two web sessions that captures the organization of a web site. To extract a user access profile, we cluster the user sessions based on the pair-wise dissimilarities using a robust fuzzy clustering algorithm that we have developed. We report the results of experiments with our algorithm and show that this leads to extraction of interesting user profiles. We also show that it outperforms association rule based approaches for this task.en_US
dc.description.sponsorshipThis work was partially supported by cooperative NSF awards (IIS 9801711 and IIS 9800899) to Joshi and Krishnapuram respectively, a grant from the Office of Naval Research (N00014-96-1-0439 to R. Krishnapuram), and an IBM faculty development award (to A. Joshi).en_US
dc.description.urihttps://ebiquity.umbc.edu/paper/html/id/98/On-Mining-Web-Access-Logsen_US
dc.format.extent17 pagesen_US
dc.genretechnical reportsen_US
dc.identifierdoi:10.13016/m2vrk6-04xp
dc.identifier.urihttp://hdl.handle.net/11603/12771
dc.language.isoen_USen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Information Systems Department Collection
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.subjectworld wide weben_US
dc.subjectuser sessionen_US
dc.subjectaccess logsen_US
dc.subjectdata miningen_US
dc.subjectUMBC Ebiquity Research Groupen_US
dc.titleOn Mining Web Access Logsen_US
dc.typeTexten_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
45.pdf
Size:
140.37 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: