Characterizing the Splogosphere

dc.contributor.authorKolari, Pranam
dc.contributor.authorJava, Akshay
dc.contributor.authorFinin, Tim
dc.date.accessioned2018-12-10T16:44:33Z
dc.date.available2018-12-10T16:44:33Z
dc.date.issued2006-05-23
dc.descriptionProceedings of the 3rd Annual Workshop on Weblogging Ecosystem: Aggregation, Analysis and Dynamics, 15th World Wid Web Conferenceen_US
dc.description.abstractWeblogs or blogs collectively constitute the Blogosphere, forming an influential and interesting subset on theWeb. As with most Internet-enabled applications, the ease of content creation and distribution makes the blogosphere spam prone. Spam blogs or splogs are blogs hosting spam posts, created using machine generated or hijacked content for the sole purpose of hosting ads or raising the PageRank of target sites. These splogs make up the splogosphere, and are now inundating blog search engines and update ping servers. In this work we characterize splogs by comparing them against authentic blogs. Our analysis is based on a dataset made publicly available by BlogPulse, and employs a machine learning model that detects splogs with an accuracy of 90%. To round off this analysis and to better understand splogs, we also present our study of a popular blog update ping server, and show how they are overwhelmed by pings sent by splogs. This overall study will facilitate finding effective new techniques to detect and weed out splogs from the blogosphere.en_US
dc.description.sponsorshipThis work is supported by NSF Awards NSF-ITR-IIS- 0326460 and NSF-ITR-IDM-0219649en_US
dc.description.urihttps://ebiquity.umbc.edu/paper/html/id/299/Characterizing-the-Splogosphereen_US
dc.format.extent7 pagesen_US
dc.genreconference papers and proceedings preprintsen_US
dc.identifierdoi:10.13016/M22V2CF1H
dc.identifier.urihttp://hdl.handle.net/11603/12200
dc.language.isoen_USen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Student Collection
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.subjectsplogosphereen_US
dc.subjectblogen_US
dc.subjectsocial mediaen_US
dc.subjectspamen_US
dc.subjectweb spamen_US
dc.subjectUMBC Ebiquity Research Groupen_US
dc.titleCharacterizing the Splogosphereen_US
dc.typeTexten_US

Files

License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: