Ensembles in Adversarial Classification for Spam

dc.contributor.authorChinavle, Deepak
dc.contributor.authorKolari, Pranam
dc.contributor.authorOates, Tim
dc.contributor.authorFinin, Tim
dc.date.accessioned2018-11-15T16:53:37Z
dc.date.available2018-11-15T16:53:37Z
dc.date.issued2009-11-02
dc.descriptionProceedings of the 18th ACM Conference on Information and Knowledge Managementen
dc.description.abstractThe standard method for combating spam, either in email or on the web, is to train a classifier on manually labeled instances. As the spammers change their tactics, the performance of such classifiers tends to decrease over time. Gathering and labeling more data to periodically retrain the classifier is expensive. We present a method based on an ensemble of classifiers that can detect when its performance might be degrading and retrain itself, all without manual intervention. Experiments with a real-world dataset from the blog domain show that our methods can significantly reduce the number of times classifiers are retrained when compared to a fixed retraining schedule, and they maintain classification accuracy even in the absence of manually labeled examples.en
dc.description.urihttps://dl.acm.org/citation.cfm?doid=1645953.1646290en
dc.format.extent4 pagesen
dc.genreconference papers and proceedings preprintsen
dc.identifierdoi:10.13016/M20G3H28K
dc.identifier.citationDeepak Chinavle, Pranam Kolari, Tim Oates, and Tim Finin, Ensembles in Adversarial Classification for Spam, Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009, DOI : 10.1145/1645953.1646290en
dc.identifier.uri10.1145/1645953.1646290
dc.identifier.urihttp://hdl.handle.net/11603/12002
dc.language.isoenen
dc.publisherACMen
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Student Collection
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.subjectSpamen
dc.subjectWeblogsen
dc.subjectEnsemblesen
dc.subjectAdversarial Classificationen
dc.subjectNonstationarityen
dc.subjectRetrainingen
dc.subjectUMBC Ebiquity Research Groupen
dc.titleEnsembles in Adversarial Classification for Spamen
dc.typeTexten

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
460.pdf
Size:
171.69 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.68 KB
Format:
Item-specific license agreed upon to submission
Description: