Ensembles in Adversarial Classification for Spam

dc.contributor.authorChinavle, Deepak
dc.contributor.authorKolari, Pranam
dc.contributor.authorOates, Tim
dc.contributor.authorFinin, Tim
dc.date.accessioned2018-11-15T16:53:37Z
dc.date.available2018-11-15T16:53:37Z
dc.date.issued2009-11-02
dc.descriptionProceedings of the 18th ACM Conference on Information and Knowledge Managementen_US
dc.description.abstractThe standard method for combating spam, either in email or on the web, is to train a classifier on manually labeled instances. As the spammers change their tactics, the performance of such classifiers tends to decrease over time. Gathering and labeling more data to periodically retrain the classifier is expensive. We present a method based on an ensemble of classifiers that can detect when its performance might be degrading and retrain itself, all without manual intervention. Experiments with a real-world dataset from the blog domain show that our methods can significantly reduce the number of times classifiers are retrained when compared to a fixed retraining schedule, and they maintain classification accuracy even in the absence of manually labeled examples.en_US
dc.description.urihttps://dl.acm.org/citation.cfm?doid=1645953.1646290en_US
dc.format.extent4 pagesen_US
dc.genreconference papers and proceedings preprintsen_US
dc.identifierdoi:10.13016/M20G3H28K
dc.identifier.citationDeepak Chinavle, Pranam Kolari, Tim Oates, and Tim Finin, Ensembles in Adversarial Classification for Spam, Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009, DOI : 10.1145/1645953.1646290en_US
dc.identifier.uri10.1145/1645953.1646290
dc.identifier.urihttp://hdl.handle.net/11603/12002
dc.language.isoen_USen_US
dc.publisherACMen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Student Collection
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.subjectSpamen_US
dc.subjectWeblogsen_US
dc.subjectEnsemblesen_US
dc.subjectAdversarial Classificationen_US
dc.subjectNonstationarityen_US
dc.subjectRetrainingen_US
dc.subjectUMBC Ebiquity Research Groupen_US
dc.titleEnsembles in Adversarial Classification for Spamen_US
dc.typeTexten_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
460.pdf
Size:
171.69 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.68 KB
Format:
Item-specific license agreed upon to submission
Description: