Ensembles in Adversarial Classification for Spam

Chinavle, Deepak; Kolari, Pranam; Oates, Tim; Finin, Tim

Ensembles in Adversarial Classification for Spam

dc.contributor.author	Chinavle, Deepak
dc.contributor.author	Kolari, Pranam
dc.contributor.author	Oates, Tim
dc.contributor.author	Finin, Tim
dc.date.accessioned	2018-11-15T16:53:37Z
dc.date.available	2018-11-15T16:53:37Z
dc.date.issued	2009-11-02
dc.description	Proceedings of the 18th ACM Conference on Information and Knowledge Management	en_US
dc.description.abstract	The standard method for combating spam, either in email or on the web, is to train a classifier on manually labeled instances. As the spammers change their tactics, the performance of such classifiers tends to decrease over time. Gathering and labeling more data to periodically retrain the classifier is expensive. We present a method based on an ensemble of classifiers that can detect when its performance might be degrading and retrain itself, all without manual intervention. Experiments with a real-world dataset from the blog domain show that our methods can significantly reduce the number of times classifiers are retrained when compared to a fixed retraining schedule, and they maintain classification accuracy even in the absence of manually labeled examples.	en_US
dc.description.uri	https://dl.acm.org/citation.cfm?doid=1645953.1646290	en_US
dc.format.extent	4 pages	en_US
dc.genre	conference papers and proceedings preprints	en_US
dc.identifier	doi:10.13016/M20G3H28K
dc.identifier.citation	Deepak Chinavle, Pranam Kolari, Tim Oates, and Tim Finin, Ensembles in Adversarial Classification for Spam, Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009, DOI : 10.1145/1645953.1646290	en_US
dc.identifier.uri	10.1145/1645953.1646290
dc.identifier.uri	http://hdl.handle.net/11603/12002
dc.language.iso	en_US	en_US
dc.publisher	ACM	en_US
dc.relation.isAvailableAt	The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof	UMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartof	UMBC Faculty Collection
dc.relation.ispartof	UMBC Student Collection
dc.rights	This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.subject	Spam	en_US
dc.subject	Weblogs	en_US
dc.subject	Ensembles	en_US
dc.subject	Adversarial Classification	en_US
dc.subject	Nonstationarity	en_US
dc.subject	Retraining	en_US
dc.subject	UMBC Ebiquity Research Group	en_US
dc.title	Ensembles in Adversarial Classification for Spam	en_US
dc.type	Text	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 460.pdf
Size:: 171.69 KB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.68 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

UMBC Computer Science and Electrical Engineering Department
UMBC Faculty Collection
UMBC Student Collection