Sparse Private LASSO Logistic Regression
Loading...
Links to Files
Author/Creator
Author/Creator ORCID
Date
2023-04-24
Type of Work
Department
Program
Citation of Original Publication
Rights
This work was written as part of one of the author's official duties as an Employee of the United States Government and is therefore a work of the United States Government. In accordance with 17 U.S.C. 105, no copyright protection is available for such works under U.S. Law.
Public Domain Mark 1.0
Public Domain Mark 1.0
Subjects
Abstract
LASSO regularized logistic regression is particularly useful
for its built-in feature selection, allowing coefficients to be removed from
deployment and producing sparse solutions. Differentially private versions of LASSO logistic regression have been developed, but generally
produce dense solutions, reducing the intrinsic utility of the LASSO
penalty. In this paper, we present a differentially private method for
sparse logistic regression that maintains hard zeros. Our key insight is to
first train a non-private LASSO logistic regression model to determine
an appropriate privatized number of non-zero coefficients to use in final model selection. To demonstrate our method’s performance, we run
experiments on synthetic and real-world datasets.