Semi-supervised Expectation Maximization with Contrastive Outlier Removal.

dc.contributor.advisor: Chapman, David
dc.contributor.author: Menon, Sumeet
dc.contributor.department: Computer Science and Electrical Engineering
dc.contributor.program: Computer Science
dc.date.accessioned: 2022-09-29T15:37:59Z
dc.date.available: 2022-09-29T15:37:59Z
dc.date.issued: 2022-01-01
dc.description.abstract: Semi-supervised learning has proven to be one of the most widely used techniques for overcoming the problem of limited labels. One concern when using neural networks for semi-supervised learning in the presence of an extremely small labeled dataset is the occurrence of confidently predicted incorrect labels. This phenomenon of confidently predicting incorrect labels for unlabeled data is called confounding bias. Even though pseudo-labeling and consistency regularization are among the state-of-the-art techniques for semi-supervised learning, both are susceptible to confounding bias when used with neural networks. We propose a methodology that helps neural networks overcome this problem by leveraging information from unlabeled images, using cluster-generating and smoothness-generating techniques in a tightly coupled way to address the fundamental problem of outliers. These techniques can help the model learn attributes of an image that could not be learned from the original resolution of the unlabeled images. We argue both theoretically and empirically that contrastive outlier suppression is a necessary yet overlooked criterion in the application of EM-derived latent bootstrapping: discriminative models such as neural networks can make erroneous predictions with high confidence for samples far from the decision boundary, whereas the generative models for which Expectation Maximization (EM) was originally designed have no such issue. Contrastive outlier suppression is derived under the assumption that the latent feature vector predictions follow a multivariate Gaussian mixture distribution. Our results show that contrastive latent bootstrapping greatly improves semi-supervised classification accuracy over a baseline, and when combined with a state-of-the-art consistency regularization method, it achieves the highest reported semi-supervised accuracy on CIFAR-10 classification using only 250 labeled images.
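To make the Gaussian-mixture assumption in the abstract concrete, the following is a minimal sketch of likelihood-based outlier suppression for pseudo-labels: fit a per-class Gaussian to the latent features and drop low-likelihood samples. This is an illustration only, not the dissertation's implementation; the encoder, the function and parameter names, and the keep_quantile threshold are all assumptions.

    # Illustrative sketch (assumed names, not the author's method):
    # suppress pseudo-labeled samples whose latent vectors are unlikely
    # under a per-class Gaussian fit.
    import numpy as np
    from sklearn.mixture import GaussianMixture

    def filter_pseudo_labels(latent, pseudo_labels, n_classes, keep_quantile=0.9):
        # Boolean mask over samples; True = keep this pseudo-label.
        keep = np.zeros(len(latent), dtype=bool)
        for c in range(n_classes):
            idx = np.where(pseudo_labels == c)[0]
            if len(idx) < 2:
                continue  # too few samples to fit a Gaussian for this class
            gm = GaussianMixture(n_components=1, covariance_type="diag").fit(latent[idx])
            scores = gm.score_samples(latent[idx])  # per-sample log-likelihood
            cutoff = np.quantile(scores, 1.0 - keep_quantile)
            keep[idx[scores >= cutoff]] = True  # drop the low-likelihood tail
        return keep

In an EM-style bootstrapping loop, one would refit these per-class Gaussians each round and retrain only on the retained pseudo-labels.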
dc.format: application/pdf
dc.genre: dissertations
dc.identifier: doi:10.13016/m2iuae-ynix
dc.identifier.other: 12528
dc.identifier.uri: http://hdl.handle.net/11603/25989
dc.language: en
dc.relation.isAvailableAt: The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof: UMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartof: UMBC Theses and Dissertations Collection
dc.relation.ispartof: UMBC Graduate School Collection
dc.relation.ispartof: UMBC Student Collection
dc.rights: This item may be protected under Title 17 of the U.S. Copyright Law. It is made available by UMBC for non-commercial research and education. For permission to publish or reproduce, please see http://aok.lib.umbc.edu/specoll/repro.php or contact Special Collections at speccoll(at)umbc.edu
dc.source: Original File Name: Menon_umbc_0434D_12528.pdf
dc.subject: Consistency Regularization
dc.subject: Outlier Removal
dc.subject: Proxy-Label
dc.subject: Semi-Supervised Learning
dc.title: Semi-supervised Expectation Maximization with Contrastive Outlier Removal.
dc.type: Text
dcterms.accessRights: Distribution rights granted to UMBC by the author.
dcterms.accessRights: Access limited to the UMBC community. Item may possibly be obtained via Interlibrary Loan through a local library, pending the author/copyright holder's permission.

Files

Original bundle
Name: Menon_umbc_0434D_12528.pdf
Size: 1.39 MB
Format: Adobe Portable Document Format

License bundle
Name: Menon-Sumeet_Open.pdf
Size: 265.38 KB
Format: Adobe Portable Document Format