Adaptively Solving the Local-Minimum Problem for Deep Neural Networks

dc.contributor.author: Wang, Huachuan
dc.contributor.author: Lo, James Ting-Ho
dc.date.accessioned: 2021-02-04T18:34:14Z
dc.date.available: 2021-02-04T18:34:14Z
dc.description.abstract: This paper aims to overcome a fundamental problem in the theory and application of deep neural networks (DNNs). We propose a method to solve the local minimum problem in training DNNs directly. Our method is based on the cross-entropy loss criterion's convexification by transforming the cross-entropy loss into a risk averting error (RAE) criterion. To alleviate numerical difficulties, a normalized RAE (NRAE) is employed. The convexity region of the cross-entropy loss expands as its risk sensitivity index (RSI) increases. Making the best use of the convexity region, our method starts training with an extensive RSI, gradually reduces it, and switches to the RAE as soon as the RAE is numerically feasible. After training converges, the resultant deep learning machine is expected to be inside the attraction basin of a global minimum of the cross-entropy loss. Numerical results are provided to show the effectiveness of the proposed method. (en_US)
dc.description.uri: https://arxiv.org/abs/2012.13632 (en_US)
dc.format.extent: 9 pages (en_US)
dc.genre: journal articles preprints (en_US)
dc.identifier: doi:10.13016/m2dcon-ntjx
dc.identifier.citation: Huachuan Wang and James Ting-Ho Lo, Adaptively Solving the Local-Minimum Problem for Deep Neural Networks, https://arxiv.org/abs/2012.13632 (en_US)
dc.identifier.uri: http://hdl.handle.net/11603/20939
dc.language.iso: en_US (en_US)
dc.relation.isAvailableAt: The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof: UMBC Mathematics Department Collection
dc.relation.ispartof: UMBC Faculty Collection
dc.rights: This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.rights: Attribution-NonCommercial-NoDerivatives 4.0 International (*)
dc.rights.uri: https://creativecommons.org/licenses/by-nc-nd/4.0/ (*)
dc.title: Adaptively Solving the Local-Minimum Problem for Deep Neural Networks (en_US)
dc.type: Text (en_US)
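
The abstract names an RAE criterion, a normalized variant (NRAE), and a switch to the RAE once it is numerically feasible, but this record does not give their formulas. The sketch below is an illustration only. It assumes the RAE is a sum of exponentials of the per-sample cross-entropy scaled by the RSI, and that the NRAE is the log of the mean exponential divided by the RSI; both forms, and the helper names `rae`, `nrae`, and `rae_is_feasible`, are assumptions rather than the authors' definitions. It shows why a normalized form helps: the raw exponential overflows at a large RSI, while the log-mean-exp form stays finite.

```python
import math

def cross_entropy_per_sample(probs, labels):
    """Per-sample cross-entropy -log p[y] from predicted class probabilities."""
    return [-math.log(max(p[y], 1e-12)) for p, y in zip(probs, labels)]

def rae(losses, rsi):
    """ASSUMED RAE form: sum_k exp(rsi * loss_k). Raises OverflowError for large rsi."""
    return sum(math.exp(rsi * loss) for loss in losses)

def nrae(losses, rsi):
    """ASSUMED NRAE form: (1/rsi) * log(mean_k exp(rsi * loss_k)).

    The maximum exponent is subtracted before exponentiating, so no term
    exceeds exp(0) = 1 and the result stays finite.
    """
    z = [rsi * loss for loss in losses]
    m = max(z)
    mean_exp = sum(math.exp(v - m) for v in z) / len(z)
    return (m + math.log(mean_exp)) / rsi

def rae_is_feasible(losses, rsi):
    """Hypothetical feasibility check: can the raw RAE be evaluated without overflow?"""
    try:
        rae(losses, rsi)
        return True
    except OverflowError:
        return False

if __name__ == "__main__":
    probs = [[0.9, 0.1], [0.2, 0.8], [0.6, 0.4]]
    labels = [0, 1, 1]
    losses = cross_entropy_per_sample(probs, labels)
    # Start with a large RSI and reduce it, reporting which criterion is usable.
    for rsi in (1e4, 1e2, 1.0):
        criterion = "RAE" if rae_is_feasible(losses, rsi) else "NRAE"
        print(rsi, criterion, nrae(losses, rsi))
```

Under these assumed forms, the NRAE approaches the largest per-sample loss as the RSI grows and the mean loss as the RSI approaches zero, which is one way to read the abstract's schedule of starting with a large RSI and gradually reducing it.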

Files

Original bundle
Name: 2012.13632.pdf
Size: 171.4 KB
Format: Adobe Portable Document Format

License bundle
Name: license.txt
Size: 2.56 KB
Format: Item-specific license agreed upon to submission