Learning a Sparse Neural Network using IHT

dc.contributor.authorDamadi, Saeed
dc.contributor.authorZolfaghari, Soroush
dc.contributor.authorRezaie, Mahdi
dc.contributor.authorShen, Jinglai
dc.date.accessioned2024-05-29T14:38:23Z
dc.date.available2024-05-29T14:38:23Z
dc.date.issued2024-04-29
dc.description.abstractThe core of a good model is in its ability to focus only on important information that reflects the basic patterns and consistencies, thus pulling out a clear, noise-free signal from the dataset. This necessitates using a simplified model defined by fewer parameters. The importance of theoretical foundations becomes clear in this context, as this paper relies on established results from the domain of advanced sparse optimization, particularly those addressing nonlinear differentiable functions. The need for such theoretical foundations is further highlighted by the trend that as computational power for training NNs increases, so does the complexity of the models in terms of a higher number of parameters. In practical scenarios, these large models are often simplified to more manageable versions with fewer parameters.
dc.description.urihttp://arxiv.org/abs/2404.18414
dc.format.extent10 pages
dc.genrejournal articles
dc.genrepreprints
dc.identifierdoi:10.13016/m21llt-c5qz
dc.identifier.urihttps://doi.org/10.48550/arXiv.2404.18414
dc.identifier.urihttp://hdl.handle.net/11603/34347
dc.language.isoen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Student Collection
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department
dc.relation.ispartofUMBC Mathematics and Statistics Department
dc.rightsCC BY 4.0 DEED Attribution 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subjectComputer Science - Machine Learning
dc.titleLearning a Sparse Neural Network using IHT
dc.typeText
dcterms.creatorhttps://orcid.org/0000-0002-2806-1476
dcterms.creatorhttps://orcid.org/0000-0003-2172-4182

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2404.18414v1.pdf
Size:
1.72 MB
Format:
Adobe Portable Document Format