Performance Benchmarking of Data Augmentation and GPU Count Variability for Deep Learning Tornado Predictions

dc.contributor.authorBasalyga, Jonathan N.
dc.contributor.authorBarajas, Carlos A.
dc.contributor.authorGobbert, Matthias
dc.contributor.authorWang, Jianwu
dc.date.accessioned2022-09-26T16:39:32Z
dc.date.available2022-09-26T16:39:32Z
dc.date.issued2021-07-15
dc.description.abstractPredicting violent storms and dangerous weather conditions with current mod- els can take a long time due to the immense complexity associated with weather simulation. Machine learning has the potential to classify tornadic weather patterns much more rapidly, thus allowing for more timely alerts to the pub- lic. To deal with class imbalance challenges in machine learning, different data augmentation approaches have been proposed. In this work, we examine the wall time difference between live data augmentation methods versus the use of preaugmented data when they are used in a convolutional neural network based training for tornado prediction. We also compare CPU and GPU based training over varying sizes of augmented data sets. Additionally we examine what impact varying the number of GPUs used for training will produce given a convolutional neural network on wall time and accuracy. We conclude that using multiple GPUs to train a single network has no signficant advantage over using a single GPU. The number of GPUs used during training should be kept as small as possible for maximum search throughput as the native keras multi-GPU model provides little speedup with optimal learning parameters.en_US
dc.description.sponsorshipThis work is supported by the grant “CyberTraining: DSE: Cross-Training of 575 Researchers in Computing, Applied Mathematics and Atmospheric Sciences us- ing Advanced Cyberinfrastructure Resources” from the National Science Foun- dation (grant no. OAC–1730250). Co-author Carlos Barajas additionally ac- knowledges support as HPCF RA. The hardware used in the computational studies is part of the UMBC High Performance Computing Facility (HPCF). 580 The facility is supported by the U.S. National Science Foundation through the MRI program (grant nos. CNS–0821258, CNS–1228778, and OAC–1726023) and the SCREMS program (grant no. DMS–0821311), with additional substan- tial support from the University of Maryland, Baltimore County (UMBC). See hpcf.umbc.edu for more information on HPCF and the projects using its re- 585 sources.en_US
dc.description.urihttps://www.sciencedirect.com/science/article/pii/S2214579621000290en_US
dc.format.extent35 pagesen_US
dc.genrejournal articlesen_US
dc.genrepreprintsen_US
dc.identifierdoi:10.13016/m2xlmo-btc0
dc.identifier.citationJonathan N. Basalyga, Carlos A. Barajas, Matthias K. Gobbert, Jianwu Wang. Performance Benchmarking of Parallel Hyperparameter Tuning for Deep Learning based Tornado Predictions. Big Data Research, vol. 25, no. 100212, 2021. DOI:10.1016/j.bdr.2021.100212en_US
dc.identifier.urihttps://doi.org/10.1016/j.bdr.2021.100212
dc.identifier.urihttp://hdl.handle.net/11603/25889
dc.language.isoen_USen_US
dc.publisherElsevieren_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Information Systems Department Collection
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Student Collection
dc.relation.ispartofUMBC Mathematics and Statistics Department
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.en_US
dc.subjectUMBC Big Data Analytics Laben_US
dc.subjectUMBC High Performance Computing Facility (HPCF)
dc.titlePerformance Benchmarking of Data Augmentation and GPU Count Variability for Deep Learning Tornado Predictionsen_US
dc.typeTexten_US
dcterms.creatorhttps://orcid.org/0000-0003-1745-2292en_US
dcterms.creatorhttps://orcid.org/0000-0002-9933-1170en_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2021-Performance Benchmarking of Data Augmentation and GPU Count Variability for Deep Learning Tornado Predictions.pdf
Size:
799.83 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: