Performance Benchmarking of Data Augmentation and GPU Count Variability for Deep Learning Tornado Predictions

dc.contributor.authorBasalyga, Jonathan N.
dc.contributor.authorBarajas, Carlos A.
dc.contributor.authorGobbert, Matthias
dc.contributor.authorWang, Jianwu
dc.date.accessioned2022-09-26T16:39:32Z
dc.date.available2022-09-26T16:39:32Z
dc.date.issued2021-07-15
dc.description.abstractPredicting violent storms and dangerous weather conditions with current mod- els can take a long time due to the immense complexity associated with weather simulation. Machine learning has the potential to classify tornadic weather patterns much more rapidly, thus allowing for more timely alerts to the pub- lic. To deal with class imbalance challenges in machine learning, different data augmentation approaches have been proposed. In this work, we examine the wall time difference between live data augmentation methods versus the use of preaugmented data when they are used in a convolutional neural network based training for tornado prediction. We also compare CPU and GPU based training over varying sizes of augmented data sets. Additionally we examine what impact varying the number of GPUs used for training will produce given a convolutional neural network on wall time and accuracy. We conclude that using multiple GPUs to train a single network has no signficant advantage over using a single GPU. The number of GPUs used during training should be kept as small as possible for maximum search throughput as the native keras multi-GPU model provides little speedup with optimal learning parameters.en
dc.description.sponsorshipThis work is supported by the grant “CyberTraining: DSE: Cross-Training of 575 Researchers in Computing, Applied Mathematics and Atmospheric Sciences us- ing Advanced Cyberinfrastructure Resources” from the National Science Foun- dation (grant no. OAC–1730250). Co-author Carlos Barajas additionally ac- knowledges support as HPCF RA. The hardware used in the computational studies is part of the UMBC High Performance Computing Facility (HPCF). 580 The facility is supported by the U.S. National Science Foundation through the MRI program (grant nos. CNS–0821258, CNS–1228778, and OAC–1726023) and the SCREMS program (grant no. DMS–0821311), with additional substan- tial support from the University of Maryland, Baltimore County (UMBC). See hpcf.umbc.edu for more information on HPCF and the projects using its re- 585 sources.en
dc.description.urihttps://www.sciencedirect.com/science/article/pii/S2214579621000290en
dc.format.extent35 pagesen
dc.genrejournal articlesen
dc.genrepreprintsen
dc.identifierdoi:10.13016/m2xlmo-btc0
dc.identifier.citationJonathan N. Basalyga, Carlos A. Barajas, Matthias K. Gobbert, Jianwu Wang. Performance Benchmarking of Parallel Hyperparameter Tuning for Deep Learning based Tornado Predictions. Big Data Research, vol. 25, no. 100212, 2021. DOI:10.1016/j.bdr.2021.100212en
dc.identifier.urihttps://doi.org/10.1016/j.bdr.2021.100212
dc.identifier.urihttp://hdl.handle.net/11603/25889
dc.language.isoenen
dc.publisherElsevieren
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Information Systems Department Collection
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Student Collection
dc.relation.ispartofUMBC Mathematics and Statistics Department
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.en
dc.subjectUMBC High Performance Computing Facility (HPCF)
dc.subjectUMBC Big Data Analytics Laben
dc.titlePerformance Benchmarking of Data Augmentation and GPU Count Variability for Deep Learning Tornado Predictionsen
dc.typeTexten
dcterms.creatorhttps://orcid.org/0000-0003-1745-2292en
dcterms.creatorhttps://orcid.org/0000-0002-9933-1170en

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2021-Performance Benchmarking of Data Augmentation and GPU Count Variability for Deep Learning Tornado Predictions.pdf
Size:
799.83 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: