Performance Benchmarking of Data Augmentation and GPU Count Variability for Deep Learning Tornado Predictions
dc.contributor.author | Basalyga, Jonathan N. | |
dc.contributor.author | Barajas, Carlos A. | |
dc.contributor.author | Gobbert, Matthias | |
dc.contributor.author | Wang, Jianwu | |
dc.date.accessioned | 2022-09-26T16:39:32Z | |
dc.date.available | 2022-09-26T16:39:32Z | |
dc.date.issued | 2021-07-15 | |
dc.description.abstract | Predicting violent storms and dangerous weather conditions with current mod- els can take a long time due to the immense complexity associated with weather simulation. Machine learning has the potential to classify tornadic weather patterns much more rapidly, thus allowing for more timely alerts to the pub- lic. To deal with class imbalance challenges in machine learning, different data augmentation approaches have been proposed. In this work, we examine the wall time difference between live data augmentation methods versus the use of preaugmented data when they are used in a convolutional neural network based training for tornado prediction. We also compare CPU and GPU based training over varying sizes of augmented data sets. Additionally we examine what impact varying the number of GPUs used for training will produce given a convolutional neural network on wall time and accuracy. We conclude that using multiple GPUs to train a single network has no signficant advantage over using a single GPU. The number of GPUs used during training should be kept as small as possible for maximum search throughput as the native keras multi-GPU model provides little speedup with optimal learning parameters. | en_US |
dc.description.sponsorship | This work is supported by the grant “CyberTraining: DSE: Cross-Training of 575 Researchers in Computing, Applied Mathematics and Atmospheric Sciences us- ing Advanced Cyberinfrastructure Resources” from the National Science Foun- dation (grant no. OAC–1730250). Co-author Carlos Barajas additionally ac- knowledges support as HPCF RA. The hardware used in the computational studies is part of the UMBC High Performance Computing Facility (HPCF). 580 The facility is supported by the U.S. National Science Foundation through the MRI program (grant nos. CNS–0821258, CNS–1228778, and OAC–1726023) and the SCREMS program (grant no. DMS–0821311), with additional substan- tial support from the University of Maryland, Baltimore County (UMBC). See hpcf.umbc.edu for more information on HPCF and the projects using its re- 585 sources. | en_US |
dc.description.uri | https://www.sciencedirect.com/science/article/pii/S2214579621000290 | en_US |
dc.format.extent | 35 pages | en_US |
dc.genre | journal articles | en_US |
dc.genre | preprints | en_US |
dc.identifier | doi:10.13016/m2xlmo-btc0 | |
dc.identifier.citation | Jonathan N. Basalyga, Carlos A. Barajas, Matthias K. Gobbert, Jianwu Wang. Performance Benchmarking of Parallel Hyperparameter Tuning for Deep Learning based Tornado Predictions. Big Data Research, vol. 25, no. 100212, 2021. DOI:10.1016/j.bdr.2021.100212 | en_US |
dc.identifier.uri | https://doi.org/10.1016/j.bdr.2021.100212 | |
dc.identifier.uri | http://hdl.handle.net/11603/25889 | |
dc.language.iso | en_US | en_US |
dc.publisher | Elsevier | en_US |
dc.relation.isAvailableAt | The University of Maryland, Baltimore County (UMBC) | |
dc.relation.ispartof | UMBC Information Systems Department Collection | |
dc.relation.ispartof | UMBC Faculty Collection | |
dc.relation.ispartof | UMBC Student Collection | |
dc.relation.ispartof | UMBC Mathematics and Statistics Department | |
dc.rights | This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author. | en_US |
dc.subject | UMBC Big Data Analytics Lab | en_US |
dc.subject | UMBC High Performance Computing Facility (HPCF) | |
dc.title | Performance Benchmarking of Data Augmentation and GPU Count Variability for Deep Learning Tornado Predictions | en_US |
dc.type | Text | en_US |
dcterms.creator | https://orcid.org/0000-0003-1745-2292 | en_US |
dcterms.creator | https://orcid.org/0000-0002-9933-1170 | en_US |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- 2021-Performance Benchmarking of Data Augmentation and GPU Count Variability for Deep Learning Tornado Predictions.pdf
- Size:
- 799.83 KB
- Format:
- Adobe Portable Document Format
- Description:
License bundle
1 - 1 of 1
Loading...
- Name:
- license.txt
- Size:
- 2.56 KB
- Format:
- Item-specific license agreed upon to submission
- Description: