Performance Benchmarking of Data Augmentation and Deep Learning for Tornado Prediction
Links to Files
https://ieeexplore.ieee.org/document/9006531Collections
Metadata
Show full item recordDate
2019Type of Work
9 pagesText
conference papers and proceedings postprints
Citation of Original Publication
C. A. Barajas, M. K. Gobbert and J. Wang, "Performance Benchmarking of Data Augmentation and Deep Learning for Tornado Prediction," 2019 IEEE International Conference on Big Data (Big Data), Los Angeles, CA, USA, 2019, pp. 3607-3615, doi: 10.1109/BigData47090.2019.9006531.Rights
This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.© 2020 IEEE
Subjects
deep learningdata augmentation
parallel performance
TensorFlow
Keras
UMBC High Performance Computing Facility (HPCF)
Abstract
Predicting violent storms and dangerous weather conditions with current models can take a long time due to the immense complexity associated with weather simulation. Machine learning has the potential to classify tornadic weather patterns much more rapidly, thus allowing for more timely alerts to the public. To deal with class imbalance challenges in machine learning, different data augmentation approaches have been proposed. In this work, we examine the wall time difference between live data augmentation methods versus the use of preaugmented data when they are used in a convolutional neural network based training for tornado prediction. We also compare CPU and GPU based training over varying sizes of augmented data sets. Additionally we examine what impact varying the number of GPUs used for training will produce given a convolutional neural network