SuryaBench: Benchmark Dataset for Advancing Machine Learning in Heliophysics and Space Weather Prediction

dc.contributor.authorRoy, Sujit
dc.contributor.authorHegde, Dinesha V.
dc.contributor.authorSchmude, Johannes
dc.contributor.authorLin, Amy
dc.contributor.authorGaur, Vishal
dc.contributor.authorLal, Rohit
dc.contributor.authorMandal, Kshitiz
dc.contributor.authorSingh, Talwinder
dc.contributor.authorMuñoz-Jaramillo, Andrés
dc.contributor.authorYang, Kang
dc.contributor.authorPandey, Chetraj
dc.contributor.authorHong, Jinsu
dc.contributor.authorAydin, Berkay
dc.contributor.authorMcGranaghan, Ryan
dc.contributor.authorKasapis, Spiridon
dc.contributor.authorUpendran, Vishal
dc.contributor.authorBahauddin, Shah
dc.contributor.authorda Silva, Daniel
dc.contributor.authorFreitag, Marcus
dc.contributor.authorGurung, Iksha
dc.contributor.authorPogorelov, Nikolai
dc.contributor.authorWatson, Campbell
dc.contributor.authorMaskey, Manil
dc.contributor.authorBernabe-Moreno, Juan
dc.contributor.authorRamachandran, Rahul
dc.date.accessioned2025-10-29T19:15:09Z
dc.date.issued2025-08-18
dc.description.abstractThis paper introduces a high resolution, machine learning-ready heliophysics dataset derived from NASA's Solar Dynamics Observatory (SDO), specifically designed to advance machine learning (ML) applications in solar physics and space weather forecasting. The dataset includes processed imagery from the Atmospheric Imaging Assembly (AIA) and Helioseismic and Magnetic Imager (HMI), spanning a solar cycle from May 2010 to July 2024. To ensure suitability for ML tasks, the data has been preprocessed, including correction of spacecraft roll angles, orbital adjustments, exposure normalization, and degradation compensation. We also provide auxiliary application benchmark datasets complementing the core SDO dataset. These provide benchmark applications for central heliophysics and space weather tasks such as active region segmentation, active region emergence forecasting, coronal field extrapolation, solar flare prediction, solar EUV spectra prediction, and solar wind speed estimation. By establishing a unified, standardized data collection, this dataset aims to facilitate benchmarking, enhance reproducibility, and accelerate the development of AI-driven models for critical space weather prediction tasks, bridging gaps between solar physics, machine learning, and operational forecasting.
dc.description.sponsorshipThis work is supported by NASA Grant 80MSFC22M004. The Authors acknowledge the National Artificial Intelligence Research Resource (NAIRR) Pilot and NVIDIA for providing support under grant no. NAIRR240178.
dc.description.urihttp://arxiv.org/abs/2508.14107
dc.format.extent25 pages
dc.genrejournal articles
dc.identifierdoi:10.13016/m2nilh-d0bx
dc.identifier.urihttps://doi.org/10.48550/arXiv.2508.14107
dc.identifier.urihttp://hdl.handle.net/11603/40721
dc.language.isoen
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Goddard Planetary Heliophysics Institute (GPHI)
dc.rightsThis work was written as part of one of the author's official duties as an Employee of the United States Government and is therefore a work of the United States Government. In accordance with 17 U.S.C. 105, no copyright protection is available for such works under U.S. Law.
dc.rightsPublic Domain
dc.rights.urihttps://creativecommons.org/publicdomain/mark/1.0/
dc.subjectAstrophysics - Instrumentation and Methods for Astrophysics
dc.subjectComputer Science - Artificial Intelligence
dc.subjectAstrophysics - Solar and Stellar Astrophysics
dc.titleSuryaBench: Benchmark Dataset for Advancing Machine Learning in Heliophysics and Space Weather Prediction
dc.typeText
dcterms.creatorhttps://orcid.org/0000-0001-7537-3539

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2508.14107v1.pdf
Size:
17.07 MB
Format:
Adobe Portable Document Format