A Data Intensive Statistical Aggregation Engine: A Case Study for Gridded Climate Records

dc.contributor.authorChapman, David
dc.contributor.authorSimon, Tyler A.
dc.contributor.authorNguyen, Phuong
dc.contributor.authorHalem, Milton
dc.date.accessioned2023-10-26T19:14:49Z
dc.date.available2023-10-26T19:14:49Z
dc.date.issued2013-10-31
dc.description2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum, 20-24 May 2013, Cambridge, MA, USAen_US
dc.description.abstractSatellite derived climate instrument records are often highly structured and conform to the "Data-Cube" topology. However, data scales on the order of tens to hundreds of Terabytes make it more difficult to perform the rigorous statistical aggregation and analytics necessary to investigate how our climate is changing over time and space. It is especially cumbersome to supply the full derivation (provenance) of this analysis, as is increasingly required by scientific conferences and journals. In this paper, we address our approach toward the creation of a 55 Terabyte decadal record of Outgoing Long wave Spectrum (OLS) from the NASA Atmospheric Infrared Sounder (AIRS), and describe our open source data-intensive statistical aggregation engine "Gridderama" intended primarily for climate trend analysis, and may be applicable to other aggregation problems involving large structured datasets.en_US
dc.description.urihttps://ieeexplore.ieee.org/document/6651122en_US
dc.format.extent8 pagesen_US
dc.genreconference papers and proceedingsen_US
dc.genrepreprintsen_US
dc.identifierdoi:10.13016/m2qyno-f0ky
dc.identifier.citationChapman, David, Tyler A. Simon, Phuong Nguyen, and Milton Halem. “A Data Intensive Statistical Aggregation Engine: A Case Study for Gridded Climate Records.” In 2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum, 2157–64, 2013. https://doi.org/10.1109/IPDPSW.2013.87.en_US
dc.identifier.urihttps://doi.org/10.1109/IPDPSW.2013.87
dc.identifier.urihttp://hdl.handle.net/11603/30411
dc.language.isoen_USen_US
dc.publisherIEEEen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Student Collection
dc.rights© 2013 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.en_US
dc.titleA Data Intensive Statistical Aggregation Engine: A Case Study for Gridded Climate Recordsen_US
dc.typeTexten_US
dcterms.creatorhttps://orcid.org/0000-0002-4862-8396en_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
gre_cloudflow13.pdf
Size:
1.27 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: