Design Space Exploration of Data-centric Architectures

dc.contributor.advisorHalem, Milton
dc.contributor.authorPrathapan, Smriti
dc.contributor.departmentComputer Science and Electrical Engineering
dc.contributor.programComputer Science
dc.date.accessioned2021-09-01T13:55:21Z
dc.date.available2021-09-01T13:55:21Z
dc.date.issued2020-01-20
dc.description.abstractThe era of "big data" is leading to changes in the compute paradigm, in particular to the notion of moving computation to data, known as Near Data Processing (NDP). Technological advancements have enabled the application of NDP at many levels of the memory hierarchy from cache to DRAM, from non-volatile storage-class memory to processors embedded in storage devices. This dissertations explores the effectiveness of data-centric compute architectures using Active Storage, Processing-in-Memory and Coherent Access Processor Interface (CAPI) accelerated Flash storage. We developed a compute framework Active In-Storage (AiSTOR) that enables scalable distributed Big Data Processing by directly performing the computations on active storage devices. AiSTOR has the following three major advantages: (i) active storage utilizes the processor capabilities on the storage devices and this significantly reducing the bandwidth requirement of the network, (ii) the computations can take advantage of the inherent map/reduce parallelism by using the array of the distributed storage processors available on the active data storage devices, thereby aggregating the processing power of a cluster of active devices, (iii) it can perform coherent processing of streaming data as it arrives on the storage devices. We define a generic NDP architecture which is well-suited for memory-bound computations and implement the software kernels for NDP-based algorithmic mapping.We show for a modest sized NDP system, that the AiSTOR architecture framework employing distributed processing algorithms can yield efficient and accurate computational processing performance. In comparison with Hadoop based MapReduce, the compute times on AiSTOR has significant performance benefits by up to 18%, while providing very competitive results compared to Spark-based in-memory processing. The effectiveness of the NDP architecture is demonstrated by evaluating the row-buffer management policies (open-page and closed-page) with the controller modifications and a generic unmodified architecture. The PIM open-page policy has 52% higher operation throughput than the host and 3.7% higher throughput and 50% lesser DRAM activations than PIM-closed page policy. Further, this dissertations explores the potential impact of hidden CPU usage in handling the IO requests in heterogeneous storage systems when using CAPIFlash library and finding the optimal IO/s and OP/s for heterogeneous storage memory devices such as NVM, SSD and RAM. FS900 with CAPI, when using the RAM metadata cache, performed 2x as many read OP/s in synchronous mode and 3x in asynchronous mode. SSD and NVM had 66% higher IO/s in comparison with RAM in asynchronous mode.
dc.formatapplication:pdf
dc.genredissertations
dc.identifierdoi:10.13016/m2ctv2-wote
dc.identifier.other12276
dc.identifier.urihttp://hdl.handle.net/11603/22824
dc.languageen
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Theses and Dissertations Collection
dc.relation.ispartofUMBC Graduate School Collection
dc.relation.ispartofUMBC Student Collection
dc.sourceOriginal File Name: Prathapan_umbc_0434D_12276.pdf
dc.subjectActive Storage
dc.subjectCoherent Accelerator Processor Interface
dc.subjectData-centric computing
dc.subjectNear Data Computing
dc.subjectProcessing in Memory
dc.titleDesign Space Exploration of Data-centric Architectures
dc.typeText
dcterms.accessRightsDistribution Rights granted to UMBC by the author.
dcterms.accessRightsThis item may be protected under Title 17 of the U.S. Copyright Law. It is made available by UMBC for non-commercial research and education. For permission to publish or reproduce, please see http://aok.lib.umbc.edu/specoll/repro.php or contact Special Collections at speccoll(at)umbc.edu

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Prathapan_umbc_0434D_12276.pdf
Size:
3.06 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Prathapan-Smriti_Open.pdf
Size:
7.18 MB
Format:
Adobe Portable Document Format
Description: