A Survey of Large-Scale Deep Learning Serving System Optimization: Challenges and Opportunities

dc.contributor.authorYu, Fuxun
dc.contributor.authorWang, Di
dc.contributor.authorShangguan, Longfei
dc.contributor.authorZhang, Minjia
dc.contributor.authorTang, Xulong
dc.contributor.authorLiu, Chenchen
dc.contributor.authorChen, Xiang
dc.date.accessioned2022-01-10T17:41:31Z
dc.date.available2022-01-10T17:41:31Z
dc.date.issued2021-11-28
dc.description.abstractDeep Learning (DL) models have achieved superior performance in many application domains, including vision, language, medical, commercial ads, entertainment, etc. With the fast development, both DL applications and the underlying serving hardware have demonstrated strong scaling trends, i.e., Model Scaling and Compute Scaling, for example, the recent pre-trained model with hundreds of billions of parameters with ∼TB level memory consumption, as well as the newest GPU accelerators providing hundreds of TFLOPS. With both scaling trends, new problems and challenges emerge in DL inference serving systems, which gradually trends towards Large-scale Deep learning Serving system (LDS). This survey aims to summarize and categorize the emerging challenges and optimization opportunities for large-scale deep learning serving systems. By providing a novel taxonomy, summarizing the computing paradigms, and elaborating the recent technique advances, we hope that this survey could shed lights on new optimization perspectives and motivate novel works in large-scale deep learning system optimization.en_US
dc.description.urihttps://arxiv.org/abs/2111.14247en_US
dc.format.extent11 pagesen_US
dc.genrejournal articlesen_US
dc.genrepreprintsen_US
dc.identifierdoi:10.13016/m2aqlo-ehkb
dc.identifier.urihttp://hdl.handle.net/11603/23911
dc.language.isoen_USen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Faculty Collection
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.en_US
dc.rightsAttribution 4.0 International (CC BY 4.0)*
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/*
dc.titleA Survey of Large-Scale Deep Learning Serving System Optimization: Challenges and Opportunitiesen_US
dc.typeTexten_US
dcterms.creatorhttps://orcid.org/0000-0001-7749-0640en_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2111.14247.pdf
Size:
2.25 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: