A Comparison of Big Data Application Programming Approaches: A Travel Companion Case Study

dc.contributor.authorGuo, Pei
dc.contributor.authorWang, Jianwu
dc.contributor.authorChen, Zhiyuan
dc.date.accessioned2018-02-27T20:34:14Z
dc.date.available2018-02-27T20:34:14Z
dc.date.issued2017
dc.description.abstractWith advances of big data technologies, there are many possible ways to program for each big data application. A challenge is to know the differences of the program approaches and decide which programming approach is the best for a particular big data application. In this paper, we use vehicle travel companion as a case study to explore four different programming approaches, including Spark RDD (with GroupBy or Join), Spark SQL with Hive and Hive on Hadoop, and tune the programmed big data applications. Our experiments show that the execution time of one programming approach could be more than 100-fold longer than that of another for the same application logic, which verifies that programming approach decision is important. We also explain the reasons for the differences. The findings could be applied to the selection of programming approach for other big data applications.en_US
dc.format.extent10 pagesen_US
dc.genreconference papers and proceedingsen_US
dc.identifierdoi:10.13016/M2P55DJ51
dc.identifier.citationP. Guo, J. Wang and Z. Chen, "A comparison of big data application programming approaches: A travel companion case study," 2017 IEEE International Conference on Big Data (Big Data), Boston, MA, 2017, pp. 2869-2878. DOI: 10.1109/BigData.2017.8258255en_US
dc.identifier.urihttp://hdl.handle.net/11603/7832
dc.language.isoen_USen_US
dc.publisherIEEEen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Information Systems Department Collection
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Student Collection
dc.rightsThis item may be protected under Title 17 of the U.S. Copyright Law. It is made available by UMBC for non-commercial research and education. For permission to publish or reproduce, please contact the author.
dc.rights.holder© 2017 IEEE
dc.subjectbig dataen_US
dc.subjecthiveen_US
dc.subjectSQLen_US
dc.subjectprogramming comparisonen_US
dc.subjectbenchmarkingen_US
dc.subjecttravel companionen_US
dc.subjectSparksen_US
dc.subjectbig data application programmingen_US
dc.titleA Comparison of Big Data Application Programming Approaches: A Travel Companion Case Studyen_US
dc.typeTexten_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
A Comparison of Big Data Application Programming Approaches_ A Travel Companion Case Study.pdf
Size:
752.35 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
license.txt
Size:
1.68 KB
Format:
Item-specific license agreed upon to submission
Description:
No Thumbnail Available
Name:
GuoLicense.pdf
Size:
392.67 KB
Format:
Adobe Portable Document Format
Description: