E3: An elastic execution engine for scalable data processing

Please use this identifier to cite or link to this item: https://doi.org/10.2197/ipsjjip.vol.20.65

DC Field	Value
dc.title	E3: An elastic execution engine for scalable data processing
dc.contributor.author	Chen, G.
dc.contributor.author	Chen, K.
dc.contributor.author	Jiang, D.
dc.contributor.author	Ooi, B.C.
dc.contributor.author	Shi, L.
dc.contributor.author	Vo, H.T.
dc.contributor.author	Wu, S.
dc.date.accessioned	2013-07-04T07:42:26Z
dc.date.available	2013-07-04T07:42:26Z
dc.date.issued	2012
dc.identifier.citation	Chen, G.,Chen, K.,Jiang, D.,Ooi, B.C.,Shi, L.,Vo, H.T.,Wu, S. (2012). E3: An elastic execution engine for scalable data processing. Journal of Information Processing 20 (1) : 65-76. ScholarBank@NUS Repository. <a href="https://doi.org/10.2197/ipsjjip.vol.20.65" target="_blank">https://doi.org/10.2197/ipsjjip.vol.20.65</a>
dc.identifier.issn	03875806
dc.identifier.uri	http://scholarbank.nus.edu.sg/handle/10635/39475
dc.description.abstract	With the unprecedented growth of data generated by mankind nowadays, it has become critical to de- velop efficient techniques for processing these massive data sets. To tackle such challenges, analytical data processing systems must be extremely efficient, scalable, and flexible as well as economically effective. Recently, Hadoop, an open-source implementation of MapReduce, has gained interests as a promising big data processing system. Although Hadoop offers the desired flexibility and scalability, its performance has been noted to be suboptimal when it is used to process complex analytical tasks. This paper presents E3, an elastic and efficient execution engine for scalable data processing. E3 adopts a "middle" approach between MapReduce and Dryad in that E3 has a simpler communication model than Dryad yet it can support multi-stages job better than MapReduce. E3 avoids reprocessing intermediate results by adopting a stage-based evaluation strategy and collocating data and user-defined (map or reduce) functions into independent processing units for parallel execution. Furthermore, E3 supports block-level indexes, and built-in functions for specifying and optimizing data processing flows. Benchmarking on an in-house cluster shows that E3 achieves significantly better performance than Hadoop, or put it another way, building an elastically scalable and efficient data processing system is possible. © 2012 Information Processing Society of Japan.
dc.description.uri	http://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.2197/ipsjjip.vol.20.65
dc.source	Scopus
dc.subject	Cloud computing
dc.subject	Elastic exection engine
dc.subject	Parallel processing
dc.type	Article
dc.contributor.department	COMPUTER SCIENCE
dc.description.doi	10.2197/ipsjjip.vol.20.65
dc.description.sourcetitle	Journal of Information Processing
dc.description.volume	20
dc.description.issue	1
dc.description.page	65-76
dc.identifier.isiut	NOT_IN_WOS
Appears in Collections:	Staff Publications

Show simple item record

Files in This Item:

There are no files associated with this item.

Google Scholar^TM

Check

Files in This Item:

Google ScholarTM

Altmetric

Google Scholar^TM