Open Access Open Access  Restricted Access Subscription or Fee Access

Study of Map Reduce Over Different Cube Computation Approaches: Survey Paper

Swati B. Gawade, T.A. Dhaygude

Abstract


MapReduce is a programing poser and an related effort for processing and generating super aggregation sets. Users determine a map operate that processes a key/value place to make a set of medium key/value pairs, and a Reduce function work that merges all middle values associated with the very mediate key. Efficient extract of aggregations very significant role in Data wearhouse Store systems. Multidimensionalaccumulation psychotherapy applications data crosswise much dimensions designer for anomalies or unusual patterns. The SQL mass functions and the GROUP BY operator are utilised for accumulation. But Data psychotherapy applications requirement the N-dimensional generality of these operators. Data cube is introduced which is a way of structuring information in N-dimensions so as to spread analysis over few measure. Datawearhouse implementation for the essential part of data cube computation. The precomputation of all or endeavor of a data cube can greatly restrict the salutation quantify and deepen the executing of online analytical processing. Various strategies to Cube Materlization, there are various methods for cube computation and  specific computation algorithms, namely Star Cubing, BUC, Multiway array aggregation, parallel algorithms, the computation of shell fragments and. But these techniques some rule so new MapReduce based approach is used.


Keywords


Bottom Up Computaton, Cube Computing Techniques, Data Cubes, Hadoop Map Reduce, Star Cubing.

Full Text:

PDF

References


J. Dean and S. Ghemawat. “MapReduce: Simplified Data Processing on Large Clusters.” In OSDI, 2004.

A. Pavlo, A. Rasin, S. Madden, M. Stonebraker, D. DeWitt,E. Paulson, L. Shrinivas, and D. J. Abadi.” A Comparison of Approaches to Large Scale Data Analysis.”In Proc.Of SIGMOD, 2009.

A. Abouzeid et al., “HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads,” Proc. VLDB Endowment, vol. 2, pp. 922-933, 2009.

D. Xin et al., “Star-Cubing: Computing Iceberg Cubes by Top- Down And Bottom-Up Integration,” Proc. 29th Int’l Conf. Very Large Data Bases (VLDB), 2003.

R.T. Ng, A.S. Wagner, and Y. Yin, “Iceberg-Cube Computation with PC Clusters,” Proc. ACM SIGMOD Int’l Conf. Management of Data, 2001.

S. Agarwal, R. Agrawal, P. Deshpande, A. Gupta, J. Naughton, R.Ramakrishnan,and S. Sarawagi, "On the Computation of Multidimensional Aggregates," Proc.22nd Int’l Conf. Very Large Data Bases (VLDB), 1996.

K. Sergey and K. Yury, “Applying Map-Reduce Paradigm for Parallel Closed Cube Computation,” Proc. First Int’l Conf. Advances in Databases, Knowledge, and Data Applications (DBKDA), 2009.

J. Gray, S. Chaudhuri, A. Bosworth, A. Layman, D. Reichart, M. Venkatrao, F.Pellow, and H. Pirahesh, "Data Cube: A Relational Operator Generalizing Group-By, Cross-Tab and Sub-Totals," Proc.12th Int’l Conf. Data Eng. (ICDE), 1996.

K.V. Shvachko and A.C. Murthy, “Scaling Hadoop to 4000 Nodes at Yahoo!,” Yahoo! Developer Network Blog, 2008.

K. Ross and D. Srivastava, "Fast Computation of Sparse Datacubes," Proc. 23rd Int'l Conf. Very Large Data Bases(VLDB),1997

Y. Zhao, P. M. Deshpande, and J. F. Naughton.”An array-based algorithm for simultaneous multidimensional aggregates”. In SIGMOD'97.

A. Nandi, C. Yu, P. Bohannon, and R. Ramakrishnan, “Distributed Cube Materialization on Holistic Measures,” Proc. IEEE 27th Int’l Conf. Data Eng. (ICDE), 2011.

K. Beyer and R. Ramakrishnan, "Bottom-Up Computation of Sparse and Iceberg CUBEs," Proc. ACM SIGMOD Int’l Conf.Management of Data, 1999.

J. Hah, J. Pei, G. Dong, and K. Wang, "Efficient Computation of Iceberg Cubes with Complex Measures," Proc. ACM SIGMOD Int’l Conf. Management of Data,2001.

Xiaolei Li, JiaweiHan,Hector Gonzalez “High-Dimensional OLAP:A Minimal Cubing Approach“ University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA.


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.