Open Access Open Access  Restricted Access Subscription or Fee Access

Survey on Performance Analysis of Hadoop ETL for Disaster Management

M. Saranya

Abstract


The term big data refers to data sets whose volume, variability and Speed of velocity make them difficult to capture, manage, procedure or analyzed. To examine this huge amount of data Hadoop is able to be used. Hadoop is an open source software project that enables the Spread giving out of large data sets across a cluster of creationservers.ETL Tools extract important information from various data sources, various transFormation’s of data are established out transformation phase and then load into the big data. HDFS (Hadoop Distributed File System), is a spread file system design to hold the very huge of data (peta bytes or even zetta bytes), and there high throughput Admission to this information. Map Reduce method has been calculated n this paper which is required for implement Big Data Analysis using HDFS. In this paper the related topics of Big Data Analytics, and Hadoop, ETL, Map Reduce are reviewed.


Keywords


Big Data, Hadoop, ETL, Map Reduce, HDFS

Full Text:

PDF

References


Varsha B.Bobad, “International Research Journal of Engineering and Technology (IRJET)”, Volume: 03 Issue: 01

Inmon, William "Data Mart Does Not Equal Data Warehouse".DM Review.com. (2000-07-18)

Jeffrey R. Bocarsly, “The Data Warehouse Toolkit.” Complex ETL Testing-A Strategic Approach

R. Kimball and M. Ross. WileyPublishing, Inc., 2002.

“Survey of Recent Research Progress and Issues in Big Data” www.cse.wustl.edu/~jain/cse570-13/ftp/bigdata2/index.html 1/13

Ms. Vibhavari Chavan, Prof. Rajesh. N. Phursule, ―”Survey Paper on Big Data”‖ International Journal of Computer Science and Information Technologies, Vol. 5 (6), 2014.

Ms. Vibhavari Chavan, Prof. Rajesh. N. Phursule, ―”Survey Paper on Big Data” International Journal of Computer Science and Information Technologies, Vol. 5 (6), 2014.

Amogh Pramod Kulkarni, Mahesh Khandewal, ―”Survey on Hadoop and Introduction to YARN”, International Journal of Emerging Technology and Advanced Engineering Website: www.ijetae.com (ISSN 2250-2459, ISO 9001:2008 Certified Journal, Volume 4, Issue 5, May 2014)

Tekiner F. and Keane J.A., Systems, Man and Cybernetics (SMC), ―”Big Data Framework” 2013 IEEE International Conference on 13–16 Oct. 2013, 1494–1499

Inmon, William (2000-07-18). "Data Mart Does Not Equal Data Warehouse". DMReview.com

V. Hristidis, S. Chen, T. Li, S. Luis, and Y. Deng, "Survey of Data Management and Analysis in Disaster Situations," Journal of Systems and Software, vol. 83, no. 10, pp. 17011714, 2010

Kuldeep Deshpande, and dr. Bhimappa desai limitations of dataware house platforms and Assessment of Hadoop as an Alternative, Volume 5, Issue 2, pp. 51-58,IJITMIS, 2014

Saleem, K., Luis, S., Deng, Y., Chen, S.-C., Hristidis, V., Li, T., 2008. "Towards a business Continuity information network for rapid disaster recovery.” In: Proceedings of the 9th Annual International Conference on Digital Government Research, Montreal, Canada, May 18–21, pp. 107–116

Sagiroglu, S.Sinanc, D.,‖ Big Data: A Review‖,2013, 20-24.

Tekiner F. and Keane J.A., Systems, Man and Cybernetics (SMC), ―”Big Data Framework”‖ 2013 IEEE International Conference on 13–16 Oct. 2013, 1494–1499

S.Vikram Phaneendra & E.Madhusudhan Reddy “Big Data-solutions for RDBMS problems-

A survey” In 12th IEEE/IFIP Network Operations & Management Symposium (NOMS 2010) (Osaka, Japan, Apr 19{23 2013).

Senthi Vadivel Bhupatthi Rav” Disaster Management: A Global Issue” International journal of civil and structural engineering Volume 1, No 1, 2010

Wang, F. Hadoop High Availability through Metadata Replication. ACM (2009).

Vibhavari Chavan et al, / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 5 (6) , 2014, 7932-7939




DOI: http://dx.doi.org/10.36039/AA042018004.

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.