BIG DATA ANALYTICS USING APACHE HADOOP

SEMINAR REPORT

Submitted in partial fulfilment of the requirements for the award of Bachelor of Technology Degree in Computer Science and Engineering of the University of Kerala

Submitted by
ABIN BABY
Roll No: 1
Seventh Semester B.Tech Computer Science and Engineering

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
COLLEGE OF ENGINEERING TRIVANDRUM
2014
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
COLLEGE OF ENGINEERING TRIVANDRUM

CERTIFICATE

This is to certify that this seminar report entitled “BIG DATA ANALYTICS USING APACHE HADOOP” is a bona fide record of the work done by Abin Baby, under our guidance, towards partial fulfilment of the requirements for the award of the Degree of Bachelor of Technology in Computer Science and Engineering of the University of Kerala during the year 2011-2015.

Dr. Abdul Nizar A, Professor, Dept. of CSE (Head of the Department)
Mrs. Sabitha S, Assoc. Professor, Dept. of CSE (Guide)
Mrs. Rani Koshi, Assoc. Professor, Dept. of CSE (Guide)
ACKNOWLEDGEMENTS

I would like to express my sincere gratitude and heartfelt indebtedness to my guide, Dr. Abdul Nizar, Head of Department, Department of Computer Science and Engineering, for valuable guidance and encouragement in pursuing this seminar. I am also very thankful to Mrs. Sabitha S, Associate Professor, Department of Computer Science and Engineering, for her help and support. I also extend my hearty gratitude to the Seminar Co-ordinator, Mrs. Rani Koshi, Associate Professor, Department of CSE, College of Engineering Trivandrum, for providing the necessary facilities and for her sincere co-operation. My sincere thanks are extended to all the teachers of the Department of CSE and to all my friends for their help and support. Above all, I thank God for the immense grace and blessings at all stages of the project.

Abin Baby
ABSTRACT

The paradigm for processing huge datasets has shifted from centralized to distributed architectures. As enterprises began gathering very large volumes of data, they found that it could not be processed using any of the existing centralized solutions. Apart from time constraints, enterprises faced problems of efficiency, performance and elevated infrastructure cost when processing data in a centralized environment. With the help of distributed architectures, these large organizations were able to overcome the difficulty of extracting relevant information from a huge data dump. One of the leading open source tools for harnessing a distributed architecture to solve such data processing problems is Apache Hadoop. Using Apache Hadoop's components, such as data clusters, MapReduce algorithms and distributed processing, we address various complex location-based data problems and feed the relevant information back into the system, thereby improving the user experience.