Big Data - Hadoop Illuminated

Category: Science and Technology

Date Submitted: 12/04/2014 01:23 PM

Hadoop Illuminated

by Mark Kerzner and Sujee Maniyam

Dedication

To the open source community

This book on GitHub [https://github.com/hadoop-illuminated/hadoop-book]

Companion project on GitHub [https://github.com/hadoop-illuminated/HI-labs]

Acknowledgements

From Mark

I would like to express gratitude to my editors, co-authors, colleagues, and bosses who shared the thorny path to working clusters, in the hope of making it less thorny for those who follow. Seriously, folks, Hadoop is hard, and Big Data is tough, and there are many related products and skills that you need to master. Therefore, have fun, send us your feedback [http://groups.google.com/group/hadoop-illuminated], and I hope you will find the book entertaining.

"The author's opinions do not necessarily coincide with his point of view." - Victor Pelevin, "Generation P" [http://lib.udm.ru/lib/PELEWIN/pokolenie_engl.txt]

From Sujee

To the kind souls who helped me along the way

Copyright © 2013 Hadoop illuminated LLC. All Rights Reserved.

Table of Contents

1. Who is this book for?
    1.1. About "Hadoop illuminated"
2. About Authors
3. Why do I Need Hadoop?
    3.1. Hadoop provides storage for Big Data at reasonable cost
    3.2. Hadoop allows to capture new or more data
    3.3. With Hadoop, you can store data longer
    3.4. Hadoop provides scalable analytics...