On Github PierreZ / big-data-presentation-2014
Created by Pierre Zemb / @PierreZ
I'm just a student!
Hadoop doesn't belong to this !
90% of the data in the world today has been created in the last two years alone
Bring together and analyze large pools of data to discern patterns and make better decisions, which are impossible with regular technologies
Quote from Wikipedia:
Apache Hadoop is an open-source software framework for storage and large scale processing of data-sets on clusters of commodity hardware created by Yahoo.Apache Hadoop's MapReduce and HDFS components originally derived respectively from:
Data locality optimization