Google’s Distributed File System GFS - 2003
Google’s Distributed Parallel Processing algorithm – Map Reduce - 2004
Google’s Distributed Structure Database – Big Table - 2006
Google’s Modular Data Center
Apache Hadoop is a collection of Open Source software utilities that facilitate using a network of many computers to solve problems involving massive amounts of data and computation