First page Back Continue Last page Image

Google And Hadoop

Google’s Distributed File System GFS - 2003

Google’s Distributed Parallel Processing algorithm – Map Reduce - 2004

Google’s Distributed Structure Database – Big Table - 2006

Google’s Modular Data Center

Apache Hadoop is a collection of Open Source software utilities that facilitate using a network of many computers to solve problems involving massive amounts of data and computation