Wednesday, April 23, 2008

Kai-Fu Lee on Cloud Computing

The numbers below can be looked at 3 says:
-- (5 years ago) Crazy Talk!
-- (now) That's pretty cool!
-- (5 years from now) Yeah so? My Pc does that.


Kai-Fu Lee on Cloud Computing: "6. Programmable. 'For fault tolerance, Google uses GFS or distributed disk storage. Every piece of data is replicated three times. If one machine dies, a master redistributes the data to a new server. There are around 200 clusters (some with over 5 PB of disk space on 500 machines). The Big Table is used for distributed memory. The largest cells in the Big Table are 700 TB, spread over 2000 machines. MapReduce is the solution for new programming paradigms. It cuts a trillion records into a thousand parts on a thousand machines. Each machine will then load a billion records and will run the same program over these records, and then the results are recombined. While in 2005, there were some 72,000 jobs being run on MapReduce, in 2007, there were two million jobs (use seems to be increasing exponentially).'"

No comments: