In the video above, Doug Cutting, the creator of the Apache Hadoop project, discusses how the technology was first developed for large web companies (like Facebook, Google, Yahoo, which all use the open-source technology). Essentially, Cutting says Hadoop was born out of need: the data landscape was changing, and fast. Data volumes, especially complex data (video logs, visuals, etc.), were growing everywhere and there was a need for a cost-effective place to collect this data and mine it.
Cutting now works at Cloudera, the startup that commercially distributes and services Hadoop. Hadoop is a Java software framework born out of an open-source implementation of Google’s published computing infrastructure which is fostered within the Apache Software Foundation. Hadoop supports distributed applications running on large clusters of commodity computers processing enormous amounts of data.
Via Cloudera, Hadoop is currently used by most of the giants in the space including Google, Yahoo, Facebook (we wrote about Facebook’s use of Cloudera here), Amazon, AOL, Baidu and more. To date, Cloudera has raised $11 million in funding from Accel Partners and Greylock Partners.