• battlefield-13a_01battlefield-13a_02

  • Video: Developer Doug Cutting Talks About The Founding Of Hadoop

    Leena Rao

    Leena Rao is currently a Senior Editor for TechCrunch. She recently finished graduate school at the Medill School of Journalism at Northwestern University, where she studied business journalism and videography. From 2004 to 2007, she helped lead Congresswoman Carloyn Maloney’s community outreach and relations efforts in New York City. She graduated from Columbia University in 2003, where she was... → Learn More

    Friday, March 12th, 2010

    http://www.veeple.com/swf/VeeplePlayer.swf?siteId=BrFxYpvi%252FnE%253D&videoId=6fad3bc5-b2b4-4387-af3b-1cf44cc8aa3a&userId=&baseUrl=http://www.veeple.com/&showSpots=1&showViewBar=1&showTabBar=1&mute=0&spotScaleMode=noScale&autoPlay=0&allowAddComments=0&allowShare=0&allowEmbedding=0&allowFullscreen=0&allowRating=0&stopPlayingOnInteractiveClick=1&displayRelatedVideos=0&showWorm=0&showLogo=0&logoIcon=0&whiteLabel=1&showTabClickableObjects=0&showTabDetails=0&showTabComments=0&playerMode=player&playerWidth=640&playerHeight=396&isFlex=0&recordEvents=1&deploymentUrl=http://www.micro-documentaries.com

    In the video above, Doug Cutting, the creator of the Apache Hadoop project, discusses how the technology was first developed for large web companies (like Facebook, Google, Yahoo, which all use the open-source technology). Essentially, Cutting says Hadoop was born out of need: the data landscape was changing, and fast. Data volumes, especially complex data (video logs, visuals, etc.), were growing everywhere and there was a need for a cost-effective place to collect this data and mine it.

    Cutting now works at Cloudera, the startup that commercially distributes and services Hadoop. Hadoop is a Java software framework born out of an open-source implementation of Google’s published computing infrastructure which is fostered within the Apache Software Foundation. Hadoop supports distributed applications running on large clusters of commodity computers processing enormous amounts of data.

    Via Cloudera, Hadoop is currently used by most of the giants in the space including Google, Yahoo, Facebook (we wrote about Facebook’s use of Cloudera here), Amazon, AOL, Baidu and more. To date, Cloudera has raised $11 million in funding from Accel Partners and Greylock Partners.

    Tags:
    blog comments powered by Disqus