Yahoo Releases Internal Hadoop Source Code

Yahoo! is releasing the tested Hadoop source code that it uses to help power its sites and products. Hadoop is a free Java software framework born out of an open-source implementation of Google’s published computing infrastructure and fostered within the Apache Software Foundation. Yahoo made the announcement today at the second annual Hadoop Summit in Santa Clara, California, which was co-sponsored by fellow cloud computing vendors Amazon Web Services, Cloudera, IBM, and Sun Microsystems.

Yahoo! has been the primary developer of, and investor in, Apache Hadoop. In 2006, Hadoop founder Doug Cutting joined Yahoo to lead development of the open-source software. Hadoop now provides the framework for many Yahoo properties, including Yahoo Search, Yahoo Mail, and several content and ad services. At Yahoo, Hadoop runs on more than 25,000 servers and analyzes billions of Web pages.

Yahoo says it’s opening up the source code to Hadoop to “increase the pace of innovation around open and collaborative research and development.” Hadoop is currently being used by a number of cloud computing vendors, including Amazon Web Services (to power its Elastic MapReduce feature), IBM (for its Blue Cloud Initiative), and Google. Startup Cloudera offers its own Hadoop-powered computational services on top of Amazon’s EC2.

Yahoo hasn’t been doing much in the cloud computing space, but releasing this code could further its commitment to making a name in the cloud. It has a long way to go to catch up to Amazon, IBM, Google, Microsoft, and others, but this release may get more developers engaged with Hadoop.

  • http://www.irishmillionaire.com Mike

    Thanks great news…

  • http://tony.abou-assaleh.net Tony A.A.

    Good to see useful things come out of Yahoo!. Hopefully this will improve Hadoop’s efficiency and lead to wider adoption in industry and academia.

  • Joe Bauers

    TechCrunch + Anything remotely technical = COMEDY!

    • Some

      I will have to agree.

      Guys, look at how you introduce Hadoop vs how Wikipedia introduces it.

      Techcrunch – Hadoop is free Java software framework born out of an open-source implementation of Google’s published computing infrastructure and fostered within the Apache Software Foundation.

      Really? Hadoop is a Java framework which is an implementation of Google’s computing infrastructure?

      Wikipedia – Apache Hadoop is a free Java software framework that supports data intensive distributed applications. It enables applications to work with thousands of nodes and petabytes of data. Hadoop was inspired by Google’s MapReduce and Google File System (GFS) papers.

      • Hunt

        word.
        techcrunchit fails again :)

  • http://www.emtain.tv Björn Wilmsmann

    From a developer’s point of view, Yahoo is just an amazing company, having released stuff like BOSS, YUI and contributing to projects like Hadoop.

  • http://www.gadget-hero.com hero

    let’s hope it will be useful …

  • http://www.irishmillionaire.com Mike

    thank you i got an email irishmillionaire.com

  • aa

    leena…

    you do realize that hadoop was open source already, right?

    the same as nutch, and lucene…

    but i’m willing to bet you have no idea how any of it works….!!!

    jeeze…

    • Dan

      I believe her post was referring to the specific Yahoo distribution of Hadoop:

      “As a service to the Hadoop community, we are releasing the Yahoo! Distribution of Hadoop — a source code distribution that is based entirely on code found in the Apache Hadoop project. “

  • http://www.cloudheads.net Wil Sinclair

    So does anyone know exactly what they’re OS’ing, then? Is there something online already?

    ,Wil

  • kg

    Seriously, this article is BS. The writer knows nothing about Hadoop. I was at the Hadoop Summit, and statements like “Yahoo hasn’t been doing much in the cloud computing space” make me wonder how these writers think.

  • deb

    Leena Rao, do you have any idea what is happening in cloud computing? Don’t write stupid comments.

  • http://www.toplinkworld.com ayme

    well it seems a great application, but the name leaves something to be desired…hadoop
