• Twitter Turns On The Firehose For Realtime Search Startups

    Monday, March 1st, 2010

    Erick Schonfeld is a technology journalist and the former Editor in Chief of TechCrunch. At TechCrunch, he oversaw the editorial content of the site, helped to program the Disrupt conferences and CrunchUps, produced TCTV shows, and wrote daily for the blog. He joined TechCrunch as Co-Editor in 2007, and helped take it from a popular blog to a thriving... → Learn More

    When it comes to getting access to all the data that flows through Twitter, there are the 50,000 apps that drink from Twitter’s Streaming API, which is subject to various limits. And then there are the chosen few who get the full unlimited firehose of data, the more than 50 million Tweets a day coursing through Twitter.

    In the past, only select partners, particularly big search engines such as Google or Bing, got the full firehose. Search engines need it more than others to be able to index and serve up results in realtime. Today, smaller search startups are also getting the firehose. These include Ellerdale, Collecta, Kosmix, Scoopler, twazzup, CrowdEye, and Chainn Search (which has not yet launched).

    In December at Le Web, Twitter indicated that the firehose would open up to more and more developers. This is a step in that direction.

    And while Twitter makes deep-pocketed partners like Google and Bing pay for the firehose, it treats startups differently with more favorable (even free) terms, or at least it used to. These aren’t the first startups to get the firehose, and they won’t be the last. Once they get to a certain size though, they’ll pay if they don’t already. Apparently it works on a sliding scale.

    Photo credit: Flickr/ZeroOne

    Company: Collecta
    Website: collecta.com
    Launch Date: November 2008
    Funding: $6.58M

    Collecta monitors the update streams of popular realtime blogs and sites like Twitter, Wordpress, and Flickr, and shows results as they happen. Results can be filtered by status updates, comments, stories, or photos. The entire engine is built around the XMPP standard, which pushes out data on a continual basis, so that for every search you end up watching a stream that keeps updating itself.

    Learn more
    Company: Ellerdale
    Website: ellerdale.com
    Launch Date: 2008

    Ellerdale, operating in stealth mode for most of the past year, has built an exciting site using leading-edge semantic technology that filters the real-time stream by topics, instead of keyword strings. This enables Ellerdale to automatically display trending topics in real-time within categories, e.g. trending people, trending films, trending companies, trending sports, etc. Users can select from over a million individual topics to track in real-time. The latest tweets and rss articles are tracked, as well as the most...

    Learn more
    Company: Crowdeye
    Website: crowdeye.com
    Launch Date: September 2008

    CrowdEye is a Twitter Search engine that returns not only recent tweets, but the number of tweets on a subject over time, related news articles, and related keywords.

    Learn more

    Sponsored Ads

    blog comments powered by Disqus

    Sponsored Ads

    Sponsored Ads

    Upcoming Events

    Disrupt SF 2012

    San Francisco, CA