I met with Tony Conrad, one of the founders of Sphere, today at our office (ok, its a house, but it’s also an office – just ask the IRS). Sphere is a new blog search engine that quite frankly blows everything, and I mean everything, I’ve seen out of the water in terms of relevance (and, by the way, design…Adaptive Path was involved).
Tony, along with co-founders Steve Nieker and Martin Remy, and advisors Matt Mullenweg, Mary Hodder and Toni Schnieder, have created Sphere in a very short period of time and for “less than $200k”.
I saw the live site and was allowed to bang on it as much as I liked. I did, and came away very impressed.
Relevance in blog search is very difficult. Google-type PageRank analysis, which looks at incoming links to a piece of content, simply doesn’t work because new content doesn’t have much in the way of links. Until now, no one has come up with a way to properly sort blog posts by relevance, and the general default way of showing results is “reverse-chrono”, which simply puts the newest stuff at the top.
Sphere appears to have solved the problem, or at least taken big steps in the right direction. Their approach involves three key algorithms – an analysis of links into and out of a blog, an analysis of metadata around a post (links, post frequency, length of posts, etc.), and something Tony calls their “secret sauce”, which is content semantic analysis to filter out spam and to understand what a blog post is talking about.
Result sets show only two posts per blog on the first page, so no one blog can dominate a category. The result set has auto-generated profiles of blogs that include recent links in and out of the blog, average posts per week, average words per post and a link to a full page profile that can be edited by the blog author.
To the right of the main result set are blog (as opposed to post) links that are relevant to the query, and something they call the “Daily Sphere”, which is links to relevant non-blog news stories. I imagine they may add additional content in the side area as well.
The results page integrates both ajax and flash features in an intelligent way (it’s not there just for show).
While Sphere has been indexing blogs since January of 2003, their index only shows results for the previous four months. They will lengthen this window as they scale up operations.
Sphere has tens of thousands of beta requests and is opening up to 100 lucky people this weekend. Sign up for the beta here.