Google's Wizard Of Oz Search Algorithm And The Threat Of Facebook Search


Google search is powered by algorithms. Computers slice and dice data looking for signals that a web page is more or less interesting than other web pages for a given query. PageRank is a big part of this, where Google looks at inbound links to a site as well as the text relevant to that link. But Google also uses lots of other signals to determine the relevance of a web page. They have to, because PageRank on its own is infinitely gameable.

If no one ever tried to game search results, PageRank would work just fine.

Inbound links are simply votes for various web pages. If you take the authority of the site linking into account, it makes for really good search results. That’s why Google was so great in 1999, when there was less incentive to game search results, and less expertise among the people doing it.
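To make the "links as votes, weighted by authority" idea concrete, here is a minimal sketch of the textbook PageRank iteration in Python. It is not Google's production code, which is not public and uses many other signals. The page names, the damping factor of 0.85 and the iteration count are illustrative assumptions.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Textbook PageRank by power iteration.

    `links` maps each page to the list of pages it links to.
    All names and parameter values here are illustrative assumptions.
    """
    # Collect every page that appears as a source or as a link target.
    pages = set(links)
    for targets in links.values():
        pages.update(targets)
    n = len(pages)

    # Start with every page equally important.
    rank = {page: 1.0 / n for page in pages}

    for _ in range(iterations):
        # Every page gets a small baseline share of rank.
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # A page splits its vote evenly among the pages it links to,
                # so a link from a high-ranked page is worth more.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A page with no outbound links spreads its vote over all pages.
                share = damping * rank[page] / n
                for target in pages:
                    new_rank[target] += share
        rank = new_rank
    return rank


# Hypothetical four-page web: a, b and d all link to c, and c links back to a.
example_links = {"a": ["c"], "b": ["c"], "c": ["a"], "d": ["c"]}
ranks = pagerank(example_links)
for page in sorted(ranks, key=ranks.get, reverse=True):
    print(page, round(ranks[page], 3))
```

In this toy graph, page c collects the most votes and ranks first. Page a has only one inbound link, but it comes from c, so a outranks b and d, which have none. That is also why the scheme is gameable: anyone who can manufacture inbound links can manufacture votes.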

But today all that’s changed. There’s a feeling that Google’s algorithm is falling further and further behind the very motivated people and companies out there fighting that algorithm. It’s an arms race, and Google is losing that arms race.

Today we saw yet another algorithm change by Google, designed to fight some of the more annoying internet polluters – content farms and scrapers. The arms race continues.

No Humans Involved!

What fascinated me most today was Google’s insistence that they are not directly using the block data they crowdsource from their Chrome extension in determining search relevance.

“It’s worth noting that this update does not rely on the feedback we’ve received from the Personal Blocklist Chrome extension, which we launched last week.”

But then they talk about how the algorithm is coming up with very similar decisions anyway:

“However, we did compare the Blocklist data we gathered with the sites identified by our algorithm, and we were very pleased that the preferences our users expressed by using the extension are well represented. If you take the top several dozen or so most-blocked domains from the Chrome extension, then this algorithmic change addresses 84% of them, which is strong independent confirmation of the user benefits.”

The more I think about this, the more strange it seems to me.

There’s a good explanation for not relying on that data: if they publicly said they did, there would be a huge incentive for SEOists to start manipulating that block data, too. Forget link farms, just hire thousands of people on Mechanical Turk to download the extension and block competitors’ sites. Another angle on the arms race.

But I don’t think that’s why. Like the Wizard of Oz, Google hides behind its mighty and mysterious search algorithm. If good search were as easy as analyzing simple clicks of a mouse on a web page, all the magic could vaporize.

And if you could somehow remove as much spam as possible from that data, and even slice it demographically, geographically and even personally for a given user, then things might really get sticky.

Particularly if Google didn’t have access to any of that data.

And Facebook did.

One of the most interesting experiments going on in search right now is Blekko’s Facebook Like-powered search engine. Search results and search relevance are determined by what your friends have “liked” on Facebook, a very deep store of data indeed.
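As a rough illustration of what ranking by friends' likes could look like, here is a toy Python sketch. It is not Blekko's or Facebook's actual method, which this article does not describe. The function name, the data shapes and the example URLs are all hypothetical.

```python
def rank_by_friend_likes(candidates, likes_by_user, friends):
    """Order candidate URLs by how many of the searcher's friends liked them.

    Hypothetical toy model: `likes_by_user` maps a user name to the set of
    URLs that user has liked, and `friends` lists the searcher's friends.
    """
    counts = {url: 0 for url in candidates}
    for friend in friends:
        for url in likes_by_user.get(friend, ()):
            if url in counts:
                counts[url] += 1
    # sorted() is stable, so URLs with equal counts keep their original order.
    return sorted(candidates, key=lambda url: counts[url], reverse=True)


# Made-up example data: likes from non-friends are ignored.
candidates = ["hotel-a.example", "hotel-b.example", "hotel-c.example"]
likes_by_user = {
    "alice": {"hotel-b.example"},
    "bob": {"hotel-b.example", "hotel-c.example"},
    "stranger": {"hotel-a.example"},
}
results = rank_by_friend_likes(candidates, likes_by_user, friends=["alice", "bob"])
print(results)
```

The point of the sketch is that only votes from people the searcher knows count, which makes the signal harder to buy in bulk than anonymous links or blocks.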

Facebook has more than half a billion users, and half of those log on every day. These people spend 700 billion minutes on the site and share 30 billion pieces of content each month. Links are being shared and people are clicking “like” to vote for that content. And it turns out that it all adds up to a pretty useful search engine experiment on Blekko.

Imagine what Google could do with all that data and you start to understand why social is so darned important for them right now. Not to kill Facebook, but to try to neutralize the threat that the next great leap in search engine evolution happens completely without them. A lot of the searches that Google is really bad at (commerce and travel, for example) can get really good really fast if you can look at deep data from friends about those very things. I don’t need pages and pages of results. Just a nice hotel in Paris that a friend vouches for. Or a movie I’ll enjoy. Or the right set of pots and pans. All that data is right there on Facebook.

It may take Facebook a few years to really start to get interested in search. But there is so much advertising revenue in that business that they can’t ignore it forever. And that must scare Google more than just about anything else.
