AI

The AI revolution has outgrown the Turing Test: Introducing a new framework

Comment

Human head and stairs - idea concept
Image Credits: themacx (opens in a new window) / Getty Images (Image has been modified)

Chris Saad

Contributor

Chris Saad is a startup and product builder, angel investor and strategic adviser. Previously, he was head of product for the Uber Developer Platform.

As AI becomes a transformative part of our technology landscape, a common vocabulary about the capabilities of each new tool and technique is essential. Common vocabularies create shared intellectual spaces allowing all stakeholders to accelerate understanding, increase adoption, facilitate collaboration, benchmark progress and drive innovation.

So far, the most widely known tool for benchmarking AI is the Turing Test.

AI Classifcation Framework - General Diagram
Image Credits: Chris Saad

However, the field of artificial intelligence (AI) has come a long way since the inception of the Turing Test in 1950. As such, it is becoming increasingly clear that the Turing Test is insufficient for evaluating the full range of AI capabilities that are emerging today — or are likely to emerge in the future.

The Turing Test operates on a simplistic pass/fail basis and focuses heavily on chat/linguistic capability, which is only one aspect of human intelligence. This narrow focus on language ignores many other critical dimensions of intelligence, such as problem-solving, creativity and social awareness. Additionally, the Turing Test presupposes a level of human-like intelligence that may not be relevant or useful for evaluating AI.

The framework

To address these limitations, there is an urgent and critical need to develop a more nuanced and comprehensive framework for evaluating AI capabilities across multiple dimensions of intelligence.

This insight led me to develop the “AI Classification Framework.” The ACF is a new approach to evaluating AI capabilities based on the Theory of Multiple Intelligences.

The Theory of Multiple Intelligences was first proposed by psychologist Howard Gardner in 1983. Gardner argued that intelligence was not a single, unified entity but rather a collection of different abilities that could manifest in a variety of ways. Gardner identified eight different types of intelligence: According to Gardner, individuals may excel in one or more of these areas, and each type of intelligence is independent of the others. The theory challenged the traditional view of intelligence as a singular, fixed entity and opened up new avenues for exploring the diversity of human cognition. While the theory of multiple intelligences has been subject to some criticism and debate over the years, it has had a significant impact on the field of psychology and education, particularly in the development of alternative approaches to teaching and learning.

This seemed perfect as a basis for the AI Classification Framework. Following the theory, the framework supports evaluating AI tools across multiple dimensions of intelligence, including linguistic, logical-mathematical, musical, spatial, bodily-kinesthetic, interpersonal and intrapersonal intelligence.

For each dimension of intelligence, the framework provides a scale from 1–5 with 1 being “No Capability” or the equivalent of a human infant and 5 being “Self-agency” or capability that might be considered “Super Intelligence” — beyond human ability.

Examples

The framework itself is a detailed table of descriptions that can be found here. I’ve also created a simple visual representation for easy high-level reference. What follows are two simple examples of the segments that respond to the capabilities of ChatGPT and DALL-E 2.

AI Classifcation Framework - ChatGPT Case Study
AI classification framework for ChatGPT. Image Credits: Chris Saad

AI classification framework for DALL-E 2 Case Study
AI classification framework for DALL-E 2. Image Credits: Chris Saad

Use cases

For founders building AI startups, the framework represents a methodology to find and communicate (both to potential users and potential investors) opportunities to break new ground and differentiate.

For operators looking to add AI capabilities to their products, the framework represents a way to find and evaluate potential vendors against their internal requirements.

For investors looking to fund AI startups, the framework can inform their investment thesis and ensure that they’re making informed bets across the field of possible intelligence and capability levels.

For end users, the framework is a tool to help them search for, and choose, products that might help them be more productive.

Next steps and acknowledgments

The first version of the framework is intended to kick-start a conversation. I plan to update it as feedback is received from the community.

I also want to thank my friends Ben Parr and Jeremiah Owyang for their helpful feedback. With their input and the input of the larger AI community, I’m confident that this framework can be a valuable tool in evaluating the full range of AI tools and capabilities present and emerging.

Get the framework

View the framework here.

Send feedback to me@chrissaad.com.

Disclosure: In the spirit of using the tools that the framework is intended to classify, the first draft of this post was written by ChatGPT and then edited by me. What a time to be alive!

More TechCrunch

To give AI-focused women academics and others their well-deserved — and overdue — time in the spotlight, TechCrunch has been publishing a series of interviews focused on remarkable women who’ve contributed to…

Women in AI: Rep. Dar’shun Kendrick wants to pass more AI legislation

We took the pulse of emerging fund managers about what it’s been like for them during these post-ZERP, venture-capital-winter years.

A reckoning is coming for emerging venture funds, and that, VCs say, is a good thing

It’s been a busy weekend for union organizing efforts at U.S. Apple stores, with the union at one store voting to authorize a strike, while workers at another store voted…

Workers at a Maryland Apple store authorize strike

Alora Baby is not just aiming to manufacture baby cribs in an environmentally friendly way but is attempting to overhaul the whole lifecycle of a product

Alora Baby aims to push baby gear away from the ‘landfill economy’

Bumble founder and executive chair Whitney Wolfe Herd raised eyebrows this week with her comments about how AI might change the dating experience. During an onstage interview, Bloomberg’s Emily Chang…

Go on, let bots date other bots

Welcome to Week in Review: TechCrunch’s newsletter recapping the week’s biggest news. This week Apple unveiled new iPad models at its Let Loose event, including a new 13-inch display for…

Why Apple’s ‘Crush’ ad is so misguided

The U.K. Safety Institute, the U.K.’s recently established AI safety body, has released a toolset designed to “strengthen AI safety” by making it easier for industry, research organizations and academia…

U.K. agency releases tools to test AI model safety

AI startup Runway’s second annual AI Film Festival showcased movies that incorporated AI tech in some fashion, from backgrounds to animations.

At the AI Film Festival, humanity triumphed over tech

Rachel Coldicutt is the founder of Careful Industries, which researches the social impact technology has on society.

Women in AI: Rachel Coldicutt researches how technology impacts society

SAP Chief Sustainability Officer Sophia Mendelsohn wants to incentivize companies to be green because it’s profitable, not just because it’s right.

SAP’s chief sustainability officer isn’t interested in getting your company to do the right thing

Here’s what one insider said happened in the days leading up to the layoffs.

Tesla’s profitable Supercharger network is in limbo after Musk axed the entire team

StrictlyVC events deliver exclusive insider content from the Silicon Valley & Global VC scene while creating meaningful connections over cocktails and canapés with leading investors, entrepreneurs and executives. And TechCrunch…

Meesho, a leading e-commerce startup in India, has secured $275 million in a new funding round.

Meesho, an Indian social commerce platform with 150M transacting users, raises $275M

Some Indian government websites have allowed scammers to plant advertisements capable of redirecting visitors to online betting platforms. TechCrunch discovered around four dozen “gov.in” website links associated with Indian states,…

Scammers found planting online betting ads on Indian government websites

Around 550 employees across autonomous vehicle company Motional have been laid off, according to information taken from WARN notice filings and sources at the company.  Earlier this week, TechCrunch reported…

Motional cut about 550 employees, around 40%, in recent restructuring, sources say

The company is describing the event as “a chance to demo some ChatGPT and GPT-4 updates.”

OpenAI’s ChatGPT announcement: What we know so far

The deck included some redacted numbers, but there was still enough data to get a good picture.

Pitch Deck Teardown: Cloudsmith’s $15M Series A deck

Unlike ChatGPT, Claude did not become a new App Store hit.

Anthropic’s Claude sees tepid reception on iOS compared with ChatGPT’s debut

Welcome to Startups Weekly — Haje‘s weekly recap of everything you can’t miss from the world of startups. Sign up here to get it in your inbox every Friday. Look,…

Startups Weekly: Trouble in EV land and Peloton is circling the drain

Scarcely five months after its founding, hard tech startup Layup Parts has landed a $9 million round of financing led by Founders Fund to transform composites manufacturing. Lux Capital and Haystack…

Founders Fund leads financing of composites startup Layup Parts

AI startup Anthropic is changing its policies to allow minors to use its generative AI systems — in certain circumstances, at least.  Announced in a post on the company’s official…

Anthropic now lets kids use its AI tech — within limits

Zeekr’s market hype is noteworthy and may indicate that investors see value in the high-quality, low-price offerings of Chinese automakers.

The buzziest EV IPO of the year is a Chinese automaker

Venture capital has been hit hard by souring macroeconomic conditions over the past few years and it’s not yet clear how the market downturn affected VC fund performance. But recent…

VC fund performance is down sharply — but it may have already hit its lowest point

The person who claims to have 49 million Dell customer records told TechCrunch that he brute-forced an online company portal and scraped customer data, including physical addresses, directly from Dell’s…

Threat actor says he scraped 49M Dell customer addresses before the company found out

The social network has announced an updated version of its app that lets you offer feedback about its algorithmic feed so you can better customize it.

Bluesky now lets you personalize main Discover feed using new controls

Microsoft will launch its own mobile game store in July, the company announced at the Bloomberg Technology Summit on Thursday. Xbox president Sarah Bond shared that the company plans to…

Microsoft is launching its mobile game store in July

Smart ring maker Oura is launching two new features focused on heart health, the company announced on Friday. The first claims to help users get an idea of their cardiovascular…

Oura launches two new heart health features

Keeping up with an industry as fast-moving as AI is a tall order. So until an AI can do it for you, here’s a handy roundup of recent stories in the world…

This Week in AI: OpenAI considers allowing AI porn

Garena is quietly developing new India-themed games even though Free Fire, its biggest title, has still not made a comeback to the country.

Garena is quietly making India-themed games even as Free Fire’s relaunch remains doubtful

The U.S.’ NHTSA has opened a fourth investigation into the Fisker Ocean SUV, spurred by multiple claims of “inadvertent Automatic Emergency Braking.”

Fisker Ocean faces fourth federal safety probe