Measuring AI startups by the right yardstick

Ivy Nguyen

Contributor

Ivy Nguyen is an associate at Zetta Venture Partners.

Building a B2B AI startup is hard enough between struggling to obtain training data and fighting with major tech companies to secure talent. Building a B2B AI startup held to the well-established software-as-a-service (SaaS) metrics is even harder. While many AI businesses deliver value via software monetized by a recurring subscription like their SaaS counterparts, the similarities between the two types of businesses end there.

AI startups are a different animal

SaaS products built without data and AI offer generalized solutions to their customers. AI businesses more closely resemble a services business or consultancies because they provide solutions that become tailored to that customer’s specific needs. Like services providers or consultants, an AI product improves as it knows a customer better (as in, as it collects more data from customers with continued usage), and as it serves a broader customer base, from which it can collect best practices and make better predictions over a bigger data set.

Services revenue has been the antithesis of venture-style growth because it yields lower margins and lacks repeatability and scalability; as your services business brings on more customers, you will need to scale headcount accordingly to support those accounts, which keeps margins low. Palantir, a big data analytics unicorn, is one company mired in services demands. Unlike services providers, AI businesses have the potential to deliver that targeted and greater ROI at scale.

AI businesses are not scalable right out of the gate: AI models take time and require data to train. Moreover, not all AI businesses will scale. Here are the metrics we use to tell the difference early on.

AI metrics

Intervention ratio
Hype will make enterprise customers trigger-happy to pilot AI solutions, but at the end of the day, enterprise buyers buy the best solution available to address their problems and don’t care whether that solution comes in the form of SaaS software, a consultancy or an AI product. It is very difficult to build a high-performing MVP version of an AI model without data from customers. In order to demonstrate value right out of the box and be competitive against other vendors, you might automate whichever processes you can right off the bat using a rules engine, and provide a human operator to perform the rest of the work while simultaneously labeling the collected data in order to train the AI.

As the AI improves over time, the human operator will offload more of the work and only jump in to intervene when the AI falls below a predetermined accuracy or confidence threshold. This enables you to serve an increasing number of customers with a limited number of staff. Lilt, which provides machine translation for enterprise, uses professional translators in this role. The translation AI automatically translates a text excerpt from one language to another. A human translator goes over the text looking for errors in translation or contextual corrections. As the translation AI improves, the human translator will have to make fewer corrections per translation task. More generally, the ratio of human interventions over total automated tasks should be decreasing.
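As a rough illustration of how you might track this, the sketch below computes the intervention ratio per reporting period and checks that it is falling. The `PeriodStats` schema, the quarterly granularity and the numbers are assumptions made up for the example, not figures from Lilt or any other company.

```python
from dataclasses import dataclass


@dataclass
class PeriodStats:
    """Task counts for one reporting period (hypothetical schema)."""
    label: str
    automated_tasks: int      # tasks the AI attempted
    human_interventions: int  # tasks a human had to correct or take over


def intervention_ratio(stats: PeriodStats) -> float:
    """Human interventions divided by total automated tasks."""
    if stats.automated_tasks == 0:
        raise ValueError(f"no automated tasks recorded for {stats.label}")
    return stats.human_interventions / stats.automated_tasks


def is_decreasing(ratios: list[float]) -> bool:
    """True if every period's ratio is strictly lower than the one before."""
    return all(later < earlier for earlier, later in zip(ratios, ratios[1:]))


# Illustrative numbers only.
history = [
    PeriodStats("Q1", automated_tasks=1_000, human_interventions=400),
    PeriodStats("Q2", automated_tasks=2_500, human_interventions=750),
    PeriodStats("Q3", automated_tasks=6_000, human_interventions=900),
]
ratios = [intervention_ratio(p) for p in history]
healthy = is_decreasing(ratios)
```

In this made-up history the task volume grows sixfold while the ratio keeps falling, which is the pattern that lets a fixed team serve more customers.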

ROI curve
As with SaaS products, exactly how that compounding AI performance increase is tied to bringing value for the customer is key to the startup’s long-term stickiness. The key difference with AI products is that once the AI’s performance ramps up, it could very quickly exhaust all the low-hanging fruit. If the AI cannot continue to provide value to the customer, the difference in value from one renewal cycle to the next may seem stark to the customer, who may decide not to renew.

Choosing the right applications of AI to enable long-term payoffs and avoiding hitting a wall with ROI is key. Typically, applications that improve the customer’s bottom line face finite opportunities for improvement, and applications that improve the customer’s top line have no ceiling on opportunities to grow. For example, once an AI improves the operating efficiency of a production line to the point where it is rate-limited by the time it takes for the raw materials to chemically react, the AI can no longer find value for the customer for that specific application.

There are only so many opportunities to take out costs before you are constrained by the laws of physics. An AI that helps customers find new opportunities for revenue will not hit that wall. One example is Constructor.io, which provides AI-powered site search as a service and helps customers such as Jet.com increase cart conversions.

You should closely track the cumulative ROI for each customer over time to make sure the curve does not plateau and lead the customer to churn. Sometimes the long-term application is harder to sell because its value is difficult to demonstrate immediately, so you might get a foot in the door with the cost take-out value proposition. Understanding each customer’s ROI curve lets you design a contract period long enough for the AI to ramp up on new problems before it exhausts the initial application. To retain customers, make sure their ROI keeps increasing over time and does not plateau or taper off.
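One way to make the ROI curve concrete is sketched below. It assumes you can estimate the value delivered to a customer in each period and you know what they paid. Cumulative ROI is taken here as (cumulative value minus cumulative cost) divided by cumulative cost, and the plateau test, its two-period window and its 0.05 threshold are arbitrary choices for illustration.

```python
from itertools import accumulate


def cumulative_roi(period_value: list[float], period_cost: list[float]) -> list[float]:
    """Cumulative ROI after each period: (total value - total cost) / total cost."""
    if len(period_value) != len(period_cost):
        raise ValueError("value and cost series must have the same length")
    cum_value = list(accumulate(period_value))
    cum_cost = list(accumulate(period_cost))
    if any(c <= 0 for c in cum_cost):
        raise ValueError("cumulative cost must be positive in every period")
    return [(v - c) / c for v, c in zip(cum_value, cum_cost)]


def is_plateauing(curve: list[float], window: int = 2, min_gain: float = 0.05) -> bool:
    """Flag a curve that gained less than `min_gain` over the last `window` periods."""
    if len(curve) <= window:
        return False  # not enough history to judge
    return curve[-1] - curve[-1 - window] < min_gain


# Illustrative quarterly figures for two hypothetical customers paying 100 per quarter.
cost = [100.0] * 5
stalling = cumulative_roi([50.0, 120.0, 150.0, 110.0, 100.0], cost)
growing = cumulative_roi([50.0, 120.0, 200.0, 300.0, 420.0], cost)

stalling_flag = is_plateauing(stalling)
growing_flag = is_plateauing(growing)
```

The first hypothetical customer is the renewal risk: the initial application has been exhausted and each additional quarter adds little beyond what the customer pays.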

Rev-up costs
Deploying an AI product is a complicated process that leaves you at the mercy of each customer’s idiosyncratic tech stack and org chart. AI needs data to train, so an AI product may take more time than a SaaS product to deliver value. Acquiring or creating data for the AI model, integrating the product into the customer’s tech stack and workflows and getting the product to deliver value before the model is sufficiently trained on the customer’s data may significantly impact your own bottom line.

Many sectors have only recently begun to digitize, and valuable data might be in difficult-to-extract formats, such as handwritten notes, unstructured observation logs or PDFs. In order to capture this data, you may have to spend significant manpower on low-margin data preparation services before AI systems can be deployed. Depending on how the data is captured and organized, your deployment engineer may have to build new integrations to a data source before the model can be fully functional.

The way data is structured might also vary from one customer to the next, requiring AI engineers to spend additional hours normalizing the data or converting it to a standardized schema so the AI model can be applied. Over time, these costs may decrease as you build up a library of reusable integrations and ETL pipelines.

Products sold by SaaS companies either work or they don’t. AI performance is not binary; it works less well out of the box and improves with more data. Each application and each customer will accept a different minimum algorithmic performance (MAP). The deployment process should get the product to that customer’s specific MAP, and you might fall back on Wizard of Oz stop-gap approaches, in which human operators do the work behind the scenes, to deliver MAP until the model can perform at that level on its own.
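A minimal sketch of that stop-gap logic follows, assuming you measure the model’s accuracy on each customer’s own validation data and that the model reports a per-task confidence score. The `Prediction` type, the `route` function and the 0.8 confidence floor are hypothetical names and values chosen for the example.

```python
from dataclasses import dataclass


@dataclass
class Prediction:
    task_id: str
    output: str
    confidence: float  # model's own score in [0, 1] (assumed to be available)


def route(pred: Prediction, measured_accuracy: float, customer_map: float,
          confidence_floor: float = 0.8) -> str:
    """Decide who handles a task: the "model" or a "human" operator."""
    # Until the model reaches this customer's MAP, a human handles everything.
    if measured_accuracy < customer_map:
        return "human"
    # Past MAP, a human only steps in on low-confidence predictions.
    if pred.confidence < confidence_floor:
        return "human"
    return "model"


confident = Prediction("t1", "draft answer", confidence=0.95)
unsure = Prediction("t2", "draft answer", confidence=0.50)

before_map = route(confident, measured_accuracy=0.85, customer_map=0.90)
after_map = route(confident, measured_accuracy=0.93, customer_map=0.90)
low_conf = route(unsure, measured_accuracy=0.93, customer_map=0.90)
```

Because `customer_map` is a per-customer parameter, the same model can be live for one customer while still running in stop-gap mode for another with a stricter requirement.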

If you are selling to customers that allow you to pool anonymized data or use a model trained on their data with other customers, the AI product will perform better “out of the box” with each subsequent customer. InsideSales.com customers, for example, can get immediate suggestions on how to optimally target a sales lead using its sales acceleration platform, thanks to the data pooled from its customer network.

AI products incur more significant rev-up costs than a typical SaaS product rollout and may have as much impact on margins as customer acquisition costs (CAC). You should carefully track how much time these rollouts and ramp-ups take, and how much it costs for each new customer. If there are true data network effects, these numbers should decrease over time.
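To illustrate the tracking described above, the sketch below totals a rev-up cost per rollout and compares later rollouts with earlier ones. The `Rollout` fields, the hourly rates and the half-versus-half comparison are assumptions for the example. A real analysis would likely segment by customer type and use more rollouts.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class Rollout:
    """One customer deployment (hypothetical schema)."""
    customer: str
    days_to_map: int            # days until the model reached the customer's MAP
    engineering_hours: float    # integrations, data preparation, normalization
    human_in_loop_hours: float  # stop-gap operator work before MAP
    data_cost: float            # money spent acquiring or labeling data


def rev_up_cost(r: Rollout, eng_rate: float, operator_rate: float) -> float:
    """Total cost of getting one customer to MAP."""
    return (r.engineering_hours * eng_rate
            + r.human_in_loop_hours * operator_rate
            + r.data_cost)


def trend(values: list[float]) -> float:
    """Mean of the later half minus mean of the earlier half (negative = improving)."""
    if len(values) < 2:
        raise ValueError("need at least two rollouts to compare")
    half = len(values) // 2
    return mean(values[half:]) - mean(values[:half])


# Illustrative rollouts, listed in the order the customers were signed.
rollouts = [
    Rollout("A", 120, 400, 600, 20_000),
    Rollout("B", 90, 300, 400, 10_000),
    Rollout("C", 60, 200, 250, 5_000),
    Rollout("D", 45, 150, 100, 2_000),
]
costs = [rev_up_cost(r, eng_rate=100.0, operator_rate=40.0) for r in rollouts]
cost_trend = trend(costs)
time_trend = trend([float(r.days_to_map) for r in rollouts])
```

Negative values for both trends are what you would hope to see if reusable integrations and pooled data are paying off. Setting `costs` beside per-customer CAC shows how much of the margin each one consumes.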

Data moat
Unlike SaaS businesses that compete on new features, AI startups have an opportunity to build long-term defensibility. The AI startups that can scale will kick off a virtuous loop where the better the product performs, the more customers come on board to contribute and generate data, which improves the product’s performance. This reinforcement loop builds a compounding defensibility that was previously unheard of for SaaS businesses.

It’s too simplistic to merely aim for the largest volume of data. A defensible data strategy takes into account whether the appropriate data is being collected at a pace that is appropriate for the problem at hand. Ask yourself the following questions to determine where you can strengthen your data strategy:

  • Accessibility: how easy is the data to get?
  • Time: how quickly can the data be amassed and used in the model?
  • Cost: how much money is needed to acquire and/or label this data?
  • Uniqueness: is similar data widely available to others who could then build a model and achieve the same result?
  • Dimensionality: how many different attributes are described in a data set?
  • Breadth: how widely do the values of attributes vary, such that they may account for edge cases and rare exceptions?
  • Perishability: will the data be useful for a long time?

AI models perform better with more data, but that performance may plateau over time. You should take care to track the time and volume of data necessary to achieve an incremental unit of value for your customer, to make sure that the data moat continues to grow. In short, how much time, and how much data, would a copycat need to match your level of performance?
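A back-of-the-envelope version of that question is sketched below. It assumes you log model performance at a few cumulative data volumes and can guess how fast a competitor could collect comparable data. The checkpoint figures and the collection rate are invented, and the linear time estimate ignores everything except raw data volume.

```python
def marginal_data_per_point(checkpoints: list[tuple[int, float]]) -> list[float]:
    """Extra examples needed per one-point performance gain between checkpoints.

    Each checkpoint is (cumulative_examples, performance_score).
    """
    result = []
    for (n0, p0), (n1, p1) in zip(checkpoints, checkpoints[1:]):
        gain = p1 - p0
        # No gain means no amount of data in this interval bought improvement.
        result.append(float("inf") if gain <= 0 else (n1 - n0) / gain)
    return result


def copycat_months_to_match(examples_needed: int, examples_per_month: float) -> float:
    """Months a competitor would need to amass the same data volume."""
    if examples_per_month <= 0:
        return float("inf")
    return examples_needed / examples_per_month


# Illustrative checkpoints: (cumulative examples, accuracy in percentage points).
checkpoints = [(10_000, 70.0), (50_000, 80.0), (200_000, 85.0), (1_000_000, 87.0)]
marginal = marginal_data_per_point(checkpoints)
months = copycat_months_to_match(checkpoints[-1][0], examples_per_month=25_000)
```

A steeply rising marginal figure cuts both ways. It means a copycat needs a great deal of data to close the last few points, but it also signals that your own performance is approaching a plateau, so each new unit of customer value is getting more expensive to earn.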

SaaS metrics aren’t enough

The higher upfront work necessary to launch an AI business means that most will look more like services businesses or will appear to underperform when they are evaluated under the framework of SaaS metrics. A small subset of AI startups will resemble SaaS businesses from the beginning, before AI is deployed in the product. In order to collect data for their AI models, some businesses first sell SaaS workflow tools and can even achieve meaningful revenue from that workflow tool alone. By SaaS metrics, that company may be blowing the competition out of the water. Without the reinforcement loop generating a compounding volume of data and an increasingly powerful AI over time, however, that company’s product remains vulnerable to copycats and will eventually be commoditized.

AI metrics capture this difference. AI offers the opportunity to deliver the customized and specialized ROI of a services business with the scalability of software and the ability to defend against copycats. The high start-up costs of this approach to company-building may mean you will realize smaller profits at first and prioritize different elements than what has worked before. Vertical AI is so new as a category that many companies are not yet tracking these metrics, so we don’t yet have enough data points to establish benchmarks. In the meantime, these numbers will serve as helpful barometers for you to monitor the health and performance of this new type of company.