Lakera launches to protect large language models from malicious prompts

With $10M in backing, Swiss startup launches API to protect companies from prompt injections and more

5:25 AM PDT • October 12, 2023

Chat with AI — **Image Credits:** Supatman / Getty Images

Large language models (LLMs) are the driving force behind the burgeoning generative AI movement, capable of interpreting and creating human-language texts from simple prompts — this could be anything from summarizing a document to writing a poem to answering a question using data from myriad sources.

However, these prompts can also be manipulated by bad actors to achieve far more dubious outcomes, using so-called “prompt injection” techniques whereby an individual inputs carefully crafted text prompts into an LLM-powered chatbot with the purpose of tricking it into giving unauthorized access to systems, for example, or otherwise enabling the user to bypass strict security measures.

And it’s against that backdrop that Swiss startup Lakera is officially launching to the world today, with the promise of protecting enterprises from various LLM security weaknesses such as prompt injections and data leakage. Alongside its launch, the company also revealed that it raised a hitherto undisclosed $10 million round of funding earlier this year.

Data wizardry

Lakera has developed a database comprising insights from various sources, including publicly available open source datasets, its own in-house research and — interestingly — data gleaned from an interactive game the company launched earlier this year called Gandalf.

With Gandalf, users are invited to “hack” the underlying LLM through linguistic trickery, trying to get it to reveal a secret password. If the user manages this, they advance to the next level, with Gandalf getting more sophisticated at defending against this as each level progresses.

Lakera's Gandalf — Lakera’s Gandalf. **Image Credits:** TechCrunch

Powered by OpenAI’s GPT3.5, alongside LLMs from Cohere and Anthropic, Gandalf — on the surface, at least — seems little more than a fun game designed to showcase LLMs’ weaknesses. Nonetheless, insights from Gandalf will feed into the startup’s flagship Lakera Guard product, which companies integrate into their applications through an API.

“Gandalf is literally played all the way from like six-year-olds to my grandmother, and everyone in between,” Lakera CEO and co-founder David Haber explained to TechCrunch. “But a large chunk of the people playing this game is actually the cybersecurity community.”

Haber said the company has recorded some 30 million interactions from 1 million users over the past six months, allowing it to develop what Haber calls a “prompt injection taxonomy” that divides the types of attacks into 10 different categories. These are: direct attacks; jailbreaks; sidestepping attacks; multi-prompt attacks; role-playing; model duping; obfuscation (token smuggling); multi-language attacks; and accidental context leakage.

From this, Lakera’s customers can compare their inputs against these structures at scale.

“We are turning prompt injections into statistical structures — that’s ultimately what we’re doing,” Haber said.

Prompt injections are just one cyber risk vertical Lakera is focused on though, as it’s also working to protect companies from private or confidential data inadvertently leaking into the public domain, as well as moderating content to ensure that LLMs don’t serve up anything unsuitable for kids.

“When it comes to safety, the most popular feature that people are asking for is around detecting toxic language,” Haber said. “So we are working with a big company that is providing generative AI applications for children, to make sure that these children are not exposed to any harmful content.”

On top of that, Lakera is also addressing LLM-enabled misinformation or factual inaccuracies. According to Haber, there are two scenarios where Lakera can help with so-called “hallucinations” — when the output of the LLM contradicts the initial system instructions, and where the output of the model is factually incorrect based on reference knowledge.

“In either case, our customers provide Lakera with the context that the model interacts in, and we make sure that the model doesn’t act outside of those bounds,” Haber said.

So really, Lakera is a bit of a mixed bag spanning security, safety and data privacy.

EU AI Act

With the first major set of AI regulations on the horizon in the form of the EU AI Act, Lakera is launching at an opportune moment in time. Specifically, Article 28b of the EU AI Act focuses on safeguarding generative AI models through imposing legal requirements on LLM providers, obliging them to identify risks and put appropriate measures in place.

In fact, Haber and his two co-founders have served in advisory roles to the Act, helping to lay some of the technical foundations ahead of the introduction — which is expected some time in the next year or two.

“There are some uncertainties around how to actually regulate generative AI models, distinct from the rest of AI,” Haber said. “We see technological progress advancing much more quickly than the regulatory landscape, which is very challenging. Our role in these conversations is to share developer-first perspectives, because we want to complement policymaking with an understanding of when you put out these regulatory requirements, what do they actually mean for the people in the trenches that are bringing these models out into production?”

Lakera founders: CEO David Haber flanked by CPO Matthias Kraft (left) and CTO Mateo Rojas-Carulla. **Image Credits:** Lakera

The security blocker

The bottom line is that while ChatGPT and its ilk have taken the world by storm these past nine months like few other technologies have in recent times, enterprises are perhaps more hesitant to adopt generative AI in their applications due to security concerns.

“We speak to some of the coolest startups, to some of the world’s leading enterprises — they either already have these [generative AI apps] in production, or they’re looking at the next three to six months,” Haber said. “And we are already working with them behind the scenes to make sure they can roll this out without any problems. Security is a big blocker for many of these [companies] to bring their generative AI apps to production, which is where we come in.”

Founded out of Zurich in 2021, Lakera already claims major paying customers, which it says it’s not able to name-check due to the security implications of revealing too much about the kinds of protective tools that they’re using. However, the company has confirmed that LLM developer Cohere — a company that recently attained a $2 billion valuation — is a customer, alongside a “leading enterprise cloud platform” and “one of the world’s largest cloud storage services.”

With $10 million in the bank, the company is fairly well-financed to build out its platform now that it’s officially in the public domain.

“We want to be there as people integrate generative AI into their stacks, to make sure these are secure and the risks are mitigated,” Haber said. “So we will evolve the product based on the threat landscape.”

Lakera’s investment was led by Swiss VC Redalpine, with additional capital provided by Fly Ventures, Inovia Capital and several angel investors.

More TechCrunch

Why Apple’s ‘Crush’ ad is so misguided

Cody Corrall

10 hours ago

Welcome to Week in Review: TechCrunch’s newsletter recapping the week’s biggest news. This week Apple unveiled new iPad models at its Let Loose event, including a new 13-inch display for…

U.K. agency releases tools to test AI model safety

Kyle Wiggers

10 hours ago

The U.K. Safety Institute, the U.K.’s recently established AI safety body, has released a toolset designed to “strengthen AI safety” by making it easier for industry, research organizations and academia…

U.K. agency releases tools to test AI model safety

At the AI Film Festival, humanity triumphed over tech

Kyle Wiggers

12 hours ago

AI startup Runway’s second annual AI Film Festival showcased movies that incorporated AI tech in some fashion, from backgrounds to animations.

At the AI Film Festival, humanity triumphed over tech

Women in AI: Rachel Coldicutt researches how technology impacts society

Dominic-Madori Davis

13 hours ago

Rachel Coldicutt is the founder of Careful Industries, which researches the social impact technology has on society.

Women in AI: Rachel Coldicutt researches how technology impacts society

Enterprise

SAP’s chief sustainability officer isn’t interested in getting your company to do the right thing

Ron Miller

Tim De Chant

14 hours ago

SAP Chief Sustainability Officer Sophia Mendelsohn wants to incentivize companies to be green because it’s profitable, not just because it’s right.

SAP’s chief sustainability officer isn’t interested in getting your company to do the right thing

Transportation

Tesla’s profitable Supercharger network is in limbo after Musk axed the entire team

Tim De Chant

15 hours ago

Here’s what one insider said happened in the days leading up to the layoffs.

Tesla’s profitable Supercharger network is in limbo after Musk axed the entire team

StrictlyVC London welcomes Phoenix Court and WEX

Cindy Zackney

22 hours ago

StrictlyVC events deliver exclusive insider content from the Silicon Valley & Global VC scene while creating meaningful connections over cocktails and canapés with leading investors, entrepreneurs and executives. And TechCrunch…

Commerce

Meesho, an Indian social commerce platform with 150M transacting users, raises $275M

Manish Singh

24 hours ago

Meesho, a leading e-commerce startup in India, has secured $275 million in a new funding round.

Meesho, an Indian social commerce platform with 150M transacting users, raises $275M

Security

Scammers found planting online betting ads on Indian government websites

Jagmeet Singh

1 day ago

Some Indian government websites have allowed scammers to plant advertisements capable of redirecting visitors to online betting platforms. TechCrunch discovered around four dozen “gov.in” website links associated with Indian states,…

Scammers found planting online betting ads on Indian government websites

Transportation

Motional cut about 550 employees, around 40%, in recent restructuring, sources say

Rebecca Bellan

1 day ago

Around 550 employees across autonomous vehicle company Motional have been laid off, according to information taken from WARN notice filings and sources at the company. Earlier this week, TechCrunch reported…

Motional cut about 550 employees, around 40%, in recent restructuring, sources say

Fundraising

Pitch Deck Teardown: Cloudsmith’s $15M Series A deck

Haje Jan Kamps

1 day ago

The deck included some redacted numbers, but there was still enough data to get a good picture.

Pitch Deck Teardown: Cloudsmith’s $15M Series A deck

OpenAI’s ChatGPT announcement: What we know so far

Anthony Ha

1 day ago

The company is describing the event as “a chance to demo some ChatGPT and GPT-4 updates.”

OpenAI’s ChatGPT announcement: What we know so far

Anthropic’s Claude sees tepid reception on iOS compared with ChatGPT’s debut

Sarah Perez

1 day ago

Unlike ChatGPT, Claude did not become a new App Store hit.

Anthropic’s Claude sees tepid reception on iOS compared with ChatGPT’s debut

Startups

Startups Weekly: Trouble in EV land and Peloton is circling the drain

Haje Jan Kamps

1 day ago

Welcome to Startups Weekly — Haje‘s weekly recap of everything you can’t miss from the world of startups. Sign up here to get it in your inbox every Friday. Look,…

Startups

Founders Fund leads financing of composites startup Layup Parts

Aria Alamalhodaei

1 day ago

Scarcely five months after its founding, hard tech startup Layup Parts has landed a $9 million round of financing led by Founders Fund to transform composites manufacturing. Lux Capital and Haystack…

Founders Fund leads financing of composites startup Layup Parts

Anthropic now lets kids use its AI tech — within limits

Kyle Wiggers

1 day ago

AI startup Anthropic is changing its policies to allow minors to use its generative AI systems — in certain circumstances, at least. Announced in a post on the company’s official…

Anthropic now lets kids use its AI tech — within limits

Transportation

The buzziest EV IPO of the year is a Chinese automaker

Rebecca Bellan

1 day ago

Zeekr’s market hype is noteworthy and may indicate that investors see value in the high-quality, low-price offerings of Chinese automakers.

The buzziest EV IPO of the year is a Chinese automaker

Market Analysis

VC fund performance is down sharply — but it may have already hit its lowest point

Rebecca Szkutak

2 days ago

Venture capital has been hit hard by souring macroeconomic conditions over the past few years and it’s not yet clear how the market downturn affected VC fund performance. But recent…

VC fund performance is down sharply — but it may have already hit its lowest point

Security

Threat actor says he scraped 49M Dell customer addresses before the company found out

Lorenzo Franceschi-Bicchierai

2 days ago

The person who claims to have 49 million Dell customer records told TechCrunch that he brute-forced an online company portal and scraped customer data, including physical addresses, directly from Dell’s…

Social

Bluesky now lets you personalize main Discover feed using new controls

Sarah Perez

2 days ago

The social network has announced an updated version of its app that lets you offer feedback about its algorithmic feed so you can better customize it.

Bluesky now lets you personalize main Discover feed using new controls

Apps

Microsoft is launching its mobile game store in July

Aisha Malik

2 days ago

Microsoft will launch its own mobile game store in July, the company announced at the Bloomberg Technology Summit on Thursday. Xbox president Sarah Bond shared that the company plans to…

Microsoft is launching its mobile game store in July

Hardware

Oura launches two new heart health features

Aisha Malik

2 days ago

Smart ring maker Oura is launching two new features focused on heart health, the company announced on Friday. The first claims to help users get an idea of their cardiovascular…

Oura launches two new heart health features

This Week in AI: OpenAI considers allowing AI porn

Kyle Wiggers

2 days ago

Keeping up with an industry as fast-moving as AI is a tall order. So until an AI can do it for you, here’s a handy roundup of recent stories in the world…

Gaming

Garena is quietly making India-themed games even as Free Fire’s relaunch remains doubtful

Jagmeet Singh

2 days ago

Garena is quietly developing new India-themed games even though Free Fire, its biggest title, has still not made a comeback to the country.

Garena is quietly making India-themed games even as Free Fire’s relaunch remains doubtful

Transportation

Fisker Ocean faces fourth federal safety probe

Sean O'Kane

2 days ago

The U.S.’ NHTSA has opened a fourth investigation into the Fisker Ocean SUV, spurred by multiple claims of “inadvertent Automatic Emergency Braking.”

Fisker Ocean faces fourth federal safety probe

CoreWeave, a $19B AI compute provider, opens European HQ in London with plans for 2 UK data centers

Paul Sawers

2 days ago

CoreWeave has formally opened an office in London that will serve as its European headquarters and home to two new data centers.

CoreWeave, a $19B AI compute provider, opens European HQ in London with plans for 2 UK data centers

Fundraising

AI chip startup DEEPX secures $80M Series C at a $529M valuation

Kate Park

2 days ago

The Series C funding, which brings its total raise to around $95 million, will go toward mass production of the startup’s inaugural products

AI chip startup DEEPX secures $80M Series C at a $529M valuation

Startups

Infighting among fintech players has caused TabaPay to ‘pull out’ from buying bankrupt Synapse

Mary Ann Azevedo

2 days ago

A dust-up between Evolve Bank & Trust, Mercury and Synapse has led TabaPay to abandon its acquisition plans of troubled banking-as-a-service startup Synapse.

Infighting among fintech players has caused TabaPay to ‘pull out’ from buying bankrupt Synapse

Media & Entertainment

Apple’s ‘Crush’ ad is disgusting

Devin Coldewey

2 days ago

The problem is not the media, but the message.

Apps

Google built some of the first social apps for Android, including Twitter and others

Sarah Perez

2 days ago

The Twitter for Android client was “a demo app that Google had created and gave to us,” says Particle co-founder and ex-Twitter employee Sara Beykpour.

Lakera launches to protect large language models from malicious prompts

With $10M in backing, Swiss startup launches API to protect companies from prompt injections and more

Data wizardry

EU AI Act

The security blocker

More TechCrunch

Get the industry’s biggest tech news

TechCrunch Daily News

Startups Weekly

TechCrunch Fintech

TechCrunch Mobility

Tags