Security

When generative AI cyberthreats arrive, Wraithwatch will be ready and waiting

Comment

Artificial intelligence cpu is generating user requests, 3d rendering
Image Credits: mesh cube / Getty Images

Generative AI is pervading just about every industry already, whether we like it or not, and cybersecurity is no exception. The possibility of AI-accelerated malware development and autonomous attacks should alarm any sysadmin even at this early stage. Wraithwatch is a new security outfit that aims to fight fire with fire, deploying good AI to fight the bad ones.

The image of righteous AI agents battling against evil ones in cyberspace is already pretty romanticized, so let’s be clear from the outset that it’s not a Matrix-style melee. This is about software automation enabling malicious actors the same way it enables the rest of us.

Employees at SpaceX and Anduril until just a few months ago, Nik Seetharaman, Grace Clemente and Carlos Más witnessed firsthand the storm of threats every company with something valuable to hide (think aerospace, defense, finance) is subject to at all hours.

“This has been going on for 30-plus years, and LLMs are only going to make it worse,” said Seetharaman. “There’s not enough dialogue about the implications of generative AI on the offensive side of the landscape.”

A simple version of the threat model is a variation on a normal software development process. A developer working on an ordinary project might do one part of the code personally, then tell an AI copilot to use that code as a guide to make a similar function in five other languages. And if it doesn’t work, the system can iterate until it does, or even create variants to see if one performs better or is more easily audited. Useful, but not a miracle. Someone’s still responsible for that code.

But think about a malware developer. They can use the same process to create multiple versions of a piece of malicious software in a few minutes, shielding them from the surface-level “brittle” detection methods that search for package sizes, common libraries and other telltale signs of a piece of malware or its creator.

“It’s trivial for a foreign power to point a worm at an LLM and say ‘hey, mutate yourself into a thousand versions,’ and then launch all 1,000 at once. In our testing, there are uncensored open source models that are happy to take your malware and mutate them in any direction you wish,” explained Seetharaman. “The bad guys are out there, and they don’t care about alignment — you yourself have to force the LLMs to explore the dark side, and map those to how you’ll actually defend if it happens.”

A reactive industry

The platform Wraithwatch is building, and hopes to have operational commercially next year, has more in common with war games than traditional cybersecurity operations, which tend to be “fundamentally reactive” to threats others have detected, they said. The speed and variety of attacks may soon overwhelm the largely manual and human-driven cybersecurity response policies most companies use.

As the company writes in a blog post:

New vulnerabilities and attack techniques — a weekly occurrence — are difficult to understand and mitigate, requiring in-depth analysis in order to comprehend underlying attack mechanics and manually translate that understanding into appropriate defensive strategies.

“Part of the challenge for cyber teams is, we wake up in the morning and learn about a zero day [the name given to security vulnerabilities where the vendor has no advance notice to fix them] — but by the time we are reading about it, there are already blogs about the new variation that it has mutated to,” said Clemente. “And if you’re at SpaceX or Anduril or the U.S. government, you’re getting some fresh custom version made just for you. We can’t rely on waiting until someone else gets hit.”

Though these custom attacks are largely human-made now, like the defenses against them, we have already seen the beginnings of generative cyberthreats in things like WormGPT. That one may have been rudimentary, but it’s a question of when, not if, improved models are brought to bear on the problem.

There’s no reason to panic over WormGPT

Más noted that current LLMs have limitations in their capabilities and alignment. But security researchers have already demonstrated how mainstream code-generation APIs like OpenAI’s can be tricked into aiding a malicious actor, as well as the above-mentioned open models that can be run without alignment restrictions (evading “Sorry, I can’t create malware”-type responses).

“If you start getting creative with how you use an API, you can get a response that you might not expect,” Más said. But it’s about more than just coding. “One of the ways in which agencies detect or suspect who is behind an attack is they have signatures: the attacks they use, the binaries they use… imagine a world where you can have an LLM generate signatures like that. You click a bot and you have a brand new APT [advanced persistent threat, e.g. a state-sponsored hacking outfit].”

It’s even possible, Seetharaman said, that the new agent-type AIs trained to interact with multiple software platforms and APIs as if they’re human users, could be spun up to act as semi-autonomous threats to attack persistently and in coordination. If your cybersecurity team is prepared to counter this level of constant attack, it is likely only a matter of time before there’s a breach.

War games

So what’s the solution? Basically, a cybersecurity platform that leverages AI to tailor its detection and countermeasures to what an offensive AI is likely to throw at it.

“We were very deliberate about being a security company that does AI, and not an AI company that does security. We’ve been on the other side of the keyboard, and we saw until the last few days [at their respective companies] the kind of attacks they were throwing at us. We know the lengths they will go to,” said Clemente.

From left, Wraithwatch co-founders Carlos Más, Nik Seetharaman and Grace Clemente. Image Credits: Wraithwatch

And while a company like Meta or SpaceX may have top-tier security experts on site, not every company can stand up a team like that (think a 10-person subcontractor for an aerospace prime), and at any rate the tools they’re working with might not be up to the task. The entire system of reporting, responding and disclosing may be challenged by malicious actors empowered by LLMs.

“We’ve seen every cybersecurity tool on the planet and they are all lacking in some way. We want to sit as a command and control layer on top of those tools, tie a thread through them and transform what needs transforming,” Seetharaman said.

By using the same methods as attackers would in a sandboxed environment, Wraithwatch can characterize and predict the types of variations and attacks that LLM-infused malware could deploy, or so they hope. The ability of AI models to spot signal in noise is potentially useful in setting up layers of perception and autonomy that can detect and possibly even respond to threats without human intervention — not to say that it’s all automated, but the system could prepare to block a hundred likely variants of a new attack, for instance, as quickly as its admins want to run out patches to the original.

“The vision is that there’s a world where when you wake up wondering if you’ve already been breached, but Wraithwatch is already simulating these attacks in the thousands and saying here are the changes you need to make, and automating those changes as far as possible,” said Clemente.

Though the small team is “several thousand lines of code” into the project, it’s still early days. Part of the pitch, however, is that as certain as it is that malicious actors are exploring this technology, big corporations and nation-states are likely to be as well — or at the very least, it is healthy to assume this rather than the opposite. A small, agile startup comprising veterans of companies under serious threat, armed with a pile of VC money, could very well leapfrog the competition, being unfettered with the usual corporate baggage.

The $8 million seed round was led by Founders Fund, with participation by XYZ Capital and Human Capital. The aim is to put it to work as fast as possible, since at this point it is fair to consider it a race. “Since we come from companies with aggressive timelines, the goal is to have a resilient MVP with most features deployed to our design partners in Q1 of next year,” with a wider commercial product coming by the end of 2024, Seetharaman said.

It may all seem a little over the top, talking about AI agents laying siege to U.S. secrets in a secret war in cyberspace, and we’re still a ways off from that particular airport thriller blurb. But an ounce of preparation is worth a hell of a lot of cure, especially when things are as unpredictable and fast-moving as they are in the world of AI. Let’s hope that the problems Wraithwatch and others warn of are at least a few years off — but in the meantime, it’s clear that investors think those with secrets to protect will want to take preventative action.

More TechCrunch

Welcome to Week in Review: TechCrunch’s newsletter recapping the week’s biggest news. This week Apple unveiled new iPad models at its Let Loose event, including a new 13-inch display for…

Why Apple’s ‘Crush’ ad is so misguided

The U.K. Safety Institute, the U.K.’s recently established AI safety body, has released a toolset designed to “strengthen AI safety” by making it easier for industry, research organizations and academia…

U.K. agency releases tools to test AI model safety

AI startup Runway’s second annual AI Film Festival showcased movies that incorporated AI tech in some fashion, from backgrounds to animations.

At the AI Film Festival, humanity triumphed over tech

Rachel Coldicutt is the founder of Careful Industries, which researches the social impact technology has on society.

Women in AI: Rachel Coldicutt researches how technology impacts society

SAP Chief Sustainability Officer Sophia Mendelsohn wants to incentivize companies to be green because it’s profitable, not just because it’s right.

SAP’s chief sustainability officer isn’t interested in getting your company to do the right thing

Here’s what one insider said happened in the days leading up to the layoffs.

Tesla’s profitable Supercharger network is in limbo after Musk axed the entire team

StrictlyVC events deliver exclusive insider content from the Silicon Valley & Global VC scene while creating meaningful connections over cocktails and canapés with leading investors, entrepreneurs and executives. And TechCrunch…

Meesho, a leading e-commerce startup in India, has secured $275 million in a new funding round.

Meesho, an Indian social commerce platform with 150M transacting users, raises $275M

Some Indian government websites have allowed scammers to plant advertisements capable of redirecting visitors to online betting platforms. TechCrunch discovered around four dozen “gov.in” website links associated with Indian states,…

Scammers found planting online betting ads on Indian government websites

Around 550 employees across autonomous vehicle company Motional have been laid off, according to information taken from WARN notice filings and sources at the company.  Earlier this week, TechCrunch reported…

Motional cut about 550 employees, around 40%, in recent restructuring, sources say

The deck included some redacted numbers, but there was still enough data to get a good picture.

Pitch Deck Teardown: Cloudsmith’s $15M Series A deck

The company is describing the event as “a chance to demo some ChatGPT and GPT-4 updates.”

OpenAI’s ChatGPT announcement: What we know so far

Unlike ChatGPT, Claude did not become a new App Store hit.

Anthropic’s Claude sees tepid reception on iOS compared with ChatGPT’s debut

Welcome to Startups Weekly — Haje‘s weekly recap of everything you can’t miss from the world of startups. Sign up here to get it in your inbox every Friday. Look,…

Startups Weekly: Trouble in EV land and Peloton is circling the drain

Scarcely five months after its founding, hard tech startup Layup Parts has landed a $9 million round of financing led by Founders Fund to transform composites manufacturing. Lux Capital and Haystack…

Founders Fund leads financing of composites startup Layup Parts

AI startup Anthropic is changing its policies to allow minors to use its generative AI systems — in certain circumstances, at least.  Announced in a post on the company’s official…

Anthropic now lets kids use its AI tech — within limits

Zeekr’s market hype is noteworthy and may indicate that investors see value in the high-quality, low-price offerings of Chinese automakers.

The buzziest EV IPO of the year is a Chinese automaker

Venture capital has been hit hard by souring macroeconomic conditions over the past few years and it’s not yet clear how the market downturn affected VC fund performance. But recent…

VC fund performance is down sharply — but it may have already hit its lowest point

The person who claims to have 49 million Dell customer records told TechCrunch that he brute-forced an online company portal and scraped customer data, including physical addresses, directly from Dell’s…

Threat actor says he scraped 49M Dell customer addresses before the company found out

The social network has announced an updated version of its app that lets you offer feedback about its algorithmic feed so you can better customize it.

Bluesky now lets you personalize main Discover feed using new controls

Microsoft will launch its own mobile game store in July, the company announced at the Bloomberg Technology Summit on Thursday. Xbox president Sarah Bond shared that the company plans to…

Microsoft is launching its mobile game store in July

Smart ring maker Oura is launching two new features focused on heart health, the company announced on Friday. The first claims to help users get an idea of their cardiovascular…

Oura launches two new heart health features

Keeping up with an industry as fast-moving as AI is a tall order. So until an AI can do it for you, here’s a handy roundup of recent stories in the world…

This Week in AI: OpenAI considers allowing AI porn

Garena is quietly developing new India-themed games even though Free Fire, its biggest title, has still not made a comeback to the country.

Garena is quietly making India-themed games even as Free Fire’s relaunch remains doubtful

The U.S.’ NHTSA has opened a fourth investigation into the Fisker Ocean SUV, spurred by multiple claims of “inadvertent Automatic Emergency Braking.”

Fisker Ocean faces fourth federal safety probe

CoreWeave has formally opened an office in London that will serve as its European headquarters and home to two new data centers.

CoreWeave, a $19B AI compute provider, opens European HQ in London with plans for 2 UK data centers

The Series C funding, which brings its total raise to around $95 million, will go toward mass production of the startup’s inaugural products

AI chip startup DEEPX secures $80M Series C at a $529M valuation 

A dust-up between Evolve Bank & Trust, Mercury and Synapse has led TabaPay to abandon its acquisition plans of troubled banking-as-a-service startup Synapse.

Infighting among fintech players has caused TabaPay to ‘pull out’ from buying bankrupt Synapse

The problem is not the media, but the message.

Apple’s ‘Crush’ ad is disgusting

The Twitter for Android client was “a demo app that Google had created and gave to us,” says Particle co-founder and ex-Twitter employee Sara Beykpour.

Google built some of the first social apps for Android, including Twitter and others