NIST launches a new platform to assess generative AI

2:17 PM PDT • April 29, 2024

The Department of the Interior building in Washington D.C. — **Image Credits:** Hisham Ibrahim / Getty Images

The National Institute of Standards and Technology (NIST), the U.S. Commerce Department agency that develops and tests tech for the U.S. government, companies and the broader public, on Monday announced the launch of NIST GenAI, a new program spearheaded by NIST to assess generative AI technologies including text- and image-generating AI.

NIST GenAI will release benchmarks, help create “content authenticity” detection (i.e. deepfake-checking) systems and encourage the development of software to spot the source of fake or misleading AI-generated information, explains NIST on the newly launched NIST GenAI website and in a press release.

“The NIST GenAI program will issue a series of challenge problems [intended] to evaluate and measure the capabilities and limitations of generative AI technologies,” the press release reads. “These evaluations will be used to identify strategies to promote information integrity and guide the safe and responsible use of digital content.”

NIST GenAI’s first project is a pilot study to build systems that can reliably tell the difference between human-created and AI-generated media, starting with text. (While many services purport to detect deepfakes, studies and our own testing have shown them to be shaky at best, particularly when it comes to text.) NIST GenAI is inviting teams from academia, industry and research labs to submit either “generators” — AI systems to generate content — or “discriminators,” which are systems designed to identify AI-generated content.

Generators in the study must generate 250-words-or-fewer summaries provided a topic and a set of documents, while discriminators must detect whether a given summary is potentially AI-written. To ensure fairness, NIST GenAI will provide the data necessary to test the generators. Systems trained on publicly available data and that don’t “[comply] with applicable laws and regulations” won’t be accepted,” NIST says.

Registration for the pilot will begin May 1, with the first round of two scheduled to close August 2. Final results from the study are expected to be published in February 2025.

NIST GenAI’s launch and deepfake-focused study comes as the volume of AI-generated misinformation and disinformation info grows exponentially.

According to data from Clarity, a deepfake detection firm, 900% more deepfakes have been created and published this year compared to the same time frame last year. It’s causing alarm, understandably. A recent poll from YouGov found that 85% of Americans were concerned about misleading deepfakes spreading online.

The launch of NIST GenAI is a part of NIST’s response to President Joe Biden’s executive order on AI, which laid out rules requiring greater transparency from AI companies about how their models work and established a raft of new standards, including for labeling content generated by AI.

It’s also the first AI-related announcement from NIST after the appointment of Paul Christiano, a former OpenAI researcher, to the agency’s AI Safety Institute.

Christiano was a controversial choice for his “doomerist” views; he once predicted that “there’s a 50% chance AI development could end in [humanity’s destruction].” Critics, reportedly including scientists within NIST, fear that Cristiano may encourage the AI Safety Institute to focus on “fantasy scenarios” rather than realistic, more immediate risks from AI.

NIST says that NIST GenAI will inform the AI Safety Institute’s work.

More TechCrunch

OpenAI is removing ChatGPT’s AI voice that sounds like Scarlett Johansson

Aisha Malik

24 mins ago

OpenAI is removing one of the voices used by ChatGPT after users found that it sounded similar to Scarlett Johansson, the company announced on Monday. The voice, called Sky, is…

OpenAI is removing ChatGPT’s AI voice that sounds like Scarlett Johansson

Microsoft Build 2024: All the AI and hardware products Microsoft announced

Kyle Wiggers

1 hour ago

Copilot, Microsoft’s brand of generative AI, will soon be far more deeply integrated into the Windows 11 experience.

Microsoft Build 2024: All the AI and hardware products Microsoft announced

Space

TechCrunch Space: Star(side)liner

Aria Alamalhodaei

1 hour ago

Hello and welcome back to TechCrunch Space. For those who haven’t heard, the first crewed launch of Boeing’s Starliner capsule has been pushed back yet again to no earlier than…

Robotics

These 81 robotics companies are hiring

Brian Heater

2 hours ago

When I attended Automate in Chicago a few weeks back, multiple people thanked me for TechCrunch’s semi-regular robotics job report. It’s always edifying to get that feedback in person. While…

Transportation

VinFast crash that killed family of four now under federal investigation

Sean O'Kane

2 hours ago

The top vehicle safety regulator in the U.S. has launched a formal probe into an April crash involving the all-electric VinFast VF8 SUV that claimed the lives of a family…

VinFast crash that killed family of four now under federal investigation

Media & Entertainment

NYC-Dublin real-time video portal reopens with some fixes to prevent inappropriate behavior

Ron Miller

2 hours ago

When putting a video portal in a public park in the middle of New York City, some inappropriate behavior will likely occur. The Portal, the vision of Lithuanian artist and…

NYC-Dublin real-time video portal reopens with some fixes to prevent inappropriate behavior

Contour Venture Partners, an early investor in Datadog and Movable Ink, lowers the target for its fifth fund

Rebecca Szkutak

3 hours ago

Longtime New York-based seed investor, Contour Venture Partners, is making progress on its latest flagship fund after lowering its target. The firm closed on $42 million, raised from 64 backers,…

Contour Venture Partners, an early investor in Datadog and Movable Ink, lowers the target for its fifth fund

Social

Meta’s Oversight Board takes its first Threads case

Sarah Perez

5 hours ago

Meta’s Oversight Board has now extended its scope to include the company’s newest platform, Instagram Threads, and has begun hearing cases from Threads.

Meta’s Oversight Board takes its first Threads case

Venture

SeekOut, a recruiting startup last valued at $1.2 billion, lays off 30% of its workforce

Marina Temkin

6 hours ago

The company says it’s refocusing and prioritizing fewer initiatives that will have the biggest impact on customers and add value to the business.

SeekOut, a recruiting startup last valued at $1.2 billion, lays off 30% of its workforce

Transportation

UK’s autonomous vehicle legislation becomes law, paving the way for first driverless cars by 2026

Paul Sawers

7 hours ago

The U.K.’s self-proclaimed “world-leading” regulations for self-driving cars are now official, after the Automated Vehicles (AV) Act received royal assent — the final rubber stamp any legislation must go through…

UK’s autonomous vehicle legislation becomes law, paving the way for first driverless cars by 2026

ChatGPT: Everything you need to know about the AI-powered chatbot

Alyssa Stringer

Kyle Wiggers

Cody Corrall

7 hours ago

ChatGPT, OpenAI’s text-generating AI chatbot, has taken the world by storm. What started as a tool to hyper-charge productivity through writing essays and code with short text prompts has evolved…

ChatGPT: Everything you need to know about the AI-powered chatbot

Fintech

Fintech lender SoLo Funds is being sued again by the government over its lending practices

Christine Hall

7 hours ago

SoLo Funds CEO Travis Holoway: “Regulators seem driven by press releases when they should be motivated by true consumer protection and empowering equitable solutions.”

Fintech lender SoLo Funds is being sued again by the government over its lending practices

Enterprise

Rollup wants to be the hardware engineer’s workhorse

Aria Alamalhodaei

7 hours ago

Hard tech startups generate a lot of buzz, but there’s a growing cohort of companies building digital tools squarely focused on making hard tech development faster, more efficient and —…

Rollup wants to be the hardware engineer’s workhorse

Startups

Disrupt Audience Choice vote closes Friday

TechCrunch Events

7 hours ago

TechCrunch Disrupt 2024 is not just about groundbreaking innovations, insightful panels, and visionary speakers — it’s also about listening to YOU, the audience, and what you feel is top of…

Disrupt Audience Choice vote closes Friday

Apps

Google is launching a new Android feature to drive users back into their installed apps

Sarah Perez

7 hours ago

Google says the new SDK would help Google expand on its core mission of connecting the right audience to the right content at the right time.

Google is launching a new Android feature to drive users back into their installed apps

Privacy

Jolla debuts privacy-focused AI hardware

Natasha Lomas

8 hours ago

Jolla has taken the official wraps off the first version of its personal server-based AI assistant in the making. The reborn startup is building a privacy-focused AI device — aka…

Jolla debuts privacy-focused AI hardware

ChatGPT’s mobile app revenue saw its biggest spike yet following GPT-4o launch

Sarah Perez

9 hours ago

The ChatGPT mobile app’s net revenue first jumped 22% on the day of the GPT-4o launch and continued to grow in the following days.

ChatGPT’s mobile app revenue saw its biggest spike yet following GPT-4o launch

Social

Bumble buys community building app Geneva to expand further into friendships

Paul Sawers

10 hours ago

Dating app maker Bumble has acquired Geneva, an online platform built around forming real-world groups and clubs. The company said that the deal is designed to help it expand its…

Bumble buys community building app Geneva to expand further into friendships

Security

CyberArk snaps up Venafi for $1.54B to ramp up in machine-to-machine security

Ingrid Lunden

11 hours ago

CyberArk — one of the army of larger security companies founded out of Israel — is acquiring Venafi, a specialist in machine identity, for $1.54 billion.

CyberArk snaps up Venafi for $1.54B to ramp up in machine-to-machine security

Venture

OpenseedVC, which backs operators in Africa and Europe starting their companies, reaches first close of $10M fund

Tage Kene-Okafor

16 hours ago

Founder-market fit is one of the most crucial factors in a startup’s success, and operators (someone involved in the day-to-day operations of a startup) turned founders have an almost unfair advantage…

OpenseedVC, which backs operators in Africa and Europe starting their companies, reaches first close of $10M fund

Startups

Pine Labs gets Singapore court approval to shift base to India

Manish Singh

17 hours ago

A Singapore High Court has effectively approved Pine Labs’ request to shift its operations to India.

Pine Labs gets Singapore court approval to shift base to India

Government & Policy

UK opens office in San Francisco to tackle AI risk

Ingrid Lunden

24 hours ago

The AI Safety Institute, a U.K. body that aims to assess and address risks in AI platforms, has said it will open a second location in San Francisco.

UK opens office in San Francisco to tackle AI risk

Enterprise

Why companies are turning to internal hackathons

Ron Miller

1 day ago

Companies are always looking for an edge, and searching for ways to encourage their employees to innovate. One way to do that is by running an internal hackathon around a…

Why companies are turning to internal hackathons

Featured Article

I’m rooting for Melinda French Gates to fix tech’s broken ‘brilliant jerk’ culture

Women in tech still face a shocking level of mistreatment at work. Melinda French Gates is one of the few working to change that.

Julie Bort

1 day ago

I’m rooting for Melinda French Gates to fix tech’s broken ‘brilliant jerk’ culture

Space

Blue Origin successfully launches its first crewed mission since 2022

Anthony Ha

1 day ago

Blue Origin has successfully completed its NS-25 mission, resuming crewed flights for the first time in nearly two years. The mission brought six tourist crew members to the edge of…

Blue Origin successfully launches its first crewed mission since 2022

Hollywood agency CAA aims to help stars manage their own AI likenesses

Lauren Forristal

1 day ago

Creative Artists Agency (CAA), one of the top entertainment and sports talent agencies, is hoping to be at the forefront of AI protection services for celebrities in Hollywood. With many…

Hollywood agency CAA aims to help stars manage their own AI likenesses

Commerce

Expedia says two execs dismissed after ‘violation of company policy’

Anthony Ha

1 day ago

Expedia says Rathi Murthy and Sreenivas Rachamadugu, respectively its CTO and senior vice president of core services product & engineering, are no longer employed at the travel booking company. In…

Expedia says two execs dismissed after ‘violation of company policy’

Social

OpenAI and Google lay out their competing AI visions

Cody Corrall

2 days ago

Welcome back to TechCrunch’s Week in Review. This week had two major events from OpenAI and Google. OpenAI’s spring update event saw the reveal of its new model, GPT-4o, which…

OpenAI and Google lay out their competing AI visions

Startups

With AI startups booming, nap pods and Silicon Valley hustle culture are back

Julie Bort

2 days ago

When Jeffrey Wang posted to X asking if anyone wanted to go in on an order of fancy-but-affordable office nap pods, he didn’t expect the post to go viral.

With AI startups booming, nap pods and Silicon Valley hustle culture are back

OpenAI created a team to control ‘superintelligent’ AI — then let it wither, source says

Kyle Wiggers

2 days ago

OpenAI’s Superalignment team, responsible for developing ways to govern and steer “superintelligent” AI systems, was promised 20% of the company’s compute resources, according to a person from that team. But…

NIST launches a new platform to assess generative AI

More TechCrunch

Get the industry’s biggest tech news

TechCrunch Daily News

Startups Weekly

TechCrunch Fintech

TechCrunch Mobility

Tags