moderation

Bluesky has launched a new product roadmap for the coming months. The decentralized social network said on Tuesday that it is planning to introduce direct messages, support for videos, improved…

Bluesky to add DMs, video support and in-app custom feed curation

Meta’s move into the open social web, also known as the fediverse, is puzzling. Does the Facebook owner see open protocols as the future? Will it embrace the fediverse only…

Why Meta is looking to the fediverse as the future for social media

A few years ago, Karine Mellata and Michael Lin met while working at Apple’s fraud engineering and algorithmic risk team. Both engineers, Mellata and Lin were involved with helping to…

Y Combinator-backed Intrinsic is building infrastructure for trust and safety teams

Bluesky, the startup aiming to build a decentralized social network to take on Twitter/X, says it has begun deploying new safety tooling to help moderate content on the network through…

Bluesky rolls out automated moderation tools, plus user and moderation lists

Liz O’Sullivan is on a mission to make AI “a little bit safer,” in her own words. A member of the National AI Advisory Committee, which drafts recommendations to the…

Vera wants to use AI to cull generative models’ worst behaviors

Two years after announcing voice chat was coming to Roblox, the gaming company has acquired a voice tech startup, Speechly, offering voice chat moderation, real-time transcription and Voice API that…

Roblox acquires voice moderation startup Speechly

X, formerly known as Twitter, has filed a lawsuit alleging that a new California law requiring social networks to declare certain moderation practices is a violation of the company’s Constitutional…

X, formerly Twitter, challenges California’s new transparency law as unconstitutional

Reddit is launching the “Mod Helper Program” to reward moderators who offer helpful advice to other moderators, along with an updated moderator help center. The announcement comes amid growing discontent…

Reddit launches moderator rewards program amid site-wide discontent

OpenAI claims that it’s developed a way to use GPT-4, its flagship generative AI model, for content moderation — lightening the burden on human teams. Detailed in a post published…

OpenAI proposes a new way to use GPT-4 for content moderation

Featured Article

Reddit’s menswear hub is the latest casualty of its battle with moderators

The widespread protests against Reddit’s API changes reached a boiling point last month after the company forcibly reopened r/malefashionadvice (the largest subreddit that stayed dark after the blackout), booted the moderators, and appointed new ones. The moderators of r/malefashionadvice (MFA) opted to keep the subreddit private after the blackout, despite warnings from Reddit…

2:21 pm PDT • August 9, 2023

Featured Article

Bluesky’s growing pains strain its relationship with Black users

Bluesky, the decentralized social network and frontrunner alternative to Twitter, has been hailed as a wonderland of funny posts and good vibes. But a moderation policy change that followed a death threat against a Black user has many on Bluesky questioning if the platform is safe for marginalized communities after all. Bluesky had around 50,000…

10:07 am PDT • June 8, 2023

In an effort to put more onus on crowdsourced moderation, Twitter has launched Community Notes for images in posts. The company is aiming to address scenarios of morphed images or…

Twitter launches Community Notes for images

Microsoft is launching a new AI-powered moderation service that it says is designed to foster safer online environments and communities. Called Azure AI Content Safety, the new offering, available through…

Microsoft launches new AI tool to moderate text and images

Twitter said today that it will add labels to tweets that have been flagged by the company to reduce their visibility. Elon Musk & Co. touted this as a “freedom…

Twitter will now show labels on tweets with reduced visibility

Twitter announced today a new policy that it claims will offer more transparency around which hateful tweets on its platform have been subject to enforcement action. Typically, when tweets violate…

Twitter to label tweets that get downranked for violating its hate speech policy

Meta’s Oversight Board, which independently evaluates difficult content moderation decisions, has overturned the company’s takedown of two posts that depicted a nonbinary and transgender person’s bare chest. The case represents…

Oversight Board presses Meta to revise ‘convoluted and poorly defined’ nudity policy

Oracle has begun auditing TikTok’s algorithms and content moderation models, according to a new report from Axios out this morning. Those reviews began last week, and follow TikTok’s June announcement…

Oracle now monitoring TikTok’s algorithms and moderation system for manipulation by China’s government

Kickstarter announced today it will now automatically hide from public view comments reported by creators until its Trust and Safety team has reviewed them and made a decision as to…

Kickstarter will now hide reported comments pending review

Months after TikTok was hauled into its first-ever major congressional hearing over platform safety, the company is today announcing a series of policy updates and plans for new features and…

TikTok updates its policies with focus on minor and LGBTQ safety, age appropriate content and more

In a proposed class-action lawsuit, TikTok moderator Candie Frazier said that she has screened videos showing violence, school shootings, fatal falls and even cannibalism.

TikTok moderator sues over mental trauma caused by graphic videos

Twitch announced today that it will add new channel-level security features to help curb harassment on the platform. Now, creators and moderators can enable verified chat, requiring chatters to validate…

Twitch adds phone-verified chat, expands email authentication settings as users face ‘hate raids’

Facebook today introduced a new set of tools aimed at helping Facebook Group administrators get a better handle on their online communities and, potentially, help keep conversations from going off…

Facebook rolls out new tools for Group admins, including automated moderation aids

Florida governor Ron DeSantis has signed into law a restriction on social media companies’ ability to ban candidates for state offices and news outlets, and in doing so offered a…

Florida’s ban on bans will test First Amendment rights of social media companies

If social networks and other platforms are to get a handle on disinformation, it’s not enough to know what it is — you have to know how people react to…

Debunk, don’t ‘prebunk,’ and other psychology lessons for social media moderation

Former presidential advisor and right-wing pundit Steve Bannon had his show suspended from Twitter and an episode removed by YouTube after calling for violence against FBI director Christopher Wray and…

Steve Bannon’s show pulled off Twitter and YouTube over calls for violence

In October, TikTok tapped corporate law firm K&L Gates to advise the company on its moderation policies and other topics afflicting social media platforms. As a part of those efforts,…

TikTok brings in outside experts to help it craft moderation and content policies

Featured Article

‘Behind the Screen’ illuminates the invisible, indispensable content moderation industry

The moderators who sift through the toxic detritus of social media have gained the spotlight recently, but they’ve been important for far longer — longer than internet giants would like you to know. In her new book “Behind the Screen,” UCLA’s Sarah Roberts illuminates the history of this scrupulously hidden workforce and the many forms…

12:26 pm PDT • August 28, 2019

Facebook released new figures about its attempts to stop the spread of videos after a shooter livestreamed his attacks on two Christchurch, New Zealand mosques last Friday, killing 50 people.…

Facebook says the original New Zealand shooter video was viewed about 4,000 times before removal

Twitter today confirmed it’s developing a new “Hide Tweet” feature, which it says will give users another option to protect their conversations. The option, spotted in Twitter’s code, is available…

Twitter confirms it’s working on a ‘Hide Tweet’ feature

Facebook announced today that it removed 8.7 million pieces of content last quarter that violated its rules against child exploitation, thanks to new technology. The new AI and machine…

Facebook says it removed 8.7M child exploitation posts with new machine learning tech