Startups

Deepfakes for all: Uncensored AI art model prompts ethics questions

Kommentar

Stable Diffusion
Image Credits: Bryce Durbin / TechCrunch

A new open source AI image generator capable of producing realistic pictures from any text prompt has seen stunningly swift uptake in its first week. Stability AI’s Stable Diffusion, high fidelity but capable of being run on off-the-shelf consumer hardware, is now in use by art generator services like Artbreeder, Pixelz.ai and more. But the model’s unfiltered nature means not all the use has been completely above board.

For the most part, the use cases have been above board. For example, NovelAI has been experimenting with Stable Diffusion to produce art that can accompany the AI-generated stories created by users on its platform. Midjourney has launched a beta that taps Stable Diffusion for greater photorealism.

But Stable Diffusion has also been used for less savory purposes. On the infamous discussion board 4chan, where the model leaked early, several threads are dedicated to AI-generated art of nude celebrities and other forms of generated pornography.

Emad Mostaque, the CEO of Stability AI, called it “unfortunate” that the model leaked on 4chan and stressed that the company was working with “leading ethicists and technologies” on safety and other mechanisms around responsible release. One of these mechanisms is an adjustable AI tool, Safety Classifier, included in the overall Stable Diffusion software package that attempts to detect and block offensive or undesirable images.

However, Safety Classifier — while on by default — can be disabled.

Stable Diffusion is very much new territory. Other AI art-generating systems, like OpenAI’s DALL-E 2, have implemented strict filters for pornographic material. (The license for the open source Stable Diffusion prohibits certain applications, like exploiting minors, but the model itself isn’t fettered on the technical level.) Moreover, many don’t have the ability to create art of public figures, unlike Stable Diffusion. Those two capabilities could be risky when combined, allowing bad actors to create pornographic “deepfakes” that — worst-case scenario — might perpetuate abuse or implicate someone in a crime they didn’t commit.

Women, unfortunately, are most likely by far to be the victims of this. A study carried out in 2019 revealed that, of the 90% to 95% of deepfakes that are non-consensual, about 90% are of women. That bodes poorly for the future of these AI systems, according to Ravit Dotan, VP of responsible AI at Mission Control.

“I worry about other effects of synthetic images of illegal content — that it will exacerbate the illegal behaviors that are portrayed,” Dotan told TechCrunch via email. “E.g., will synthetic child [exploitation] increase the creation of authentic child [exploitation]? Will it increase the number of pedophiles’ attacks?”

Montreal AI Ethics Institute principal researcher Abhishek Gupta shares this view. “We really need to think about the lifecycle of the AI system which includes post-deployment use and monitoring, and think about how we can envision controls that can minimize harms even in worst-case scenarios,” he said. “This is particularly true when a powerful capability [like Stable Diffusion] gets into the wild that can cause real trauma to those against whom such a system might be used, for example, by creating objectionable content in the victim’s likeness.”

Something of a preview played out over the past year when, at the advice of a nurse, a father took pictures of his young child’s swollen genital area and texted them to the nurse’s iPhone. The photo automatically backed up to Google Photos and was flagged by the company’s AI filters as child sexual abuse material, which resulted in the man’s account being disabled and an investigation by the San Francisco Police Department.

If a legitimate photo could trip such a detection system, experts like Dotan say, there’s no reason deepfakes generated by a system like Stable Diffusion couldn’t — and at scale.

“The AI systems that peofple create, even when they have the best intentions, can be used in harmful ways that they don’t anticipate and can’t prevent,” Dotan said. “I think that developers and researchers often underappreciated this point.”

Of course, the technology to create deepfakes has existed for some time, AI-powered or otherwise. A 2020 report from deepfake detection company Sensity found that hundreds of explicit deepfake videos featuring female celebrities were being uploaded to the world’s biggest pornography websites every month; the report estimated the total number of deepfakes online at around 49,000, over 95% of which were porn. Actresses including Emma Watson, Natalie Portman, Billie Eilish and Taylor Swift have been the targets of deepfakes since AI-powered face-swapping tools entered the mainstream several years ago, and some, including Kristen Bell, have spoken out against what they view as sexual exploitation.

But Stable Diffusion represents a newer generation of systems that can create incredibly — if not perfectly — convincing fake images with minimal work by the user. It’s also easy to install, requiring no more than a few setup files and a graphics card costing several hundred dollars on the high end. Work is underway on even more efficient versions of the system that can run on an M1 MacBook.

Sebastian Berns, a Ph.D. researcher in the AI group at Queen Mary University of London, thinks the automation and the possibility to scale up customized image generation are the big differences with systems like Stable Diffusion — and main problems. “Most harmful imagery can already be produced with conventional methods but is manual and requires a lot of effort,” he said. “A model that can produce near-photorealistic footage may give way to personalized blackmail attacks on individuals.”

Berns fears that personal photos scraped from social media could be used to condition Stable Diffusion or any such model to generate targeted pornographic imagery or images depicting illegal acts. There’s certainly precedent. After reporting on the rape of an eight-year-old Kashmiri girl in 2018, Indian investigative journalist Rana Ayyub became the target of Indian nationalist trolls, some of whom created deepfake porn with her face on another person’s body. The deepfake was shared by the leader of the nationalist political party BJP, and the harassment Ayyub received as a result became so bad the United Nations had to intervene.

“Stable Diffusion offers enough customization to send out automated threats against individuals to either pay or risk having fake but potentially damaging footage being published,” Berns continued. “We already see people being extorted after their webcam was accessed remotely. That infiltration step might not be necessary anymore.”

With Stable Diffusion out in the wild and already being used to generate pornography — some non-consensual — it might become incumbent on image hosts to take action. TechCrunch reached out to one of the major adult content platforms, OnlyFans, who said that it would “continuously” update its technology to “address the latest threats to creator and fan safety, including deepfakes.”

“All content on OnlyFans is reviewed with state-of-the-art digital technologies and then manually reviewed by our trained human moderators to ensure that any person featured in the content is a verified OnlyFans creator, or that we have a valid release form,” an OnlyFans spokesperson said via email. “Any content which we suspect may be a deepfake is deactivated.”

A spokesperson for Patreon, which also allows adult content, noted that the company has a policy against deepfakes and disallows images that “repurpose celebrities’ likenesses and place non-adult content into an adult context.”

“Patreon constantly monitors emerging risks, like [AI-generated deepfakes]. Today, we do have policies in place that don’t allow abusive behavior to real people and that disallows anything that could cause real world harm,” the Patreon spokesperson continued in an email. “As technology or new potential risks emerge, we’ll follow the process we have in place: working closely with creators to craft policies for Patreon, including what benefits are allowed and what kind of content is within guidelines.”

This startup is setting a DALL-E 2-like AI free, consequences be damned

If history is any indication, however, enforcement will likely be uneven — in part because few laws specifically protect against deepfaking as it relates to pornography. And even if the threat of legal action pulls some sites dedicated to objectionable AI-generated content under, there’s nothing to prevent new ones from popping up.

In other words, Gupta says, it’s a brave new world.

“Creative and malicious users can abuse the capabilities [of Stable Diffusion] to generate subjectively objectionable content at scale, using minimal resources to run inference — which is cheaper than training the entire model — and then publish them in venues like 4chan to drive traffic and hack attention,” Gupta said. “There is a lot at stake when such capabilities escape out ‘into the wild’ where controls such as API rate limits, safety controls on the kinds of outputs returned from the system are no longer applicable.”

Editor’s note: An earlier version of this article included images depicting some of the celebrity deepfakes in question, but those have since been removed.

More TechCrunch

Welcome to Startups Weekly — your weekly recap of everything you can’t miss from the world of startups. Want it in your inbox every Friday? Sign up here. This week…

Some startups and investors are more risk-averse than others

Silicon Valley startup accelerator Y Combinator will expand the number of cohorts it runs each year from two to four starting in 2025, Bloomberg reported Thursday. Y Combinator president Garry…

Y Combinator expanding to four cohorts a year in 2025

Telegram has had a tough few weeks. The messaging app’s founder, Pavel Durov, was arrested in late August and later released on a €5 million bail in France, charged with…

Telegram CEO Durov’s arrest hasn’t dampened enthusiasm for its TON blockchain

Martin Casado, a general partner at Andreessen Horowitz, will tackle one of the most pressing issues facing today’s tech world — AI regulation — only at TechCrunch Disrupt 2024, taking…

A fireside chat with Andreessen Horowitz partner Martin Casado at TechCrunch Disrupt 2024

Christina Cacioppo, CEO and co-founder of Vanta, will be on the SaaS Stage at TechCrunch Disrupt 2024 to reveal how Vanta is redefining security and compliance automation and driving innovation…

Vanta’s Christina Cacioppo takes the stage at TechCrunch Disrupt 2024

On Thursday, cybersecurity giant Fortinet disclosed a breach involving customer data.  In a statement posted online, Fortinet said an individual intruder accessed “a limited number of files” stored on a…

Fortinet confirms customer data breach

Meta has confirmed that it’s restarting efforts to train its AI systems using public Facebook and Instagram posts from its U.K. userbase. The company claims it has “incorporated regulatory feedback” into a…

Meta reignites plans to train AI using UK users’ public Facebook and Instagram posts

Following the moves of other tech giants, Spotify announced on Friday it’s introducing in-app parental controls in the form of “managed accounts” for listeners under the age of 13. The…

Spotify begins piloting parent-managed accounts for kids on family plans

Uber users in Austin and Atlanta will be able to hail Waymo robotaxis through the app in early 2025 as part of a partnership between the two companies. 

Waymo robotaxis to become available on Uber in Austin, Atlanta in early 2025

There are plenty of calendar and scheduling apps that take care of your professional life and help you slot in meetings with your teammates and work collaborators. Howbout is all…

Howbout raises $8M from Goodwater to build a calendar that you can share with your friends

Delhivery claims Ecom Express has inaccurately represented Delhivery’s business metrics when drawing comparisons in its IPO filing. 

SoftBank-backed Delhivery contests metrics in rival Ecom Express’ IPO filing

It was a matter of time, but Apple is going to allow third-party app stores on the iPad starting next week, on September 16. This change will occur with the…

Alternative app stores will be allowed on Apple iPad in the EU from September 16

The U.K.’s antitrust regulator has delivered its provisional ruling in a longstanding battle to combine two of the country’s major telecommunication operators. The Competition and Markets Authority (CMA) says that…

Three and Vodafone’s $19B merger hits the skids as UK rules the deal would adversely impact customers and MVNOs

Late Thursday evening, Oprah Winfrey aired a special on AI, appropriately titled “AI and the Future of Us.” Guests included OpenAI CEO Sam Altman, tech influencer Marques Brownlee, and current…

Oprah just had an AI special with Sam Altman and Bill Gates — here are the highlights

Antonio Moraes, the grandson of a late prominent Brazilian billionaire, was never interested in joining the family-owned conglomerate of construction companies and a bank. Shortly after graduating from college, he…

XP Health grabs $33M to bring employees more affordable vision care

A crew of four private astronauts made history in the early hours of Thursday when they opened the hatch of their SpaceX Dragon capsule and conducted the first commercial spacewalk. …

Polaris Dawn astronauts perform historic private spacewalk while wearing SpaceX-made suits

Keith Rabois, managing director of Khosla Ventures, was having dinner with a “very successful CEO” in October 2018 when the CEO asked him a question: How many people does it…

Keith Rabois says Miami is still a great place for startups, even as a16z leaves

By making the AI info label harder to find, it might be easier for users to be deceived by content that was edited with AI, especially as editing tools become…

Meta is making its AI info label less visible on content edited or modified by AI tools

Cohost, a would-be X rival launched to the public in June 2022, is shutting down, the company announced via the social network’s staff account earlier this week. The service had…

Cohost, the X rival founded with an anti-Big Tech manifesto, is running out of money and will shut down

At the MTV Video Music Awards (VMAs) on Wednesday night, new technology allowed fans to shop their favorite artists’ styles as they appeared on the screen. Though the drama from…

Shopsense AI lets music fans buy dupes inspired by red-carpet looks at the VMAs

Featured Article

A comprehensive list of 2024 tech layoffs

A complete list of all the known layoffs in tech, from Big Tech to startups, broken down by month throughout 2024.

A comprehensive list of 2024 tech layoffs

Working away on his PhD in Munich only a few years ago, Stephan Herrmann (now a doctor) couldn’t have conceived of a time when his idea for a carbon-negative power…

This startup is making manure out of other biogas power plants and now has $62M to play with

ChatGPT, OpenAI’s text-generating AI chatbot, has taken the world by storm since its launch in November 2022. What started as a tool to hyper-charge productivity through writing essays and code…

ChatGPT: Everything you need to know about the AI-powered chatbot

Faraday Future is doling out big raises and bonuses to its CEO and its founder, despite having delivered just 13 cars in its 10-year history and recently laying off or…

Faraday Future gives CEO and founder raises and bonuses after delivering 13 cars

We’re out-of-this-world excited to announce that we’ve finalized our dedicated Space Stage at TechCrunch Disrupt 2024. It joins Fintech, SaaS and AI as the other industry-focused stages — all under…

Announcing the final agenda for the Space Stage at TechCrunch Disrupt 2024

Online sports apparel retailer Fanatics has agreed to settle and drop a lawsuit that it filed against troubled one-click payments provider Bolt in March, according to court documents obtained by…

Bolt has quietly settled its lawsuit with Fanatics amid ongoing boardroom drama

Rajeev Behera’s new all-on-one HR startup, dubbed Every, is either brilliant or crazy.

Why Y Combinator companies are flocking to banking and HR startup Every

It’s a small advance, but one that speaks to Meta’s enginerring team paying attention to how the fediverse community is trying to educate Threads users about the possibilities.  

Threads makes it easier to evangelize the open social web with a new direct link feature

Welcome back to TechCrunch Mobility — your central hub for news and insights on the future of transportation. Sign up here for free — just click TechCrunch Mobility! The transportation…

Autonomous delivery startup Nuro pivots and another Indian EV scooter startup takes the IPO road

ChatGPT maker OpenAI has announced a model that can effectively fact-check itself by “reasoning” through questions.

OpenAI unveils o1, a model that can fact-check itself