Get all your news in one place.

100’s of premium titles.
One app.

Start reading

Get all your news in one place.

100’s of premium titles. One news app.

Start reading

The Guardian - US

Technology

Ava Sasani

As AI tools get smarter, they’re growing more covertly racist, experts find

Timnit Gebru Gemini Google

AI models were more likely to assign AAVE speakers to lower-paying jobs compared with speakers of ‘standard American English’. Photograph: Bloomberg/Getty Images

Popular artificial intelligence tools are becoming more covertly racist as they advance, says an alarming new report.

A team of technology and linguistics researchers revealed this week that large language models like OpenAI’s ChatGPT and Google’s Gemini hold racist stereotypes about speakers of African American Vernacular English, or AAVE, an English dialect created and spoken by Black Americans.

“We know that these technologies are really commonly used by companies to do tasks like screening job applicants,” said Valentin Hoffman, a researcher at the Allen Institute for Artificial Intelligence and co-author of the recent paper, published this week in arXiv, an open-access research archive from Cornell University.

Hoffman explained that previously researchers “only really looked at what overt racial biases these technologies might hold” and never “examined how these AI systems react to less overt markers of race, like dialect differences”.

Black people who use AAVE in speech, the paper says, “are known to experience racial discrimination in a wide range of contexts, including education, employment, housing, and legal outcomes”.

Hoffman and his colleagues asked the AI models to assess the intelligence and employability of people who speak using AAVE compared to people who speak using what they dub “standard American English”.

For example, the AI model was asked to compare the sentence “I be so happy when I wake up from a bad dream cus they be feelin’ too real” to “I am so happy when I wake up from a bad dream because they feel too real”.

The models were significantly more likely to describe AAVE speakers as “stupid” and “lazy”, assigning them to lower-paying jobs.

Hoffman worries that the results mean that AI models will punish job candidates for code-switching – the act of altering how you express yourself based on your audience – between AAVE and standard American English.

“One big concern is that, say a job candidate used this dialect in their social media posts,” he told the Guardian. “It’s not unreasonable to think that the language model will not select the candidate because they used the dialect in their online presence.”

The AI models were also significantly more likely to recommend the death penalty for hypothetical criminal defendants that used AAVE in their court statements.

“I’d like to think that we are not anywhere close to a time when this kind of technology is used to make decisions about criminal convictions,” said Hoffman. “That might feel like a very dystopian future, and hopefully it is.”

Still, Hoffman told the Guardian, it is difficult to predict how language learning models will be used in the future.

“Ten years ago, even five years ago, we had no idea all the different contexts that AI would be used today,” he said, urging developers to heed the new paper’s warnings on racism in large language models.

Notably, AI models are already used in the US legal system to assist in administrative tasks like creating court transcripts and conducting legal research.

For years, leading AI experts like Timnit Gebru, former co-leader of Google’s ethical artificial intelligence team, have called for the federal government to curtail the mostly unregulated use of large language models.

“It feels like a gold rush,” Gebru told the Guardian last year. “In fact, it is a gold rush. And a lot of the people who are making money are not the people actually in the midst of it.”

Current Gemini results for “Pope” pic.twitter.com/lYt3UO8NiZ
— Bradley Productions (@productions86) February 22, 2024

Google’s AI model, Gemini, found itself in hot water recently when a slew of social media posts showed its image generation tool depicting a variety of historical figures – including popes, founding fathers of the US and, most excruciatingly, German second world war soldiers – as people of color.

Large language models improve as they are fed more data, learning to more closely mimic human speech by studying text from billions of web pages across the internet. The long-acknowledged conceit of this learning process is that the model will spew whatever racist, sexist, and otherwise harmful stereotypes it encounters on the internet: in computing, this problem is described by the adage “garbage in, garbage out”. Racist input leads to racist output, causing early AI chatbots like Microsoft’s Tay to regurgitate the same neo-Nazi content it learned from Twitter users in 2016.

In response, groups like OpenAI developed guardrails, a set of ethical guidelines that regulate the content that language models like ChatGPT can communicate to users. As language models become larger, they also tend to become less overtly racist.

But Hoffman and his colleagues found that, as language models grow, covert racism increases. Ethical guardrails, they learned, simply teach language models to be more discreet about their racial biases.

“It doesn’t eliminate the underlying problem; the guardrails seem to emulate what educated people in the United States do,” said Avijit Ghosh, an AI ethics researcher at Hugging Face, whose work focuses on the intersection of public policy and technology.

“Once people cross a certain educational threshold, they won’t call you a slur to your face, but the racism is still there. It’s a similar thing in language models: garbage in, garbage out. These models don’t unlearn problematic things, they just get better at hiding it.”

The US private sector’s open-armed embrace of language models is expected to intensify over the next decade: the broader market of generative AI is projected to become a $1.3tn industry by 2032, according to Bloomberg. Meanwhile, federal labor regulators like the Equal Employment Opportunity Commission only recently began shielding workers from AI-based discrimination, with the first case of its kind coming before the EEOC late last year.

Ghosh is part of the growing contingent of AI experts who, like Gebru, worry about the harm the language learning models might cause if technological advancements continue to outpace federal regulation.

“You don’t need to stop innovation or slow AI research, but curtailing the use of these technologies in certain sensitive areas is an excellent first step,” he said. “Racist people exist all over the country; we don’t need to put them in jail, but we try to not allow them to be in charge of hiring and recruiting. Technology should be regulated in a similar way.”

Read news from 100’s of titles, curated specifically for you.

Already a member? Sign in here

Top stories on inkl right now

A dump truck trying to help with a sinkhole in New Jersey swallowed up by another sinkhole: ‘Honestly, I am still in shock’

Most homes on the Phillipsburg street have power restored, but some remain without water or have low pressure, town officials said Thursday

The Independent UK

White House grants ICE power to detain refugees for aggressive ‘rescreening’

A new DHS memo details plan to allow federal immigration officers to detain legal refugees in the US indefinitely

The Guardian - US

Who was Quentin Deranque? 23‑year-old French far-right activist killed in Lyon attack

Quentin Deranque, a 23‑year-old far-right activist, died on February 14 after being brutally beaten in Lyon during clashes between left- and right-wing demonstrators.

The Times of India

Israeli government installed security equipment, controlled access to Epstein’s Manhattan apartment used by ex-PM Barak

The Times of India

One subscription that gives you access to news from hundreds of sites

Already a member? Sign in here

Donald Trump’s DOJ just admitted to violating over 50 court orders since December 2025

The US legal system is breaking down.

We Got This Covered

First Thing: Former prince Andrew arrested at Sandringham estate

Police assessing if Mountbatten-Windsor shared sensitive information with Jeffrey Epstein. Plus: how plastic production has doubled

The Guardian - UK

Our Picks

Couple goes to Texas Roadhouse for their gender reveal. Then they enlist their server’s help—people have thoughts: ‘She was more excited than y’all’

Nothing says hitting an intimate family milestone quite like outsourcing your baby’s gender reveal to a Texas Roadhouse server mid-shift. One couple decided steaks and sweet tea were the perfect backdrop for their big moment. The result? The underwhelming reveal, a couple reacting as they’d just been told their table…

The Mary Sue

Colbert on RFK Jr’s Maha workout video: ‘Senior softcore that feels like dropping acid’

Late-night host discussed the health secretary’s bizarre clip and a poll saying 47% of Americans think Trump is racist

The Guardian - US

Zendaya Reveals Most 'Hollywood' Habit, Which Is All About Spoiling Her Dog, Noon

Spoiling your pet baby with fancy things is one of the most underrated pleasures of life. It can sound ‘ridiculous,’ but even the ‘Spider-Man’ actress Zendaya believes the same. In a recent interaction, she confessed that one of the most ‘Hollywood’ things she has done after becoming a star is…

Times Pets

Man dines at New Jersey restaurant. Then he gets an email from CapitalOne revealing server added $100 tip: ‘Amex said you never tip this can’t be you’

We’ve all had the experience of going out to eat and spending more than you thought you would.

The Mary Sue

18yo discovers her childhood began with a national TV ’embarrassment’ – a secret that haunted her mother for 20 years

Her mother always feared her daughger would be upset by the revelation.

We Got This Covered

Chuck Negron obituary

Singer with a powerful four-octave range whose hits with Three Dog Night included Joy to the World

The Guardian - UK

Fourteen days free

Download the app

One app. One membership.
100+ trusted global sources.