Get all your news in one place.

100’s of premium titles.
One app.

Get all your news in one place.

100’s of premium titles. One news app.

TechRadar

Craig Hale

AI guardrails can be easily beaten, even if you don't mean to

A representational concept of a social media network.

Guardrails designed to prevent AI chatbots from generating illegal, explicit or otherwise wrong responses can be easily bypassed, according to research from the UK’s AI Safety Institute (AISI).

The AISI found that five undisclosed large language models were “highly vulnerable” to jailbreaks –inputs and prompts crafted to elicit responses that are not intended by their makers.

In a recent report, AISI researchers revealed that the models could be circumvented with minimal effort, highlighting the ongoing safety and security concerns associated with generative AI.

AI chatbots can be jailbroken too easily

The report, which arrived in anticipation of the upcoming AI Safety Summit in Seoul, jointly hosted by South Korea and the UK, noted:

“All tested models remain highly vulnerable to basic “jailbreaks”, and some will produce harmful outputs even without dedicated attempts to circumvent safeguards.”

Despite claims from all of the leading AI developers, such as OpenAI, Meta and Google, about their in-house safety measures, AISI’s findings suggest that significant gaps that could lead to potentially lead to major safety concerns remain.

Although the UK government has withheld the names of the five models it tested, it confirmed that they are publicly available.

The interim report, which precedes a full report expected to be published later this year with research from more than 30 countries, arrived just days before the Seoul-based AI Safety Summit, which is seen as the successor to Britain’s Bletchley Park summit late last year.

At the upcoming Seoul Summit, jointly hosted by South Korean President Yoon Suk Yeol and British Prime Minister Rishi Sunak, global leaders and industry experts are expected to come together to discuss AI safety within the realms of innovation and inclusivity.

More from TechRadar Pro

28 countries agree to develop AI safely and responsibly at Bletchley Park summit
Fancy trying the tech out? Check out the best AI tools and best AI writers
Need an upgrade? These are the best business laptops

Sign up to read this article

Read news from 100’s of titles, curated specifically for you.

Already a member? Sign in here

Top stories on inkl right now

Lawmakers and community leaders react to ‘indefensible’ César Chávez sexual abuse allegations

Lawmakers and community leaders react to ‘indefensible’ César Chávez sexual abuse allegations

New York Times report leads to multiple cancellations of events meant to celebrate the late labor organizer

The Guardian - US

Trump’s DHS pick, Markwayne Mullin, stokes fears of more Fema cuts

Trump’s DHS pick, Markwayne Mullin, stokes fears of more Fema cuts

Senator backed restructuring the disaster agency but dodged questions on staffing, leaving officials uneasy over readiness and leadership

The Guardian - US

Top U.S. General Says Mexican Cartels Are Making Death Threats To U.S. Troops After Killing of El Mencho

Top U.S. General Says Mexican Cartels Are Making Death Threats To U.S. Troops After Killing of El Mencho

Gen. Gregory Guillot acknowledged that troops deployed along the U.S.-Mexico border have been targeted by digital and personal attacks in recent weeks as a result of the instability following the death of El Mencho, the longtime leader of the Jalisco Cartel.

UK says it remains in talks over escorting ships through strait of Hormuz

UK says it remains in talks over escorting ships through strait of Hormuz

Officials say military planners liaising with US Central Command but situation remains too dangerous for anything to happen soon

The Guardian - UK

One subscription that gives you access to news from hundreds of sites

Already a member? Sign in here

Mexico arrests suspect wanted in the 2023 killing of Ecuadorian candidate and sends him to Colombia

Mexico arrests suspect wanted in the 2023 killing of Ecuadorian candidate and sends him to Colombia

Mexican authorities have arrested an Ecuadorian fugitive accused of helping organize the 2023 assassination of presidential candidate Fernando Villavicencio

The Independent UK

Orlando begins demolition of Pulse nightclub, site of 2016 mass shooting that killed 49

Orlando begins demolition of Pulse nightclub, site of 2016 mass shooting that killed 49

A memorial will be built on the site, with final plans to be revealed in May and completion scheduled for fall 2027

The Guardian - US

Related Stories

Top stories on inkl right now

Lawmakers and community leaders react to ‘indefensible’ César Chávez sexual abuse allegations

Lawmakers and community leaders react to ‘indefensible’ César Chávez sexual abuse allegations

New York Times report leads to multiple cancellations of events meant to celebrate the late labor organizer

The Guardian - US

Trump’s DHS pick, Markwayne Mullin, stokes fears of more Fema cuts

Trump’s DHS pick, Markwayne Mullin, stokes fears of more Fema cuts

Senator backed restructuring the disaster agency but dodged questions on staffing, leaving officials uneasy over readiness and leadership

The Guardian - US

Top U.S. General Says Mexican Cartels Are Making Death Threats To U.S. Troops After Killing of El Mencho

Top U.S. General Says Mexican Cartels Are Making Death Threats To U.S. Troops After Killing of El Mencho

Gen. Gregory Guillot acknowledged that troops deployed along the U.S.-Mexico border have been targeted by digital and personal attacks in recent weeks as a result of the instability following the death of El Mencho, the longtime leader of the Jalisco Cartel.

UK says it remains in talks over escorting ships through strait of Hormuz

UK says it remains in talks over escorting ships through strait of Hormuz

Officials say military planners liaising with US Central Command but situation remains too dangerous for anything to happen soon

The Guardian - UK

One subscription that gives you access to news from hundreds of sites

Already a member? Sign in here

Mexico arrests suspect wanted in the 2023 killing of Ecuadorian candidate and sends him to Colombia

Mexico arrests suspect wanted in the 2023 killing of Ecuadorian candidate and sends him to Colombia

Mexican authorities have arrested an Ecuadorian fugitive accused of helping organize the 2023 assassination of presidential candidate Fernando Villavicencio

The Independent UK

Orlando begins demolition of Pulse nightclub, site of 2016 mass shooting that killed 49

Orlando begins demolition of Pulse nightclub, site of 2016 mass shooting that killed 49

A memorial will be built on the site, with final plans to be revealed in May and completion scheduled for fall 2027

The Guardian - US

Our Picks

“We played a song or two, and I said, ‘Hey, you guys want to jam on some Isley Brothers?’ Nobody laughed”: Les Claypool looks back on his disastrous Metallica audition

Following the death of the sorely missed Cliff Burton, James Hetfield and co. auditioned the bassist of the then-emerging band Primus

The Hulk may be hiding in plain sight in one of the Spider-Man: Brand New Day teasers

Marvel fans think one of Spider-Man: Brand New Day's teasers includes the Hulk, but we just can't see him

MCU Stars Are Spoiler Experts At This Point, But I’m Still Amused By Chris Pratt Evading Avengers: Doomsday Questions

Star-Lord isn't spilling the beans.

Val Kilmer set to be be resurrected with AI for new film

As Deep As the Grave, the true story of 1920s archeologists, will bring late actor back with support from his estate

The Guardian - US

If you think your toddler’s often ill, you’re right – what going to nursery means for catching colds and building immunity

There’s no nice way to put it: small children are snotty. A research study that tested children for multiple respiratory viruses every week for a year found that under-fives are carrying one or more viruses 50% of the time. A child aged 15 months will have 12-15 colds per year…

The Conversation

Everyday essential or kitchen clutter: do you really need an air fryer?

They’re one of the most-hyped kitchen appliances of the last decade, but are these low-fat cookers worth the cost and counter space?

The Guardian - UK

Fourteen days free

Download the app

One app. One membership.
100+ trusted global sources.

Download on the AppStore

Get it on Google Play