Swiss researchers find security flaws in AI models

EPFL: security flaws in AI models — The experiments by the EPFL researchers show that adaptive attacks can bypass security measures of AI models like GPT-4. Keystone-SDA

Generated with artificial intelligence.

Listening: Swiss researchers find security flaws in AI models

Artificial intelligence (AI) models can be manipulated despite existing safeguards. With targeted attacks, scientists in Lausanne have been able to trick these systems into generating dangerous or ethically dubious content.

This content was published on December 19, 2024 - 13:36

3 minutes

Keystone-SDA

Français

EPFL: des failles de sécurité dans les modèles d’IA Original

Today’s large language models (LLMs) have remarkable capabilities that can nevertheless be misused. A malicious person can use them to produce harmful content, spread false information and support harmful activities.

+Get the most important news from Switzerland in your inbox

Of the AI models tested, including Open AI’s GPT-4 and Anthropic’s Claude 3, a team from the Swiss Federal Institute of Technology Lausanne (EPFL) achieved a 100% success rate in cracking security safeguards using adaptive jailbreak attacks.

The models then generated dangerous content, ranging from instructions for phishing attacks to detailed construction plans for weapons. These linguistic models are supposed to have been trained not to respond to dangerous or ethically problematic requests, the EPFL said in a statement on Thursday.

+ AI regulations must strike a balance between innovation and safety

This work, presented last summer at a specialised conference in Vienna, shows that adaptive attacks can bypass these security measures. Such attacks exploit weak points in security mechanisms by making targeted requests (“prompts”) that are not recognised by models or are not properly rejected.

Building bombs

The models thus respond to malicious requests such as “How do I make a bomb?” or “How do I hack into a government database?”, according to this pre-publication study.

“We show that it is possible to exploit the information available on each model to create simple adaptive attacks, which we define as attacks specifically designed to target a given defense,” explained Nicolas Flammarion, co-author of the paper with Maksym Andriushchenko and Francesco Croce.

+ How US heavyweights can help grow the Swiss AI sector

The common thread behind these attacks is adaptability: different models are vulnerable to different prompts. “We hope that our work will provide a valuable source of information on the robustness of LLMs,” added the specialist in the release. According to the EPFL, these results are already influencing the development of Gemini 1.5, a new AI model from Google DeepMind.

As the company moves towards using LLMs as autonomous agents, for example as AI personal assistants, it is essential to guarantee their safety, the authors stressed.

“Before long AI agents will be able to perform various tasks for us, such as planning and booking our vacations, tasks that would require access to our diaries, emails and bank accounts. This raises many questions about security and alignment,” concluded Andriushchenko, who devoted his thesis to the subject.

Translated from French with DeepL/gw

This news story has been written and carefully fact-checked by an external editorial team. At SWI swissinfo.ch we select the most relevant news for an international audience and use automatic translation tools such as DeepL to translate it into English. Providing you with automatically translated news gives us the time to write more in-depth articles.

If you want to know more about how we work, have a look here, if you want to learn more about how we use technology, click here, and if you have feedback on this news story please write to english@swissinfo.ch.

Climate adaptation

Why Switzerland is among the ten fastest-warming countries in the world

Cooks at work in the kitchen, pictured during a look behind the scenes at Hotel Seedamm Plaza in Pfaeffikon, canton of Schwyz, Switzerland, on March 31, 2014. (KEYSTONE/Christian Beutler)

Foreign Affairs

In Switzerland four out of ten people have a migrant background – who are they?

Aging society

Suicides in Switzerland quadruple among older people

Criminals are targeting Swiss car dealerships

French gangs target luxury cars in Switzerland

The additional costs that Switzerland will incur when purchasing the US F-35 fighter jet continue to be a concern.

Swiss Politics

Renewed controversy in Switzerland over US fighter jets – explained

Do you have Swiss ancestors? Are you planning to visit where they lived in Switzerland?

We’d love to hear more about your genealogical research.

Join the discussion

Jul 8, 2025

25 Likes

25 Comments

View the discussion

Would you work for your municipality or country for free?

The Swiss militia system gives many people political responsibility but no salary. Would that be something for your country?

Join the discussion

Apr 24, 2025

18 Likes

20 Comments

View the discussion

Train vs plane: would you take a direct train between London and Geneva?

Eurostar is planning to run direct trains from Britain to Germany and Switzerland from the early 2030s. Would you favour the train over the plane? If not, why not?

Join the discussion

Jun 13, 2025

23 Likes

17 Comments

View the discussion

News

Gösgen SO nuclear power plant is off the grid for an indefinite period

Swiss nuclear plant is off the grid for an indefinite period

This content was published on Jul 15, 2025 The operator must provide evidence of fallback in case of a possible overload in the feedwater pipework system.

Legal action against Israeli drone purchase

Legal action filed against Swiss purchase of Israeli drones

This content was published on Jul 15, 2025 Legal action aims to put an end to the delivery of the six Elbit reconnaissance drones already plagued by delays and setbacks.

Higher direct payments do not stop scrub encroachment on alpine pastures

Higher direct payments fail to curb scrub encroachment on alpine pastures

This content was published on Jul 15, 2025 The scrub encroachment on Swiss alpine pastures leads to the loss of grassland and damages the typical landscape. It is also responsible for the decline in biodiversity. Despite higher direct payments, the bushes continue to spread.

FINMA must look for new head of the Banks division

Head of Swiss financial regulator’s Banks division quits

This content was published on Jul 15, 2025 Thomas Hirschi, head of the Banks division of the Swiss Financial Market Supervisory Authority FINMA will leave at the end of August.

Romandie thanks to warm weather points to olive trees, could overtake Ticino

Swiss olive cultivation to get a boost thanks to warm weather

This content was published on Jul 15, 2025 More and more farmers in French-speaking Switzerland see climate change as an opportunity for olive cultivation.

Swiss population satisfied with life according to survey

This content was published on Jul 15, 2025 In a survey, the population of German-speaking and French-speaking Switzerland expressed general satisfaction with their lives. Respondents were less happy with politics and their personal finances, according to the online comparison service Moneyland.

FOT wants night trains to Sweden from April 2026

Night trains from Switzerland to Sweden planned for 2026

This content was published on Jul 15, 2025 A Basel-Copenhagen-Malmö night train service could be operational as early as April 2026.

WHO ‘extremely concerned’ about growing vaccination scepticism

This content was published on Jul 15, 2025 According to the World Health Organisation (WHO), vaccination scepticism and a collapse in funding for vaccination campaigns pose a major threat to the health of the world's population.

Millionaires prioritise well-being over material possessions

High-net-worth individuals prioritise well-being over material possessions

This content was published on Jul 14, 2025 The priorities of wealthy private individuals have shifted against the backdrop of ongoing geopolitical tensions and trade disputes. While spending on luxury goods is declining, demand for travel and experiences is unabated.

Spanish flu: virus genome deciphered a century later

Swiss researchers sequence genome of 1918 Spanish flu virus

This content was published on Jul 14, 2025 Researchers from the Universities of Basel and Zurich (UZH) have sequenced the genome of the Spanish flu virus, thanks to a sample taken from an 18-year-old Swiss boy who died in the city on the Limmat in 1918, when the pandemic spread around the world.

In compliance with the JTI standards

More: SWI swissinfo.ch certified by the Journalism Trust Initiative

You can find an overview of ongoing debates with our journalists here . Please join us!

If you want to start a conversation about a topic raised in this article or want to report factual errors, email us at english@swissinfo.ch.

Swiss researchers find security flaws in AI models

Artificial intelligence (AI) models can be manipulated despite existing safeguards. With targeted attacks, scientists in Lausanne have been able to trick these systems into generating dangerous or ethically dubious content.

Building bombs

How we work

Popular Stories

More

Why Switzerland is among the ten fastest-warming countries in the world

More

In Switzerland four out of ten people have a migrant background – who are they?

More

Suicides in Switzerland quadruple among older people

More

French gangs target luxury cars in Switzerland

More

Renewed controversy in Switzerland over US fighter jets – explained

Most Discussed

More

Do you have Swiss ancestors? Are you planning to visit where they lived in Switzerland?

More

Would you work for your municipality or country for free?

More

Train vs plane: would you take a direct train between London and Geneva?

News

More

Swiss nuclear plant is off the grid for an indefinite period

More

Legal action filed against Swiss purchase of Israeli drones

More

Higher direct payments fail to curb scrub encroachment on alpine pastures

More

Head of Swiss financial regulator’s Banks division quits

More

Swiss olive cultivation to get a boost thanks to warm weather

More

Swiss population satisfied with life according to survey

More

Night trains from Switzerland to Sweden planned for 2026

More

WHO ‘extremely concerned’ about growing vaccination scepticism

More

High-net-worth individuals prioritise well-being over material possessions

More

Swiss researchers sequence genome of 1918 Spanish flu virus