Researchers find just 250 malicious documents can leave LLMs vulnerable to backdoors

October 9, 2025

4

Artificial intelligence companies have been working at breakneck speeds to develop the best and most powerful tools, but that rapid development hasn’t always been coupled with clear understandings of AI’s limitations or weaknesses. Today, Anthropic released a report on how attackers can influence the development of a large language model.

The study centered on a type of attack called poisoning, where an LLM is pretrained on malicious content intended to make it learn dangerous or unwanted behaviors. The key finding from this study is that a bad actor doesn’t need to control a percentage of the pretraining materials to get the LLM to be poisoned. Instead, the researchers found that

→ Continue reading at Engadget

Researchers find just 250 malicious documents can leave LLMs vulnerable to backdoors

Similar Articles

Most Popular

Researchers find just 250 malicious documents can leave LLMs vulnerable to backdoors

Similar Articles

The best pizza oven for 2025

Skate Story grinds its way to PlayStation Plus on December 8

Most Popular

Cancun’s Hurricane & Sargassum Seasons Officially End in as High Tourism Period Begins

This week’s best iPad deals include the base-model Apple iPad for $279

Racing Louisville’s years-long journey to the NWSL playoffs