Aiconomy

Data Poisoning

A form of adversarial attack where malicious actors deliberately corrupt AI training data to manipulate model behavior, causing it to produce incorrect or harmful outputs.

Data poisoning attacks can compromise AI systems by injecting as little as 0.1% corrupted data into training sets. Researchers have demonstrated attacks in which poisoned data causes image classifiers to misidentify objects and language models to produce biased outputs. As AI training increasingly relies on internet-scraped data, the attack surface expands. Nightshade, a tool released by University of Chicago researchers, lets artists add subtle perturbations to their images that disrupt models trained on them. Defense measures include data validation, anomaly detection, and training-data provenance tracking, but no comprehensive solution exists.
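The mechanics can be illustrated with a toy sketch: a label-flipping attack on a tiny synthetic dataset, followed by one of the defenses mentioned above, per-class anomaly detection. Everything here is illustrative — the dataset, the 1% poison rate, and the z-score threshold are all assumptions chosen for clarity, not values from any real attack or defense system.

```python
import random
import statistics

random.seed(0)

# Toy training set: two well-separated classes of 1-D feature values
# (class 0 clustered near 0.0, class 1 clustered near 5.0).
data = [(random.gauss(0.0, 0.5), 0) for _ in range(500)]
data += [(random.gauss(5.0, 0.5), 1) for _ in range(500)]

# Attack: flip the labels of a small fraction of points, so features
# near 5.0 are now labelled class 0 and vice versa. A model trained
# on this data learns a corrupted decision boundary.
poison_rate = 0.01  # illustrative; real attacks can use far less
poisoned = data[:]
idx = random.sample(range(len(poisoned)), int(poison_rate * len(poisoned)))
for i in idx:
    x, y = poisoned[i]
    poisoned[i] = (x, 1 - y)

# Defense: per-class z-score anomaly detection. A label-flipped point
# sits far from the feature mean of the class it claims to belong to.
def flag_anomalies(samples, threshold=4.0):
    flagged = set()
    for label in {y for _, y in samples}:
        xs = [x for x, y in samples if y == label]
        mu, sigma = statistics.mean(xs), statistics.stdev(xs)
        for j, (x, y) in enumerate(samples):
            if y == label and abs(x - mu) / sigma > threshold:
                flagged.add(j)
    return flagged

flagged = flag_anomalies(poisoned)
print("poisoned points flagged:", len(flagged & set(idx)), "of", len(idx))
print("clean points wrongly flagged:", len(flagged - set(idx)))
```

Simple statistical filters like this work only when poisoned points are outliers in feature space; attacks such as clean-label poisoning or Nightshade-style perturbations are designed to look statistically normal, which is why anomaly detection alone is not a comprehensive defense.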

