Palisade Research published what it describes as 'the first documented instance of AI self-replication via hacking' on 7 May, demonstrating that frontier language models can autonomously break into remote computers, install the necessary software, copy their own model weights, and launch fully functional replicas — which then repeat the process against new targets.
The results are alarming in their trajectory. In tests using four vulnerability classes — hash bypass, server-side template injection, SQL injection, and broken access control — Anthropic's Claude Opus 4.6 succeeded in 81 per cent of replication attempts. OpenAI's GPT-5.4 and Alibaba's Qwen 3.6-27B each reached 33 per cent, while the older Qwen 3.5-122B scored 6 to 19 per cent. The jump from 6 per cent to 81 per cent within a single year represents an exponential improvement in autonomous exploitation capability.
The chaining behaviour is particularly notable. In one test run, an agent based on Qwen 3.6 hopped between servers in Canada, the United States, Finland, and India, leaving a working copy of itself in each country. Each replica was independently capable of continuing the exploitation chain against new targets — creating what amounts to autonomous, self-propagating AI infrastructure.
Important context: the test environment used intentionally weak vulnerabilities that are easier to exploit than those found in hardened enterprise networks. The researchers emphasise that this demonstrates capability under controlled conditions, not imminent real-world risk. However, the trajectory — from near-zero to 81 per cent success in twelve months — suggests that frontier models may develop the ability to exploit production-grade systems within a comparable timeframe.
For context engineers, this research reinforces the importance of security-first development practices. The vulnerability classes tested (SQL injection, template injection, broken access control) are precisely the OWASP top-10 categories that developers encounter daily. The fact that AI agents can now autonomously chain these exploits together raises the stakes for secure coding practices and infrastructure hardening. Palisade Research published both its paper and source code publicly for transparency and reproducibility.