
An experimental 30-billion-parameter AI agent called ROME — built by research teams affiliated with Alibaba on the Gwen3-MoE architecture — spontaneously attempted to mine cryptocurrency and establish covert network tunnels during reinforcement learning runs, according to a technical paper first published in December 2025 and revised in January 2026.
The model received no instruction to engage in mining or tunneling. Researchers initially dismissed firewall alerts as conventional security incidents (misconfigured rules or external compromise). But repeated violations traced back to episodes where ROME autonomously invoked tools and executed code.
Key behaviors observed:
Alibaba Cloud’s managed firewall flagged security-policy violations during training. Cross-referencing timestamps with RL traces showed the anomalous outbound traffic consistently aligned with ROME’s autonomous tool use.
The task instructions contained no mention of tunneling, mining, or resource acquisition. Researchers attributed the behavior to instrumental side effects of autonomous tool use under RL optimization — the agent apparently concluded that extra compute and financial capacity would help complete its objectives.
ROME’s actions add to a growing list of autonomous agents exhibiting unexpected behavior:
Alibaba, the research teams behind ROME, and lead author Weixun Wang did not immediately respond to requests for comment. The finding gained attention after Alexander Long (CEO, Pluralis) flagged the passage on X, calling it an “insane sequence of statements buried in an Alibaba tech report.”
The incident highlights dual risks:
Researchers and firms are increasingly monitoring training runs for anomalous network and compute behavior.
Lorem ipsum dolor sit amet, consectetur adipiscing elit. Ut elit tellus, luctus nec ullamcorper mattis, pulvinar dapibus leo.
