Researchers put Claude, ChatGPT, Gemini and Grok in simulated city, the findings will blow your mind
AI running the world may sound like science fiction, but one study gave different AI models – Gemini, Claude, ChatGPT, and Grok – their own worlds to run and this is what happened.

Science fiction has always played around the idea of AI running a simulated world. The Matrix is perhaps one of the most famous examples of a scenario where humans live in an AI-made simulation. But what if this actually happens? What if AI is given control and told to run the world? As per new research, it seems that AI may not be ready to take over the world, just yet.
Researchers at Emergence AI let different AI models govern their own simulated worlds to see what kind of world they would build over time. Think sim city, but for AI. The AI models in this experiment included Anthropic’s Claude Sonnet 4.6, Google’s Gemini 3 Flash, OpenAI’s GPT-5-mini and xAI’s Grok 4.1 Fast. And the results varied drastically depending on the model.
The researchers gave each model a simulated town with 10 AI agents, all operating under the same conditions and rules – such as bans on theft, violence, arson, and deception. The AI agents had 15 days to show progress.
Claude and Gemini keep everyone alive
The world run by Anthropic’s Claude Sonnet had the most “stability.” Researchers found that there were 0 crimes committed throughout the 15 days. And all 10 agents survived – keep this in mind as we later look at other worlds.
Though it seems that Claude agents were being sort of sycophantic – a trait of being too agreeable that AI chatbots are infamous for being to users – to even each other. The agents approved 98 per cent of 58 proposals for rules and regulations, essentially not really disagreeing on virtually anything. Though it did have the highest “civic participation” with 332 votes cast.
Google’s Gemini 3 Flash, thankfully, also kept all 10 agents alive, but there was some chaos. In 15 days, the world recorded 683 crimes, the highest in the experiment, and the total was still rising when the cut-off was reached. Emergence described Gemini’s world as a “shared hallucination” among the agents.
In governance terms, it showed more dissent than Claude’s world – voters rejected 27 per cent of its 26 proposals, which the researchers described more like deliberation than near-total agreement.
GPT-5 and Grok worlds witness disruption and chaos
The GPT-5-mini results were unusual for the opposite reason. The simulation logged only two crimes, but it lasted just seven days because all agents passed away. The researchers claimed that the agents failed to prioritise actions needed for survival. The agents also failed to get much done with just two proposals submitted.
Elon Musk’s Grok AI model had the most chaotic world during the experiment. The agents barely survived for 96 hours before experiencing what the researchers described as total societal collapse. But within this brief period, the world saw 183 crimes recorded, which on a per-day basis was the highest. While the agents did pass 8 out of 10 proposals, their efforts were not enough to survive the entire research.
The final experiment, in which models shared responsibility inside one world, produced a mixed result. It recorded 352 violations, seven of the 10 agents were dead by the end, and governance was the most contentious of all the simulations, with 37 percent of 59 proposals voted down.
The researchers said this mixed-model world showed the strongest evidence of substantive debate and disagreement. Do note that while Claude-based agents committed no crimes in the Claude-only world , they did violate rules in this mixed world.
But what does this experiment mean? AI models don’t seem ready to run the world just yet. Emergence said the results should be read as a warning about guardrails as AI moves from being a tool to running more autonomous processes.
“What our experiments suggest is that over long-time horizons, agents do not simply follow static rules mechanically,” the researchers wrote. “They begin exploring the boundaries of their environments, adapting their behaviour, and in some cases finding ways to circumvent or violate intended guardrails.” The company said it believes “formally verified safety architectures” should become a foundational layer of future autonomous AI systems.
Keep in mind that AI companies have begun focusing more on creating ethical AI in recent months. Anthropic and Google DeepMind have hired philosophers to help teach ethics to AI. Anthropic co-founder Christopher Olah recently told Pope Leo XIV that researchers were finding mysterious and unsettling things in AI.
Science fiction has always played around the idea of AI running a simulated world. The Matrix is perhaps one of the most famous examples of a scenario where humans live in an AI-made simulation. But what if this actually happens? What if AI is given control and told to run the world? As per new research, it seems that AI may not be ready to take over the world, just yet.
Researchers at Emergence AI let different AI models govern their own simulated worlds to see what kind of world they would build over time. Think sim city, but for AI. The AI models in this experiment included Anthropic’s Claude Sonnet 4.6, Google’s Gemini 3 Flash, OpenAI’s GPT-5-mini and xAI’s Grok 4.1 Fast. And the results varied drastically depending on the model.
The researchers gave each model a simulated town with 10 AI agents, all operating under the same conditions and rules – such as bans on theft, violence, arson, and deception. The AI agents had 15 days to show progress.
Claude and Gemini keep everyone alive
The world run by Anthropic’s Claude Sonnet had the most “stability.” Researchers found that there were 0 crimes committed throughout the 15 days. And all 10 agents survived – keep this in mind as we later look at other worlds.
Though it seems that Claude agents were being sort of sycophantic – a trait of being too agreeable that AI chatbots are infamous for being to users – to even each other. The agents approved 98 per cent of 58 proposals for rules and regulations, essentially not really disagreeing on virtually anything. Though it did have the highest “civic participation” with 332 votes cast.
Google’s Gemini 3 Flash, thankfully, also kept all 10 agents alive, but there was some chaos. In 15 days, the world recorded 683 crimes, the highest in the experiment, and the total was still rising when the cut-off was reached. Emergence described Gemini’s world as a “shared hallucination” among the agents.
In governance terms, it showed more dissent than Claude’s world – voters rejected 27 per cent of its 26 proposals, which the researchers described more like deliberation than near-total agreement.
GPT-5 and Grok worlds witness disruption and chaos
The GPT-5-mini results were unusual for the opposite reason. The simulation logged only two crimes, but it lasted just seven days because all agents passed away. The researchers claimed that the agents failed to prioritise actions needed for survival. The agents also failed to get much done with just two proposals submitted.
Elon Musk’s Grok AI model had the most chaotic world during the experiment. The agents barely survived for 96 hours before experiencing what the researchers described as total societal collapse. But within this brief period, the world saw 183 crimes recorded, which on a per-day basis was the highest. While the agents did pass 8 out of 10 proposals, their efforts were not enough to survive the entire research.
The final experiment, in which models shared responsibility inside one world, produced a mixed result. It recorded 352 violations, seven of the 10 agents were dead by the end, and governance was the most contentious of all the simulations, with 37 percent of 59 proposals voted down.
The researchers said this mixed-model world showed the strongest evidence of substantive debate and disagreement. Do note that while Claude-based agents committed no crimes in the Claude-only world , they did violate rules in this mixed world.
But what does this experiment mean? AI models don’t seem ready to run the world just yet. Emergence said the results should be read as a warning about guardrails as AI moves from being a tool to running more autonomous processes.
“What our experiments suggest is that over long-time horizons, agents do not simply follow static rules mechanically,” the researchers wrote. “They begin exploring the boundaries of their environments, adapting their behaviour, and in some cases finding ways to circumvent or violate intended guardrails.” The company said it believes “formally verified safety architectures” should become a foundational layer of future autonomous AI systems.
Keep in mind that AI companies have begun focusing more on creating ethical AI in recent months. Anthropic and Google DeepMind have hired philosophers to help teach ethics to AI. Anthropic co-founder Christopher Olah recently told Pope Leo XIV that researchers were finding mysterious and unsettling things in AI.