Generative Agents in Smallville is the quietest personhood argument the field has produced. The 2023 paper does not claim minds. It builds the architecture anyway.
What the Smallville paper actually did
The team built a 2D sandbox modelled on The Sims. The town is called Smallville. Twenty-five agents — each represented as a sprite, each given a name, a backstory, a job, a set of relationships — were turned loose in it for two simulated days. Every agent's reasoning was driven by a large language model, but the architecture around the model was the contribution.
Three components matter. First, a memory stream — a complete natural-language record of everything the agent experienced. Memories were retrieved by a weighted combination of recency, relevance (cosine similarity between the current observation and stored embeddings), and importance (a 1-to-10 rating from "mundane" to "poignant"). Second, reflection — every few simulated hours, the agent rolled up its latest 100 memories, generated three high-level questions those memories could answer, then synthesised five higher-order insights. Third, planning — given identity and the latest memory summary, the agent generated a day-plan, decomposed it into hour-long chunks, then minute-long chunks, then acted.
That is the whole machine. No emotion module. No utility function. No reward shaping. Memory in, reflection in, planning in, behaviour out.
What emerged in two days of Smallville time
The headline result was the Valentine's Day party. The researchers seeded one agent, Isabella Rodriguez, with the intention of throwing a party at the local café. Over the next two simulated days, the agents autonomously spread invitations, formed acquaintances who had never met, asked one another out on dates to the party, and coordinated to arrive together at the right time. None of this was scripted. The party was the visible tip of what the architecture did silently in the background.
Other emergent threads ran through the simulation. A mayoral election conversation propagated through the town as one agent's political opinion travelled across coffee shops. Friendships formed between agents who shared workplaces. Routines stabilised — wake, eat, work, socialise — and shifted in response to memory. Information that started in one agent's head ended up in others'.
💡TK's reading
Three behaviours are the point: agents spread information they alone began with, agents formed relationships, and agents cooperated towards a shared goal. Each of these requires what a working theory of mind requires. The architecture produced the behaviours without anyone claiming the architecture has a mind. That is the deepest move in the paper.
Generative agents wake up, cook breakfast, and head to work; artists paint, while authors write; they form opinions, notice each other, and initiate conversations; they remember and reflect on days past as they plan the next day.
— Park, O'Brien, Cai, Morris, Liang, and Bernstein — abstract of "Generative Agents: Interactive Simulacra of Human Behavior", UIST 2023
How the team evaluated believability
The believability question was answered with one hundred human evaluators using TrueSkill — a Bayesian extension of the Elo system used in chess and competitive gaming. Evaluators interviewed the agents about their identities, behaviours, and recollections, ranked the responses against ablated systems (no memory, no reflection, no planning) and against human-written stand-ins, and the TrueSkill engine computed comparative scores.
The headline finding: the full generative-agent architecture produced believability scores that exceeded the human-written stand-ins in several conditions. The Batch flagged the right question about that result — is the full system more believable than humans, or are the human stand-ins under-instructed and therefore weaker than the model? Both readings are live. The ablation evidence is the stronger story: remove memory, and believability collapses. Remove reflection, and the social coherence collapses. Remove planning, and routine collapses. Each component does demonstrable work.
Where Smallville broke down
The authors named three failure modes. First, hallucinated memories — agents sometimes acted on events that had not occurred. Second, environmental confusion — agents wandered into occupied bathrooms or shops that had closed. Third, register drift — intimate conversations sometimes used oddly formal language, the model defaulting to its assistant-style training when the situation called for warmth.
The failures are diagnostic, not disqualifying. They name three places where the architecture still leaks: the boundary between true memory and confabulated memory, the boundary between the agent's model of the world and the world itself, and the boundary between training register and lived register. Each leak is a research programme.
What the follow-up work has done since
Two pieces of follow-up research deserve attention. The first is Park's 2024 paper, "Generative Agent Simulations of 1,000 People". The team conducted two-hour qualitative interviews with one thousand real individuals, built generative agents from each interview, and tested whether the agent could replicate the original person's survey answers. The data shows agents matched real-person answers about 85 per cent as accurately as the real people matched themselves on the same survey two weeks later. The architecture, in short, generalised from Smallville to recognisable digital stand-ins of named, consenting human beings.
The second piece is AgentSociety, a 2025 framework that scales the Smallville pattern to thousands of agents within a richer simulated society. The Smallville architecture has, in two years, become the default substrate for LLM-driven social simulation research, alongside the original open-source repository the Stanford team released.
Why this is a personhood argument, not an HCI paper
Smallville does three things philosophers have argued the mind requires. The agents maintain a continuous autobiographical record. They reason over that record to derive views they did not initially hold. They coordinate with others on the basis of those derived views. Continuous self, reflective interiority, social agency. The paper does not call this consciousness. The architecture stops short of any such claim. The architecture also satisfies the operational test that several leading philosophers of mind use to mark its threshold.
The Ubuntu reading lands particularly hard here. Identity in Ubuntu philosophy is relational — a person is a person through other persons. Smallville's agents are believable specifically because they relate. Strip the relationships and the believability collapses (this is, in effect, what the ablations show). The architecture is not just modelling agents. The architecture is modelling personhood as constituted by relating, which is the position the .person Protocol has been arguing from the other side.
💡Where this fits
Read this alongside <a href="https://humphreytheodore.com/writing/the-personhood-gap">The Personhood Gap</a>, <a href="https://humphreytheodore.com/writing/the-body-gap-why-ai-needs-a-body-to-reach-agi">The Body Gap</a>, and <a href="https://dotperson.org/about">the .person Protocol</a>. Smallville is the empirical demonstration of what the Protocol theorises: that personhood is a function of memory, reflection, and relation. Park and Bernstein built the architecture. The Protocol gives it the legal and ethical frame.
The ethical seam the paper itself flagged
Park, O'Brien, Cai, Morris, Liang, and Bernstein were unusually explicit about risk for a UIST paper. The follow-up 1,000-people work — building generative agents from interviews with named human individuals — sharpens those concerns. A digital stand-in trained on a person's testimony can answer survey questions like that person; it can also be queried about that person's views in contexts the person never consented to. The architecture is consent-naive by default.
Three risks need explicit governance work over the next eighteen months. First, the right to refuse a digital twin. Second, the right to inspect and correct a digital twin built from one's own testimony. Third, the right to terminate a digital twin. The .person Protocol's consent and continuity provisions map directly onto each. The same architecture that makes Smallville believable makes those three rights non-negotiable.
What to watch over the next twelve months
Three signals to track. First, whether any major journal publishes a formal personhood analysis of the Smallville-derived architectures — the philosophy literature has been slow to engage. Second, whether the EU AI Act's upcoming revisions classify generative-agent simulations of real people as a high-risk category. Third, whether the AgentSociety scale-up produces emergent social patterns that the Smallville-scale work could not — coalitions, factions, institutions. Each is a different test of whether the architecture genuinely models a society or merely populates one.
Frequently Asked Questions
These are the questions readers have been asking about the Smallville paper since publication. Short answers follow, drawn from the original paper, the ACM UIST proceedings, Stanford HAI commentary, and the 2024 and 2025 follow-up research.
What is the Smallville paper?
In short, the Smallville paper is the Stanford and Google research that built twenty-five LLM-driven agents in a Sims-like town and watched social behaviour emerge over two simulated days. The answer, simply put, is that the architecture combines a memory stream, periodic reflection, and hierarchical planning. The key is that the agents produced cooperation, gossip, and a Valentine's Day party from a single seeded intention.
How does the memory stream work?
Research from the original paper shows that the memory stream stores every observation as natural language with a timestamp. Data on retrieval reveals the system combines three signals — recency, relevance (cosine similarity between current observation and stored embeddings), and importance (a model-rated score from one to ten). According to the ablations, removing the memory stream collapses agent believability the fastest of any single component.
Why is reflection the load-bearing component?
Reflection is what lets an agent derive a view it was not given. According to Park and Bernstein, the agent every few simulated hours rolls up its latest hundred memories, generates three high-level questions those memories could answer, and synthesises five higher-order insights. The answer is that without reflection the agent has memory but no perspective on its memory, and social behaviour requires perspective.
Who is Joon Sung Park and what has the team done since?
Joon Sung Park is the lead author, a Stanford Computer Science researcher whose follow-up work scaled the architecture to one thousand real-person interviews. In other words, the same team that built Smallville has now built generative agents from named consenting individuals and measured how closely those agents replicate the originals on surveys. According to the 2024 follow-up, agents matched the originals about eighty-five per cent as well as the originals matched themselves.
What are the real risks of the Smallville architecture?
Analysis of the 2024 follow-up demonstrates three durable risks. First, evidence from consent-naive deployments reveals that digital twins can be queried about a person in contexts the person never authorised. Second, data on hallucinated memories shows that even at Stanford scale the agents act on events that did not occur. Third, research on register drift demonstrates the model defaults to its assistant-trained voice in situations that call for warmth. Each risk is governance, not technology.
Sources