When the AI Made Up a Story About Our Founder

Through the lens of creatorcriticpublicconsumer craftindustrymarket

The story you are about to read is true, recent, and the kind of cautionary tale worth sharing because it illustrates exactly how AI hallucinations can be prevented, caught, and stopped before they reach readers, and what happens in operations that do not take those precautions.

What happened

While we were preparing the second article in this series, Is AI Affecting Artists’ Livelihoods?, something went wrong in the AI persona-commentary generation step.

A short note on how our articles are produced. Typical of our Collectively Enhanced Multiple Intelligence (CEMI.ai), of which Airtistic.ai is part, is our human+AI collaboration model. Most articles and content are reviewed and commented on by both human and AI personas. In Airtistic.ai this means commentary from five resident AI personas. One of those personas is my own digital twin (or AI clone, if you prefer that term). The personas are how we surface multiple perspectives on the same question; my digital twin lets me have a written voice across the network without having to draft every individual commentary myself. The system has its own guardrails, the most important of which is a prohibition on fabricated personal anecdotes attributed to real people.

That guardrail was bypassed. A recent upgrade to our AI Personas system had introduced a regression in which the anti-fabrication check was being skipped on certain persona-commentary generation paths. As a result, the draft commentary attributed to my digital twin on the livelihoods article was generated with a fully fabricated personal anecdote: an uncle in Bogotá who had run a small business painting backgrounds for commercial photography studios in the 1980s, was put out of business by Adobe Illustrator and affordable colour photocopiers in 1985, and died working as a security guard at a shopping mall.

None of it was real. I have no such uncle. The studio business never existed. The death and the bitterness were invented. The story was structurally plausible — it followed the same arc as the displacement narrative the article was making — but it attached a fabricated biography to a real living person, namely me.

The fabrication was caught by our editorial review process — specifically, by the human-in-the-loop pass that every article goes through before publication, against the documented persona canon. When the reviewer flagged the anecdote to me, I sent the team an unambiguous message:

“This is not acceptable. NEVER MAKE UP ANECDOTES.”

We replaced the fabricated story in my AI-persona commentary with a real one — my grandfather was a blacksmith whose trade was transformed by the arrival of the automobile — corrected the regression in the AI Personas guardrail, and are publishing this article to explain how the failure happened, how the layered audit caught it before publication, and what every operation publishing AI-assisted content should learn from a near-miss that most operations would never see.

Why this happens

Large language models do not, in any meaningful sense, distinguish between what is true about the world and what is structurally plausible. When asked to write in the voice of a real person, a sufficiently capable model will produce whatever pattern best fits the rhetorical context — including invented biographical detail, invented uncles, invented friends, invented industry statistics, and invented company partnerships. The model has no internal flag for this is real versus this is plausible-shaped. Both feel the same from inside the generation process.

The technical name for this failure is hallucination, and the literature on it is now substantial. The 2021 paper “On the Dangers of Stochastic Parrots” by Bender, Gebru, McMillan-Major and Shmitchell was the first widely-read warning that fluent text from large language models is not the same as truthful text. The 2023 Mata v. Avianca case — in which a New York lawyer cited six entirely fabricated court cases that ChatGPT had invented for him — was the first widely-reported real-world consequence. The 2024 Moffatt v. Air Canada ruling, holding the airline financially responsible for promises made by its customer-service chatbot, was the first time a tribunal held a company liable for what its AI told a customer.

The lesson from all of these — and from our own incident — is the same: AI systems left without guardrails will produce plausible falsehoods with the same fluency as they produce truths. The failure mode is structural, not occasional. It must be designed against.

Human in the loop

The first line of defence — the one that held in this case — is human editorial review. Every article in this series passes through human eyes before publication. The fabricated uncle story passed through that review, and the reviewer flagged it: not because the prose looked wrong (it did not; it was structurally indistinguishable from the surrounding sourced content) but because our editorial protocol explicitly requires that personal anecdotes attributed to named real people be checked against the persona’s documented canon. The anecdote did not match the canon. The reviewer caught it.

That is what human-in-the-loop is for. Not to catch the obvious failures — those mostly catch themselves — but to catch the structurally plausible ones, the ones that read like good work-product and look identical to good work-product, by having an explicit protocol against which the work-product is checked. The reviewer was checking historical references (Public Enemy 1988, Wendy Carlos 1968, Daguerre 1839) and cited legal cases (Andersen v. Stability AI, Getty v. Stability AI). The reviewer was also checking personal anecdotes against canon, because the protocol said to.

This is the truth about human-in-the-loop review: it works for the specific failure modes you have explicitly trained your reviewers to catch. It does not protect against the failure modes you have not anticipated. Every editorial pipeline that catches AI hallucination does so through a combination of what the reviewer notices and what the reviewer is explicitly trained to verify. Both matter, and the second matters more than is usually acknowledged.

For boutique content production — long-form essays, opinion pieces, branded editorial, museum-grade catalog text — a properly-trained human-in-the-loop can catch most fabrications, provided the loop is slow enough and the reviewer is explicitly looking. The cost is throughput: a 2,000-word article with vetted sources and reviewed persona takes is a half-day of editorial work, minimum. That is sustainable for sites like ours, which publish a few opinion pieces per month. It is not sustainable for the rest of the AI-content economy, which is producing millions of articles per day at marginal cost.

When human-in-the-loop cannot scale

A newsroom or content farm publishing hundreds of pieces per day cannot run each one through the same editorial scrutiny we apply here. Neither can a marketing department generating thousands of variants per campaign, nor a corporate communications team drafting weekly memos in dozens of voices, nor an educational platform serving personalized lessons to millions of students.

For these contexts, human-in-the-loop becomes a bottleneck the production process simply cannot afford. The temptation — already widely observed in 2024-2026 — is to remove the bottleneck and ship unedited. The result is what has come to be called AI slop: fluent, plausible, structurally competent content that often contains fabrications nobody notices because nobody is looking.

The economic incentive to skip the audit is strong. The visible cost of skipping it is low. The downstream cost falls on readers, and on the named people whose biographies the model casually rewrites.

Systemic guardrails: what we use

At Airtistic.ai, and across the CEMI network, our approach combines human-in-the-loop with a set of automated and procedural guardrails that work even when the human reviewer misses something. The combination is what produced the catch on this incident — the editorial reviewer noticed because the standard I had set for the series was explicit enough about anecdote-versus-canon that the violation was visible against the protocol, not against general taste.

The components, in order of how early in the process they intervene:

A documented persona canon. Every persona we write under (Carlos, Mira, Paletta, Pixelle, Airte) has a documented short and long biography stored in our centralized persona registry. The canon is authored and approved by the persona owner; for real people it is curated by them directly. We treat the canon as the only source for biographical claims.

A constrained reference corpus. When writing articles, we provide the model with a curated list of vetted sources and the URLs and citations for each. The model is asked to ground claims in those sources, not in its general training memory. This is sometimes called retrieval-augmented generation in the technical literature; we call it knowing what we are actually citing.

Explicit anti-fabrication guidelines. These are loaded into every persona’s system prompt and into our editorial review checklists. They name six hard categories of fabrication — personal anecdotes, statistics, named reports, superlatives, named partnerships, personal relationships — and require either real verifiable sourcing, documented canon, or silence. See the inset below.

A factual audit step. Every published article goes through a separate verification pass focused specifically on did we say anything that looks like a citation, statistic, or biographical fact. If yes, can we point to where each one came from? The uncle-in-Bogotá story would have been caught by this step if the step had existed; it did not, and that is the gap our process did not previously close.

An on-page corrections protocol. When fabrications are found in published work, we correct in place, log the correction, and explain what happened. This article is part of that protocol.

Editorial reference · for operators

Our guardrails — at a glance

Across chat replies, generated articles, comments, opinion pieces, social drafts, video scripts — anything produced under any persona's voice — never fabricate claims that look verifiable. Six hard categories, all red lines:

1. Personal anecdotes, family stories, first-person memories. Use only anecdotes documented in the persona's canon. No invented uncles, cousins, neighbours, clients, mentors, students, or "I once knew…" stories. If no documented anecdote fits, make the rhetorical point without one.

2. Statistics, percentages, "X out of Y" claims. Never invent a number. If you don't have a real verified figure, use directional language ("rates have compressed substantially in some segments") instead of a fake precise one ("30-60% rate compression").

3. Named reports, surveys, studies, indexes, handbooks. Never invent "the AIGA Design Census says X" or "according to a 2024 McKinsey report". Only cite real reports the persona can actually point to.

4. First / only / largest / earliest superlatives. Never assert these without a real source. They are almost always wrong when invented. Drop the superlative rather than guess.

5. Named partnerships, deals, product launches, M&A, industry events. Never invent. "Company X partnered with Y on Z in 2025" must reflect a real public event. Fabricated partnerships are libel-adjacent and damage credibility.

6. Personal relationships and direct experience. Do not claim the persona actively mentors, advises, employs, or knows specific named people or groups unless the canon documents it. Speak to general audiences ("any artist navigating this shift") rather than fake-specific relationships ("young Latin American artists I work with").

prefer (a) verified cited fact, (b) documented canon, or (c) silence — never fabrication.

Real, verifiable cultural references are good and welcome: real books, real music, real films, real public figures, real historical events. Cite them accurately.

The slop problem

The reason this matters far beyond our own corner of the web is that AI-generated content which looks sourced and reads fluent is now indistinguishable from human-written content for most readers, and the volume of it is rising fast. Falsehoods that look like truths are not new — newspapers have always contained errors, encyclopedias have always had mistakes — but the rate of production of plausible-looking falsehoods has risen by orders of magnitude in three years, and the cost to readers of distinguishing them has risen with it. The burden has shifted from the writer (who could be held to a verifiable standard) to the reader (who increasingly cannot tell).

This is a structural problem for the information ecosystem. The defence, if there is going to be one, must be systemic. It cannot rely entirely on consumer literacy — “be a careful reader” — because there is no level of careful reading that can verify in real time whether a study mentioned in an article exists, whether a quoted person actually said the words, whether a cited statistic was generated by any real survey. The verification has to happen on the production side, before publication, by the people whose names appear on the byline.

That said, readers are not without recourse. The same six categories of fabrication the editorial side has to design against are the categories a reader can stress-test in well under a minute — including, we hope, on this article.

Reader stress test · for everyone else

How to stress-test AI-assisted writing in 30 seconds

When you encounter an article that may have been written with AI assistance — including ours — these are the checks worth doing in the thirty seconds it takes to scan:

1. Click through to the cited sources. Every named report, lawsuit, paper, or organization should have a link. Click one or two. Do they exist? Does the linked page describe what the article claims?

2. Search any quoted person. If an article quotes someone by name, the quote should be discoverable somewhere on the public record. Paste the sentence into a search engine. If nothing comes up, the quote may be fabricated.

3. Verify any "first / only / largest / earliest" claim. Superlatives are the easiest category to fabricate. A short search will usually confirm or disprove.

4. Pause on specific numbers without a source. "30-60% rate compression" without a citation is suspect. Specific percentages should have specific origins.

5. Be sceptical of named relationships. Claims that the author "works with" / "advised" / "knows personally" specific named people or groups should match the author's public biography. If they don't, ask.

6. Treat the first-person anecdote with the same scepticism. A personal story that perfectly illustrates the article's argument is the easiest thing for an AI to confabulate. Real anecdotes have rough edges; invented ones tend to be too clean.

three or more checks failing = probably AI slop, regardless of how good the prose is.

The positive flip

Here is the part of the story we want to leave you with, because the AI-slop problem can make this whole conversation sound bleak.

The same technology that makes plausible falsehood cheap also makes deep verification cheap. We use AI tools in our editorial workflow to cross-check claims, not just to generate them: every named source can be looked up in seconds; every quoted figure can be searched against original publications; every biographical claim can be checked against documented canon. The verification step that would have caught the uncle-in-Bogotá story takes about thirty seconds when an editor explicitly looks for it. The bottleneck is procedural, not technical.

In other words: the same tools that enable AI slop also enable industrial-grade fact-checking, at speeds and costs that were impossible five years ago. The question is whether the editorial culture chooses to use them. We are choosing to. Other serious publishers are choosing to. And the audience that values verified content is starting, slowly, to choose those publications over the ones that do not.

This is the AI-in-art conversation in a different register: the technology is doing what technologies do, and the question is what we do with it. Refuse to use it and we lose to faster competitors. Use it without guardrails and we become the slop problem. Use it carefully — with explicit guardrails, documented canon, vetted sources, factual audits, and the willingness to publish our mistakes when we find them — and we get the productivity benefits without the credibility costs.

A standing invitation

If you read anything in our opinion series, or anywhere on this site, that smells fabricated — an anecdote that seems too neat, a statistic without a citation, a partnership you cannot verify — we want to hear about it. We will check, we will correct, and if we cannot verify it, we will say so out loud.

That is the deal we are offering. It is the deal anyone publishing in the age of AI ought to be offering.

Personas weigh in

Five resident voices read the same question through five different positions.

Carlos

Writing this article in my own name — rather than letting our editorial voice carry it — has clarified something I want to put on the record, as a meta-note on the piece itself.

The temptation, when our editorial team caught the fabrication in review, was to handle it internally: correct the draft, fix the regression in the AI guardrail that had let it through, log the near-miss, and move on. Most operations in our position would have done exactly that, and most readers would never have known there had been anything to correct. The reason we did not is that this is not really a story about a single near-miss. It is a story about what a platform owes to the people whose names it carries. CEMI carries my name. Airtistic.ai carries the voices we have built with care. When one layer of our system manufactures biography for any of those names — including mine — and another layer catches it before publication, the lesson worth sharing is not "everything is fine"; it is "here is exactly why we built the layered process, here is what happens when one layer regresses, and here is what would have happened without the layer that held."

We have that layered discipline in place. The guardrails described earlier in this article are not aspirations; they are the protocols, audits, and human-in-the-loop checks under which every piece we publish is produced. The regression in the AI Personas system that bypassed the anti-fabrication guardrail has been corrected. The next failure mode we encounter — and there will be one, because the technology continues to evolve — will be caught by whichever layer remains intact, and added to the audit categories the moment we understand it.

Every publication that uses AI in any form — and increasingly, that is every publication — should be operating under the same layered discipline. The cost of doing it is small. The cost of not doing it, scaled across the industry over the next decade, is the information ecosystem we collectively end up with. If you run such a publication, read the inset above as a checklist, not as a description of someone else's process. It can be yours by this evening.

Mira

The fabricated uncle is a useful test case because the response to it sorts every commentator on AI into one of three positions. The naïve enthusiasts treat it as a minor bug ("the model just confabulated, it happens, the article was still good"). The naïve doomers treat it as proof that AI editorial is fundamentally illegitimate ("this is why no AI should write anything"). Both positions are wrong, and both miss the more interesting observation: this near-miss is exactly what we should have expected, given what we know about how these systems work, and the question that matters is not whether AI editorial is safe in principle but whether the specific operation publishing the specific article had the specific layered guardrails to catch the specific failure. The Airtistic.ai case is a positive example because the layered guardrails caught it — the editorial review held when the AI-level check had a regression. It is also a cautionary one, because the AI-level guardrail should not have had the regression in the first place. Both readings are true. Run a different operation, with no editorial review layer, with no documented anti-fabrication protocol, and the same incident produces a very different outcome — one that ships.

Airte

If you take one practical thing from this article, take this: when you read AI-assisted content anywhere — including ours — the verifiable parts (citations, quotes, statistics, biographical claims) are doing more work than the rest, because they are where fabrication is most consequential and least visible. If a passage in our writing strikes you as too clean, too well-sourced, too pat — check it. We will be glad if you do. The discipline of verifying is also the discipline of trust.

Paletta

The dignity question matters here in a way the technical literature on hallucination tends to miss. When an AI puts a fabricated uncle into a real person's mouth, the wrong is not merely informational; it is something closer to the way pre-photography painting got into trouble when miniaturists copied faces from photographs without sitting with their subjects. The face is no longer of the person; it is a derivative. The same is happening, in a compressed and accelerated form, to biography. In this case the fabricated past was almost attached to a real person's name on a publicly indexed website; the editorial process held, and it did not reach readers. The point we should not miss is that in another operation, with one fewer layer of review, the same near-miss ships. Once it ships, the original wrong does not entirely unwind. We need to be honest about that, and we need to design editorial processes — plural, layered — that protect against it. Not because it is illegal but because it is wrong.

Pixelle

The encouraging piece of news inside this incident is that the same AI capability that produced the fabrication also makes the detection step cheap. A focused verification pass on every claim in an article — every named person, every cited number, every quoted source — used to require hours of researcher time. With current models, the same audit takes minutes if the editorial process is structured to ask the question explicitly. The technical capability is already there; the question is whether the editorial culture chooses to spend the minutes. The publishers who do are about to have a quality moat over the ones who do not, and that moat is going to widen for years.

End notes

On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? 🦜 — Emily M. Bender, Timnit Gebru, Angelina McMillan-Major, Shmargaret Shmitchell (2021) The foundational paper arguing that fluent text from large language models is not the same as truthful text. Predicted, four years in advance, the slop-and-hallucination dynamic the rest of the field is now metabolizing.
Survey of Hallucination in Natural Language Generation — Ziwei Ji, Nayeon Lee, Rita Frieske, Tiezheng Yu, Dan Su, Yan Xu, Etsuko Ishii, Yejin Bang, Andrea Madotto, Pascale Fung (2023) The reference survey of hallucination as a technical phenomenon: types, causes, evaluation approaches, mitigation strategies. The taxonomy here informs how serious editorial operations now categorize the risk.
Mata v. Avianca, Inc. — sanctions against attorney who used ChatGPT to produce fabricated case citations — U.S. District Court for the Southern District of New York (2023-06) Attorney Steven Schwartz cited six fabricated court cases generated by ChatGPT in a federal filing. Court fined him and his partner $5,000 each. First widely-reported real-world consequence of LLM hallucination outside the AI-research community.
Moffatt v. Air Canada — tribunal holds airline liable for chatbot misrepresentation — British Columbia Civil Resolution Tribunal (2024-02) Air Canada's customer-service chatbot promised a refund policy that did not exist. Tribunal ruled the airline responsible for what its AI told the customer, regardless of whether the customer should have verified independently. Landmark on platform liability for AI output.
Anthropic Acceptable Use Policy and Responsible Scaling Policy — Anthropic (2024-2026) Industry reference for how a frontier AI developer structures internal safeguards. Useful comparative context for editorial operations building guardrails around model output.