The accusation is everywhere: AI art is plagiarism, the models are theft engines, and anyone who uses them is benefiting from stolen labor. The accusation is too broad to be true and too pointed to be ignored. Untangling it requires distinguishing two questions that the public conversation has been blurring together for three years.
The first question is training-stage: was the data the model learned from acquired legally and ethically? The second question is output-stage: is a particular generated image substantially similar to a specific copyrighted work, or substituting for an identifiable artist’s labor? These are different claims under different legal frameworks with different remedies, and the failure to separate them is the single biggest reason the public conversation has stayed stuck.
What plagiarism has always been
Plagiarism is not a claim about style. It is a claim about provenance. Saying this work was made by X and presented as if made by Y is plagiarism. Saying this work is in the style of X is influence. Every artist in history has worked in the style of someone — the entire concept of a school, a movement, a tradition, depends on it being acceptable. Caravaggio’s followers were not plagiarizing Caravaggio. Frida Kahlo’s contemporary admirers in Mexico City making votive-painting-inspired work were not plagiarizing Frida Kahlo. The Cubists did not plagiarize Cézanne. A cover band playing “in the style of” the Beatles is not plagiarizing the Beatles unless they record a specific Beatles song and pass it off as their own.
This is the first piece of vocabulary we need to recover. Style is not the property of any artist; the marks of style accumulate in a culture and become available to anyone. A particular work is the property of its maker, and reproducing it or its meaningful substance without permission is the harm copyright law has always tried to address.
Generative AI complicates this distinction in two specific ways, and we have to be honest about each.
The training-stage question
A diffusion model is trained on a corpus that includes, almost without exception, copyrighted images that were never licensed for training use. The LAION-5B dataset, which underlies Stable Diffusion and many of its derivatives, contains roughly five billion image-text pairs scraped from the public web. Getty Images has alleged that approximately twelve million of those images are theirs. Multiple working illustrators have identified their own work in the dataset by searching it directly through Have I Been Trained and similar tools.
Is this plagiarism? Almost certainly not, in the strict legal sense — no individual image is being reproduced in any output, and the historical analogy is closer to learning from than to copying. Is it copyright infringement of a different and arguably more serious kind? The Andersen v. Stability AI class action and the Getty v. Stability AI case in the U.K. and Delaware are the first attempts by U.S. and U.K. courts to answer that. The Andersen amended complaint survived a motion to dismiss in August 2024, the first time a U.S. court permitted a training-data claim against generative AI to proceed to discovery. The Getty U.K. trial proceeded in June 2025. As of this writing, neither has produced a final ruling, but both are far enough along to suggest the law is moving toward recognizing some form of injury at the training stage, even if it is not strictly plagiarism.
What this injury is, exactly, is the question lawyers and economists are now trying to answer. The most defensible framing is: a class of working artists whose collective labor was scraped without consent, used to train commercial models that now compete with them, with no compensation flowing back to the source artists. The injury is structural and aggregate rather than individual and particular. It is closer to what labour law tries to remedy than to what copyright law tries to remedy. Some recent commentary calls it data appropriation rather than plagiarism, and that vocabulary is probably going to win, because it names what is actually happening.
The output-stage question
The output-stage question is sharper and easier to reason about. Is this particular generated image plagiarism? The honest answer depends on three sub-questions.
First: was the model prompted with the name of a specific, living working artist whose style was being deliberately mimicked? If yes, this is the configuration closest to traditional plagiarism. The Greg Rutkowski case in 2022 is the canonical example. Rutkowski is a Polish digital painter; his name was used in approximately 93,000 Midjourney prompts in the first month of public availability. He did not consent. He was not compensated. Commercial outputs sold under that prompt were directly substitutable for the kind of work he could have been commissioned to do. The Sarah Andersen case is sharper still: her distinctive comic linework was deliberately reproduced by user fine-tunes for outputs that her own audience could not distinguish from her originals at a thumbnail. This is plagiarism by any honest reading of the word, even if the existing legal framework has not yet caught up to it.
Second: is the output substantially similar to a specific identifiable copyrighted work? If yes — that is straightforward copyright infringement under existing law, no different in principle from a human artist closely tracing a copyrighted image. The model is the tool; the user is the infringer. Generative AI has not changed this analysis.
Third: is the output a generic image — a fantasy landscape, a portrait of a person who does not exist, an architectural visualization — that does not closely resemble any specific work and was not prompted with any named living artist? If yes, this is not plagiarism in any meaningful sense, and treating it as such collapses the precision we need to act on the cases that are plagiarism.
The blurred case: style mimicry
The hardest cases sit between the second and third sub-question above: outputs that clearly imitate a style associated with a named artist or a named studio, without literally reproducing any specific work. In the style of Studio Ghibli. In the style of Greg Rutkowski. In the style of Yayoi Kusama. The Studio Ghibli style trend that swept Twitter and TikTok in 2025 — millions of users generating “Ghibli-fied” versions of personal photos — surfaced this question at unprecedented scale. Hayao Miyazaki, who in 2016 had called AI animation “an insult to life itself,” is the world’s most famous unwilling participant in the style-mimicry economy.
Is this plagiarism? The traditional doctrine says style is not protectable, so no. The intuitive reading from inside the affected studios says yes, this is structurally identical to the De La Soul sampling case in 1989: a technology that makes a previously-impossible appropriation cheap, performed without permission, in volumes that materially affect the source. The legal framework will have to choose, and my prediction — informed by the trajectory of the sampling cases — is that style-mimicry-by-name for commercial purposes will, within a decade, become a licensable transaction. Either through a Getty-Stability style negotiated settlement, through statutory revision, or through court rulings that extend the existing right-of-publicity doctrine to commercial style appropriation. The Concord Music v. Anthropic settlement framework from 2024 may be the early template.
What the law actually says, today
In 2026, the operative state of U.S. law is roughly this:
AI-only output is not copyrightable. The Thaler v. Perlmutter ruling (2023, D.C. district court) affirmed the U.S. Copyright Office’s refusal to register a work generated entirely by AI with no human authorship. The Copyright Office’s Part 2 guidance, issued in January 2025, extended this: AI-generated output is registrable only insofar as there is meaningful human authorship — which can include selection, arrangement, modification, or substantial creative control over the generation process, but cannot consist solely of prompt-writing.
Training-data infringement claims are alive and progressing. Andersen v. Stability AI survived a 2024 motion to dismiss; Getty v. Stability AI tried in the U.K. in 2025; NYT v. OpenAI is proceeding through discovery as of this writing. None has produced a definitive ruling on the underlying question of whether scraping copyrighted material for training is fair use.
Output-level infringement remains analysed under existing copyright doctrine. An output that is substantially similar to a specific copyrighted work is infringing, full stop, regardless of the tool. The interesting question is who is liable: the user who prompted it, the platform that produced it, or both. Courts have been moving toward holding both responsible.
Style is still not protectable in the United States. This is the largest gap and the largest target for legal evolution.
Stakeholders
The artist whose work is in the training corpus sees a structural injury that the law has not yet metabolized. The artist whose name is being prompted commercially sees a sharper injury that is closer to traditional plagiarism. The prompt-writer producing generic non-named output is not in the same legal or ethical category as either. The platform sees a business model built on legal ambiguity it benefits from preserving. The buyer sees an output that may or may not have a clean lineage and often has no way of knowing. The critic and the gallery see the same uncertainty, magnified by the responsibility of representing it to the market.
The dignity question — whose work made this possible, and what did they consent to — is the question that connects all of the stakeholders’ concerns and that the existing legal framework was not designed to ask.
What honest practitioners do
In 2026, the practitioners who have settled on a defensible practice tend to share three habits.
They do not prompt with the names of living working artists for commercial output. This is the cleanest avoidable harm and avoiding it is not difficult.
They are honest about the role of AI in their work. They label AI-generated output as such; they do not pass off augmented work as unaided; they describe their workflow when asked. This protects them from claims of misrepresentation and protects buyers from purchasing under false provenance.
They prefer models with clearer training-data provenance when one is available. Adobe Firefly, trained on Adobe Stock and public-domain material with documented contributor compensation, is not equivalent to a model trained on uncredited scraped material — and increasingly, professional commissioning is requiring the difference.
These three practices do not solve the underlying structural questions about training-stage injury. They do narrow the practitioner-level exposure to plagiarism claims down to a manageable, well-defined surface.
Closing
So: is AI art plagiarism by default? No. That answer is precise, and the precision matters. AI art is plagiarism in a specific class of configurations — the Rutkowski-Andersen class — and the rest of the question requires the careful taxonomy this article has tried to provide. The blanket accusation collapses cases that are clearly defensible together with cases that are clearly indefensible, and the consequence is a discourse so noisy that working artists cannot defend themselves precisely where they have the strongest case.
The Andersen amended complaint will advance. The Getty trial will produce a ruling. The Copyright Office guidance will be refined. The legal framework will mature, as it has in every prior cycle, over a period of years rather than months. In the meantime, the question we asked at the top of this article admits of a more precise answer than the discourse has been giving: AI-generated work is not plagiarism by default; a clearly-namable subset of it is, and naming that subset precisely is the most useful thing the rest of us can do while the law catches up.
The next article in this series asks the harder, more uncomfortable question that has been the third rail of this whole debate: should we be offended by art created by AI? That is not a legal question, not an economic question, and not, technically, a plagiarism question. It is a question about what art is for and who it is supposed to honour. We will get to it next.
Comments
Sign in to comment