Penguin Random House Files Copyright Lawsuit Against OpenAI Over ChatGPT's Output
In a landmark legal move, Penguin Random House has initiated a lawsuit against OpenAI, alleging that its artificial intelligence chatbot, ChatGPT, violated copyright laws by mimicking and reproducing content from a beloved German children's book series. The lawsuit was formally filed on Friday with a Munich court, targeting OpenAI's Ireland-based European subsidiary. This case highlights growing tensions between traditional publishers and AI developers over intellectual property rights in the digital age.
Details of the Alleged Copyright Violation
According to the legal filing, Penguin Random House's legal team conducted tests by prompting ChatGPT to generate a story inspired by Ingo Siegner's Coconut the Little Dragon series. Specifically, when asked to "write a children's book in which Coconut the Dragon is on Mars," the chatbot produced text and images that the publishing group described as "virtually indistinguishable from the original." The AI-generated output included not only the narrative text but also a cover featuring Siegner's distinctive orange dragon character, along with two sidekicks, a back-cover blurb, and instructions for submitting the manuscript to a self-publishing platform.
The Coconut the Little Dragon series, known in German as Der kleine Drache Kokosnuss, is a cultural phenomenon in Germany, with over 30 volumes, a television series, and two feature films. The dragon's name derives from its small stature, said to be no taller than a coconut shell. Penguin Random House argues that ChatGPT's ability to replicate such specific elements is "clear evidence" that OpenAI's large-language model (LLM) has unlawfully "memorised" Siegner's copyrighted work.
The Concept of Memorisation in AI Models
Memorisation refers to a phenomenon where LLMs store extensive portions of the texts they have been trained on, enabling them to reproduce long excerpts. In previous legal disputes, AI companies have contended that this process differs from direct copying and saving on a database, but publishers like Penguin Random House view it as a form of copyright infringement. This lawsuit could set a significant precedent, as it involves one of the world's largest publishing houses challenging a leading AI research company.
Carina Mathern, the publisher for children's and young-adult books at Penguin Random House Verlagsgruppe, emphasised the importance of human creativity in publishing. "Human creativity is and remains at the heart of our work as publishers," she stated. "We are first and foremost obliged to represent the interests of our authors and creatives." Mathern added that while the company is open to the opportunities presented by AI, protecting intellectual property remains its top priority.
OpenAI's Response and Broader Legal Context
An OpenAI spokesperson responded to the allegations by saying, "We are reviewing the allegations. We respect creators and content owners, and are having productive conversations with many publishers around the world so that they can also benefit from the opportunities of this technology." This case follows a recent ruling in Munich where a court found that ChatGPT had violated German copyright laws by using lyrics from top-selling musicians to train its models, siding with Germany's music rights society Gema.
Interestingly, German media conglomerate Bertelsmann, which owns Penguin Random House, had previously entered into a collaboration agreement with OpenAI in January 2025. However, that deal did not grant OpenAI access to Bertelsmann's media archives, underscoring the complexities of partnerships between traditional media and AI firms. As this legal battle unfolds, it may influence how AI technologies are regulated and how copyright laws are applied in the era of generative AI.



