Skip to main content
9 – 17 UHR +49 8031 3508270 LUITPOLDSTR. 9, 83022 ROSENHEIM
DE / EN

Copyright Dispute Forces OpenAI to Disclose 20 Million User Chats: A Wake-Up Call for Companies

Tobias Jonas Tobias Jonas | | 3 min read

A recent court ruling is causing a stir in the tech world and highlights the risks associated with using publicly accessible AI models. OpenAI, the company behind ChatGPT, was ordered by a US court to disclose 20 million anonymized chat logs. This step is part of a far-reaching copyright dispute brought by the “New York Times” and other major media outlets. For companies relying on AI solutions, this case is an unmistakable warning sign.

The Core of the Conflict: Copyright in the Age of AI

The key claim of the lawsuit is explosive: The “New York Times” and other media outlets accuse OpenAI of using their copyrighted articles without permission to train ChatGPT. Moreover, the chatbot is allegedly capable of partially or even nearly verbatim reproducing content from these articles. To substantiate these claims, the plaintiffs demanded access to user chats. They want to prove how often and to what extent ChatGPT relies on their protected content. At the same time, this is intended to refute OpenAI’s argument that the plaintiffs deliberately manipulated (“hacked”) the chatbot to provoke copyright-infringing outputs.

Data Privacy Concerns vs. Transparency Demands

OpenAI vehemently opposed the disclosure of the logs, primarily citing the protection of user data and trust in their platform. The company argued that even anonymized data could compromise user privacy and that 99.99% of conversations were irrelevant to the case anyway. Additionally, OpenAI takes the position that training AI models with publicly available data falls under the American “Fair Use” doctrine.

However, New York Judge Ona Wang did not follow this argument. She ruled that the chat logs are relevant to clarifying the case and dismissed the data privacy concerns. The anonymization – that is, the removal of names, email addresses, and other personal data – is a sufficient measure to protect user privacy. OpenAI was ordered to submit the data within seven days of completing the anonymization.

The Consequences: A Precedent with Far-Reaching Implications

The court’s decision is more than just a milestone victory for the media outlets. It sets a precedent that affects the entire AI industry. Frank Pine, editor-in-chief of MediaNews Group, sharply criticized OpenAI’s stance and accused the leadership of “hallucinating” if they believe they can withhold evidence for a business model based on the “theft of the work of hard-working journalists”.

The case illustrates the immense legal gray area in which companies find themselves when relying on public AI tools like ChatGPT. The disclosure of chat logs, even in anonymized form, is a nightmare for any company. It raises fundamental questions about data security, protection of trade secrets, and compliance with data protection regulations like GDPR.

Conclusion: The Safe Harbor for Enterprise AI

The current developments around OpenAI unmistakably show: Using uncontrollable, public AI models in a corporate context carries significant and incalculable risks. Data leaks, copyright infringements, and compliance problems are real dangers that go far beyond theoretical discussions.

For companies, it is therefore essential to rely on a secure and data-protection-compliant AI solution. This is exactly where our CompanyGPT comes in. As a closed system tailored to your needs, it ensures that your sensitive data never leaves your company. It gives you full control and transparency over data processing and protects you from the legal pitfalls that the current case so impressively demonstrates.

We are happy to help you securely introduce a solution like CompanyGPT in your company in compliance with GDPR and the AI Act and leverage the potential of AI without compromising security.

Tobias Jonas
Written by

Tobias Jonas

Co-CEO, M.Sc.

Tobias Jonas, M.Sc. ist Mitgründer und Co-CEO der innFactory AI Consulting GmbH. Er ist ein führender Innovator im Bereich Künstliche Intelligenz und Cloud Computing. Als Co-Founder der innFactory GmbH hat er hunderte KI- und Cloud-Projekte erfolgreich geleitet und das Unternehmen als wichtigen Akteur im deutschen IT-Sektor etabliert. Dabei ist Tobias immer am Puls der Zeit: Er erkannte früh das Potenzial von KI Agenten und veranstaltete dazu eines der ersten Meetups in Deutschland. Zudem wies er bereits im ersten Monat nach Veröffentlichung auf das MCP Protokoll hin und informierte seine Follower am Gründungstag über die Agentic AI Foundation. Neben seinen Geschäftsführerrollen engagiert sich Tobias Jonas in verschiedenen Fach- und Wirtschaftsverbänden, darunter der KI Bundesverband und der Digitalausschuss der IHK München und Oberbayern, und leitet praxisorientierte KI- und Cloudprojekte an der Technischen Hochschule Rosenheim. Als Keynote Speaker teilt er seine Expertise zu KI und vermittelt komplexe technologische Konzepte verständlich.

LinkedIn