Simon Willison’s Weblog: OpenAI slams court order to save all ChatGPT logs, including deleted chats

Jun 5, 2025

—

Source URL: https://simonwillison.net/2025/Jun/5/openai-court-order/#atom-everything
Source: Simon Willison’s Weblog
Title: OpenAI slams court order to save all ChatGPT logs, including deleted chats

Feedly Summary: OpenAI slams court order to save all ChatGPT logs, including deleted chats
This is very worrying. The New York Times v OpenAI lawsuit, now in its 17th month, includes accusations that OpenAI’s models can output verbatim copies of New York Times content – both from training data and from implementations of RAG.
(This may help explain why Anthropic’s Claude system prompts for their search tool emphatically demand Claude not spit out more than a short sentence of RAG-fetched search content.)
A few weeks ago the judge ordered OpenAI to start preserving the logs of all potentially relevant output – including supposedly temporary private chats and API outputs served to paying customers, which previously had a 30 day retention policy.
The May 13th court order itself is only two pages – here’s the key paragraph:

Accordingly, OpenAI is NOW DIRECTED to preserve and segregate all output log data that would otherwise be deleted on a going forward basis until further order of the Court (in essence, the output log data that OpenAI has been destroying), whether such data might be deleted at a user’s request or because of “numerous privacy laws and regulations” that might require OpenAI to do so.
SO ORDERED.

That “numerous privacy laws and regulations" line refers to OpenAI’s argument that this order runs counter to a whole host of existing worldwide privacy legislation. The judge here is stating that the potential need for future discovery in this case outweighs OpenAI’s need to comply with those laws.
Unsurprisingly, I have seen plenty of bad faith arguments online about this along the lines of
"Yeah, but that’s what OpenAI really wanted to happen" – the fact that OpenAI are fighting this order runs counter to the common belief that they aggressively train models on all incoming user data no matter what promises they have made to those users.
I still see this as a massive competitive disadvantage for OpenAI, particularly when it comes to API usage. Paying customers of their APIs may well make the decision to switch to other providers who can offer retention policies that aren’t subverted by this court order!
Via Hacker News
Tags: ai-ethics, generative-ai, openai, new-york-times, ai, law, llms

AI Summary and Description: Yes

Summary: The text discusses a court order requiring OpenAI to preserve all ChatGPT output logs, including those from deleted chats, amidst ongoing litigation. This has significant implications for privacy compliance and the competitive landscape in AI services.

Detailed Description: The text highlights a critical legal development involving OpenAI and raises concerns about the implications for privacy, compliance, and competitive dynamics in the AI sector.

– **Court Order**: A New York court has mandated that OpenAI preserve logs of all relevant outputs, including private chats and API interactions, which previously had a 30-day retention policy. This decision is significant as it challenges OpenAI’s previous practices of deleting logs.

– **Privacy Concerns**: The order is justified by the judge’s assertion that the need for discovery in the legal case outweighs OpenAI’s compliance with various global privacy laws. The directive conflicts with OpenAI’s claims regarding adherence to privacy legislation, bringing into question their data practices and user trust.

– **Competitive Disadvantage**: The text suggests this court decision may disadvantage OpenAI in the API service market as customers might prefer other providers with more favorable data retention policies. Consequently, this could impact OpenAI’s user base, client retention, and overall market competitiveness.

– **Public Perception**: The controversy surrounding this case has sparked a discussion on the public’s perception of OpenAI’s data handling practices, particularly interpretations that suggest the company benefits from its own controversial practices despite privacy assurances.

This situation highlights the intersection of AI development, legal compliance, and privacy regulations, which is crucial for professionals in security and compliance roles to monitor. The evolving legal landscape around AI technologies necessitates a proactive approach to ensure compliance with emerging regulations and maintain user trust.

.NET 1 2 2025 3 5 7 a Act actions AI AI development AI technologies and Anthropic Anthropic’s Claude API APIs app Arch art as assurance AWS by C CERN challenges chat ChatGPT CI CIA Claude client co competitive competitive dynamics competitive landscape competitiveness compliance concerns content controversy court critical Customer D data Data Handling data handling practices data practices data retention day de decision demand development directive e emerging emerging regulations ERP Ethics exp fact fighting for future g Gen generative GIS global privacy Go GPT graph gs H hack hacker Hacker News handling high Highlight HR http HTTPS implementation implications in inter interaction interactions interpret io ite J Just k Key l land law lawsuit led Legal legal case legal compliance legal landscape legislation Li litigation llm llms lm log data logs long M made man market market competitiveness mass matt mid Mode model models Monitor N new New York new-york-times news NGO no o of off on only open openai OPM oS other out output Outputs over perception policies policy potential pre preserving privacy Privacy Assurance Privacy compliance privacy concerns privacy law privacy laws privacy legislation privacy regulation privacy regulations proactive professionals prompt prompts public public perception Q question R rag Raise RCE real red Regulation regulations Retention retention policies Ro Role Rust s search Search tool sec sector security security and compliance self service services short Sig Sim source Spark SSE start system system prompt system prompts T Tags: tech technologies text the Time times to tool Tor TP training training data trust two UI up US usage use user user base user data user trust Users V Vantage WAN web Well Wi world x york