Speaker Johnson Meets With House Freedom Caucus
Kent Nishimura
·
gettyimages.com
 
OpenAI Deletes Evidence
User avatar
Curated by
stephenhoban
3 min read
31,142
1,465
According to reports from The New York Times and TechCrunch, OpenAI accidentally deleted potential evidence crucial to its ongoing copyright lawsuit with several news organizations, including The New York Times, complicating the legal proceedings and raising questions about data management in high-stakes litigation.

Accidental Evidence Deletion

On November 14, 2024, OpenAI engineers inadvertently erased crucial evidence stored on one of two virtual machines provided to attorneys representing The New York Times and Daily News
1
2
.
The deletion occurred after legal teams had invested over 150 person-hours reviewing OpenAI's training data for potential copyright infringement since November 1
3
4
.
While OpenAI managed to recover most of the deleted data, the folder structure and file names were irretrievably lost, forcing plaintiffs to recreate their work and resulting in approximately one week's worth of expert and legal analysis being compromised
3
5
.
techcrunch.com favicon
indianexpress.com favicon
futurism.com favicon
5 sources

Technical Data Loss Details

The accidental deletion resulted in significant technical complications for the ongoing legal analysis. While OpenAI managed to recover most of the erased data, the folder structure and file names were irretrievably lost
1
2
.
This loss has severely impacted the ability to determine where the news plaintiffs' copied articles were used in building OpenAI's models
3
.
The incident affected one of two dedicated virtual machines with improved computing resources that OpenAI had provided to allow the news organizations to perform their searches
4
.
As a consequence, the stored programs and search result data on the affected machine were compromised, forcing the plaintiffs to restart their analysis from scratch
5
6
.
techcrunch.com favicon
indianexpress.com favicon
futurism.com favicon
6 sources

Legal Implications for OpenAI

The accidental deletion has significant legal implications for the ongoing copyright lawsuit. While attorneys for the news organizations have stated they do not believe the deletion was intentional, they have requested that OpenAI conduct the searches itself, arguing that the company is best positioned to search its own datasets
1
2
.
This request shifts the burden of evidence gathering onto OpenAI, potentially impacting the lawsuit's dynamics. In response, OpenAI has indicated disagreement with the characterizations made in the legal filing and plans to file a formal response
3
.
The incident was disclosed in a letter to the U.S. District Court for the Southern District of New York on November 20, 2024, bringing the matter to the court's attention and potentially influencing future proceedings
4
5
.
indianexpress.com favicon
theregister.com favicon
futurism.com favicon
5 sources

Broader Lawsuit Context

The copyright dispute between OpenAI and news organizations is part of a larger legal battle challenging the use of copyrighted material in AI training. The New York Times alleges that OpenAI used its content without permission to train AI models, potentially damaging the newspaper's relationship with readers
1
.
This lawsuit highlights the growing tension between traditional media and AI companies, as AI-generated content increasingly competes with human-authored journalism
2
.
The case raises important questions about fair use, intellectual property rights, and the future of content creation in the age of artificial intelligence.
futurism.com favicon
businessinsider.com favicon
2 sources
Related
What was the nature of the evidence that was deleted
How did OpenAI's engineers manage to delete the evidence
What steps are being taken to ensure the integrity of evidence in future lawsuits
How significant was the deleted evidence to the case
What are the potential consequences for OpenAI due to this data loss