Major US publishers have sued OpenAI and Microsoft. They accuse the companies of using their texts illegally to train AI language models. These include models like ChatGPT and Copilot. The publishers are now demanding money for their journalistic work.
This lawsuit is a battle for the future of content. If AI companies use content without payment, journalists and publishers lose their income. For you, this means the quality and diversity of news could suffer. It's about whether human work still has value in the digital world.
Publishers representing nearly 400 US newspapers have sued OpenAI and Microsoft. The companies allegedly used texts from these newspapers without permission. This was done to train their large language models like ChatGPT and Copilot. Publishers normally earn money through subscriptions and licenses. The lawsuit now seeks damages for copyright infringement.
For you, as a user of ChatGPT or Copilot, the origin of your information becomes more important. If AI models are trained with protected content without permission, questions of fairness and legality arise. It also shows how much the quality of AI answers depends on the training data. You should ask yourself how transparent these training processes are.
Companies using AI tools like ChatGPT or Copilot face increasing legal risk. If the training data for AI models was used without a license, products could lead to lawsuits. For companies, this means: carefully check which AI tools you use. Pay attention to how they were trained to avoid costly legal problems. Data origin is also crucial for compliance.
The lawsuit could be an opportunity for content creators to be paid fairly. If AI companies have to buy licenses for training data, a new market for high-quality content will emerge. This would financially secure journalism and creative work. At the same time, it improves the quality of AI models, as they are trained on legal and verifiable data.
The biggest risk is that the development of AI models will slow down. This happens if access to training data becomes too expensive or complicated. At the same time, there is a risk that publishers who do not sue will continue to see their content used without payment. It is a difficult balance between innovation and copyright protection. Another danger is that AI models might use less diverse data to avoid lawsuits. This could lead to a loss of quality.






