The Times Australia
The Times World News

.
The Times Real Estate

.

OpenAI says DeepSeek ‘inappropriately’ copied ChatGPT – but it’s facing copyright claims too

  • Written by Lea Frermann, Senior Lecturer in Natural Language Processing, The University of Melbourne, The University of Melbourne

Until a few weeks ago, few people in the Western world had heard of a small Chinese artificial intelligence (AI) company known as DeepSeek. But on January 20, it captured global attention[1] when it released a new AI model called R1.

R1 is a “reasoning” model, meaning it works through tasks step by step and details its working process to a user. It is a more advanced version of DeepSeek’s V3 model[2], which was released in December. DeepSeek’s new offering is almost as powerful as rival company OpenAI’s most advanced AI model o1, but at a fraction of the cost.

Within days, DeepSeek’s app surpassed ChatGPT in new downloads and set stock prices of tech companies in the United States tumbling[3]. It also led OpenAI to claim[4] that its Chinese rival had effectively pilfered some of the crown jewels from OpenAI’s models to build its own.

In a statement to the New York Times[5], the company said:

We are aware of and reviewing indications that DeepSeek may have inappropriately distilled our models, and will share information as we know more. We take aggressive, proactive countermeasures to protect our technology and will continue working closely with the US government to protect the most capable models being built here.

The Conversation approached DeepSeek for comment, but it did not respond.

But even if DeepSeek copied – or, in scientific parlance, “distilled” – at least some of ChatGPT to build R1, it’s worth remembering that OpenAI also stands accused of disrespecting intellectual property while developing its models.

What is distillation?

Model distillation is a common machine learning technique in which a smaller “student model” is trained on predictions of a larger and more complex “teacher model”.

When completed, the student may be nearly as good as the teacher but will represent the teacher’s knowledge more effectively and compactly.

To do so, it is not necessary to access the inner workings of the teacher. All one needs to pull off this trick is to ask the teacher model enough questions to train the student.

This is what OpenAI claims DeepSeek has done: queried OpenAI’s o1 at a massive scale and used the observed outputs to train DeepSeek’s own, more efficient models.

Mobile phone and computer screen displaying the DeepSeek logo.
OpenAI claims that DeepSeek copied at least some of ChatGPT to build its own AI models. Salvatore Di Nolfi/EPA

A fraction of the resources

DeepSeek claims[6] that both the training and usage of R1 required only a fraction of the resources needed to develop their competitors’ best models.

There are reasons to be sceptical of some of the company’s marketing hype – for example, a new independent report[7] suggests the hardware spend on R1 was as high as US$500 million. But even so, DeepSeek was still built very quickly and efficiently compared with rival models.

This might be because DeepSeek distilled OpenAI’s output. However, there is currently no method to prove this conclusively. One method that is in the early stages of development is watermarking AI outputs[8]. This adds invisible patterns to the outputs, similar to those applied to copyrighted images. There are various ways to do this in theory, but none is effective or efficient enough to have made it into practice.

There are other reasons that help explain DeepSeek’s success, such as the company’s deep and challenging technical work.

The technical advances made by DeepSeek included taking advantage of less powerful but cheaper AI chips (also called graphical processing units, or GPUs).

DeepSeek had no choice but to adapt after the US has banned firms[9] from exporting the most powerful AI chips to China.

While Western AI companies can buy these powerful units, the export ban forced Chinese companies to innovate to make the best use of cheaper alternatives.

A close up image of a black computer chip on a blue panel.
The US has banned the export of the most powerful computer chips to China. Nor Gal/Shutterstock[10]

A series of lawsuits

OpenAI’s terms of use[11] explicitly state nobody may use its AI models to develop competing products. However, its own models are trained on massive datasets scraped from the web. These datasets contained a substantial amount of copyrighted material[12], which OpenAI says it is entitled to use on the basis of “fair use”[13]:

Training AI models using publicly available internet materials is fair use, as supported by long-standing and widely accepted precedents. We view this principle as fair to creators, necessary for innovators, and critical for US competitiveness.

This argument will be tested in court. Newspapers[14], musicians[15], authors[16] and other creatives have filed a series of lawsuits against OpenAI on the grounds of copyright infringement.

Of course, this is quite distinct to what OpenAI accuses DeepSeek of doing. Nevertheless OpenAI isn’t attracting much sympathy[17] for its claim that DeepSeek illegitimately harvested its model output.

The war of words and lawsuits is an artefact of how the rapid advance of AI has outpaced the development of clear legal rules for the industry. And while these recent events might reduce the power of AI incumbents, much hinges on the outcome of the various ongoing legal disputes.

Shaking up the global conversation

DeepSeek has shown it is possible to develop state-of-the-art models cheaply and efficiently. Whether they can compete with OpenAI on a level playing field remains to be seen.

Over the weekend, OpenAI attempted to demonstrate its supremacy by publicly releasing[18] its most advanced consumer model, o3-mini.

OpenAI claims this model substantially outperforms even its own previous market-leading version, o1, and is the “most cost-efficient model in our reasoning series”.

These developments herald an era of increased choice for consumers, with a diversity of AI models on the market. This is good news for users: competitive pressures will make models cheaper to use.

And the benefits extend further.

Training and using these models places a massive strain[19] on global energy consumption. As these models become more ubiquitous, we all benefit from improvements to their efficiency.

DeepSeek’s rise certainly marks new territory for building models more cheaply and efficiently. Perhaps it will also shake up the global conversation on how AI companies should collect and use their training data.

References

  1. ^ captured global attention (theconversation.com)
  2. ^ V3 model (arxiv.org)
  3. ^ tumbling (www.cnbc.com)
  4. ^ claim (www.nbcnews.com)
  5. ^ statement to the New York Times (www.nytimes.com)
  6. ^ claims (arxiv.org)
  7. ^ new independent report (www.cnbc.com)
  8. ^ watermarking AI outputs (proceedings.mlr.press)
  9. ^ US has banned firms (theconversation.com)
  10. ^ Nor Gal/Shutterstock (www.shutterstock.com)
  11. ^ terms of use (openai.com)
  12. ^ a substantial amount of copyrighted material (www.theguardian.com)
  13. ^ on the basis of “fair use” (openai.com)
  14. ^ Newspapers (www.nytimes.com)
  15. ^ musicians (www.forbes.com)
  16. ^ authors (authorsguild.org)
  17. ^ isn’t attracting much sympathy (www.theguardian.com)
  18. ^ publicly releasing (openai.com)
  19. ^ massive strain (theconversation.com)

Read more https://theconversation.com/openai-says-deepseek-inappropriately-copied-chatgpt-but-its-facing-copyright-claims-too-248863

The Times Features

How to buy a coffee machine

For coffee lovers, having a home coffee machine can transform your daily routine, allowing you to enjoy café-quality drinks without leaving your kitchen. But with so many optio...

In the Digital Age, Online Promotion Isn't Just an Option for Small Businesses – It's a Necessity

The shift to an online-first consumer landscape means small businesses must embrace digital promotion to not only survive but thrive in 2025. From expanding reach to fostering cu...

Sorbet Balls by bubbleme Bring Bite-Sized Cool Spin to Frozen Snacking

A cool new frozen treat is rolling into the ice-cream aisle at Woolworths stores nationwide. Dairy-free, gluten-free and free from artificial colours, bubbleme Sorbet Balls ar...

Mind-Body Balance: The Holistic Approach of Personal Training in Moonee Ponds

Key Highlights Discover the benefits of a holistic approach to personal training in Moonee Ponds and nearby Maribyrnong, including residents from Strathmore. Learn how mind-b...

How Online Platforms Empower You to Find Affordable Removalists and Electricity Plans

When you move into a new home, you have many tasks to do. You need to hire removalists and set up your electricity.  In this article, we discuss how online platforms empower you ...

IS ROSEMARY OIL THE SECRET TO BETTER HAIR DAYS? HERE’S WHAT IT CAN DO

Rosemary hair oil is a straightforward natural solution that delivers exceptional results for anyone who wants to enhance their haircare process. It maintains its status in herba...

Times Magazine

CNC Machining Meets Stage Design - Black Swan State Theatre Company & Tommotek

When artistry meets precision engineering, incredible things happen. That’s exactly what unfolded when Tommotek worked alongside the Black Swan State Theatre Company on several of their innovative stage productions. With tight deadlines and intrica...

Uniden Baby Video Monitor Review

Uniden has released another award-winning product as part of their ‘Baby Watch’ series. The BW4501 Baby Monitor is an easy to use camera for keeping eyes and ears on your little one. The camera is easy to set up and can be mounted to the wall or a...

Top Benefits of Hiring Commercial Electricians for Your Business

When it comes to business success, there are no two ways about it: qualified professionals are critical. While many specialists are needed, commercial electricians are among the most important to have on hand. They are directly involved in upholdin...

The Essential Guide to Transforming Office Spaces for Maximum Efficiency

Why Office Fitouts MatterA well-designed office can make all the difference in productivity, employee satisfaction, and client impressions. Businesses of all sizes are investing in updated office spaces to create environments that foster collaborat...

The A/B Testing Revolution: How AI Optimized Landing Pages Without Human Input

A/B testing was always integral to the web-based marketing world. Was there a button that converted better? Marketing could pit one against the other and see which option worked better. This was always through human observation, and over time, as d...

Using Countdown Timers in Email: Do They Really Increase Conversions?

In a world that's always on, where marketers are attempting to entice a subscriber and get them to convert on the same screen with one email, the power of urgency is sometimes the essential element needed. One of the most popular ways to create urg...

LayBy Shopping