OpenAI has evidence that its models helped train China’s DeepSeek
www.theverge.com
Sucking in data you didn’t ask permission for? Sounds familiar.

Chinese artificial intelligence company DeepSeek disrupted Silicon Valley with the release of cheaply developed AI models that compete with flagship offerings from OpenAI, but the ChatGPT maker suspects they were built upon OpenAI data.

OpenAI and Microsoft are investigating whether the Chinese rival used OpenAI’s API to integrate OpenAI’s AI models into DeepSeek’s own models, according to Bloomberg. The outlet’s sources said Microsoft security researchers detected that large amounts of data were being exfiltrated through OpenAI developer accounts in late 2024, which the company believes are affiliated with DeepSeek.

OpenAI told the Financial Times that it found evidence linking DeepSeek to the use of distillation, a common technique developers use to train AI models by extracting data from larger, more capable ones. It’s an efficient way to train smaller models at a fraction of the more than $100 million that OpenAI spent to train GPT-4. While developers can use OpenAI’s API to integrate its AI with their own applications, distilling the outputs to build rival models is a violation of OpenAI’s terms of service. OpenAI has not provided details of the evidence it found.

The situation is rich with irony. After all, it was OpenAI that made huge leaps with its GPT model by sucking down the entirety of the written web without consent.

President Donald Trump’s artificial intelligence czar David Sacks said it is possible that IP theft had occurred. “There’s substantial evidence that what DeepSeek did here is they distilled knowledge out of OpenAI models and I don’t think OpenAI is very happy about this,” Sacks told Fox News on Tuesday.

“We know PRC (China) based companies and others are constantly trying to distill the models of leading US AI companies,” OpenAI said in a statement to Bloomberg. “As the leading builder of AI, we engage in countermeasures to protect our IP, including a careful process for which frontier capabilities to include in released models, and believe as we go forward that it is critically important that we are working closely with the US government to best protect the most capable models from efforts by adversaries and competitors to take US technology.”