OpenAI's new agent can compile detailed reports on practically any topic
www.technologyreview.com
OpenAI has launched a new agent capable of conducting complex, multi-step online research into everything from scientific research to personalized bike recommendations at what it claims is the same level as a human research analyst.

The tool, called Deep Research, is powered by a version of OpenAI's o3 reasoning model that's been optimized for web browsing and data analysis. It can search and analyze massive amounts of text, images, and PDFs to compile a thoroughly researched report.

OpenAI claims the tool represents a significant step towards its overarching goal of developing artificial general intelligence (AGI) that matches (or surpasses) humans. It says that what takes the tool tens of minutes would take a human many hours.

In response to a single query, such as "draw me up a competitive analysis between streaming platforms," Deep Research will search the web, analyze the information it encounters, and compile a detailed report that cites its sources. It's also able to draw from files uploaded by users.

OpenAI developed Deep Research using the same chain-of-thought reinforcement learning methods it used to create its o1 multistep reasoning model. But while o1 was designed to focus primarily on mathematics, coding, or other STEM-based questions, Deep Research can tackle a far broader range of subjects. It can also adjust its responses as it goes in reaction to new data it comes across in the course of its research.

This doesn't mean that Deep Research is immune to the same pitfalls as other AI models. OpenAI says the agent can sometimes hallucinate facts and present its users with incorrect information, albeit at a notably lower rate than ChatGPT.
And because each question may take between five and 30 minutes for Deep Research to answer, it's very compute intensive: the longer it takes to research a query, the more compute required.

Despite that, Deep Research is now available at no extra cost to subscribers to OpenAI's paid Pro tier, and will soon roll out to its Plus, Team, and Enterprise users.