9to5Neural: ChatGPT Operator, Claude Citations, Trump AI EO

@9to5Mac shared a link

2025-01-24 02:20:08 ·

9to5mac.com

Welcome to 9to5Neural. AI moves fast. We help you keep up. In our inaugural edition, were exploring the start of the next frontier for OpenAI, Anthropics thoughtful solution to a common AI critique, and presidential AI executive order ping-pong. Lets start making sense of the latest in AI news.ChatGPT gets to work with OperatorOpenAI recently released the 18K gold Apple Watch Edition of ChatGPT. ChatGPT Pro is a $200/month subscription that makes Tim Cook wish Apple had that kind of recurring revenue per customer.Starting today, ChatGPT Pro also gives AI enthusiasts a major new reason to subscribe beyond higher request limits.Meet Operator. OpenAI calls it a research preview of an agent that can use its own browser to perform tasks for you. From meme creation to ordering groceries and filling out forms, OpenAI dubs Operator one of its first agents that will execute tasks you give it. Today were releasingOperator, an agent that can go to the web to perform tasks for you. Using its own browser, it can look at a webpage and interact with it by typing, clicking, and scrolling.It is currently a research preview, meaning it has limitations and will evolve based on user feedback.Operator wont always be behind a $200/month paywall. OpenAI plans to open access to this AI tool for Plus, Team, and Enterprise paid users in the future. For now, Operator is available to all ChatGPT Pro customers in the U.S. at operator.chatgpt.com. OpenAI says Operator is powered by its new Computer-Using Agent (CUA) technology. Powering Operator is Computer-Using Agent (CUA), a model that combines GPT-4os vision capabilities with advanced reasoning through reinforcement learning. CUA is trained to interact with graphical user interfaces (GUIs)the buttons, menus, and text fields people see on a screenjust as humans do. This gives it the flexibility to perform digital tasks without using OS- or web-specific APIs.[]While CUA is still early and has limitations, it sets new state-of-the-art benchmark results, achieving a 38.1% success rate on OSWorld for full computer use tasks, and 58.1% on WebArena and 87% on WebVoyager for web-based tasks. These results highlight CUAs ability to navigate and operate across diverse environments using a single general action space.I guess this is as good of a time as any to announce that I am stepping down from 9to5Neural to spend more time with my family.All future editions of 9to5Neural will be brought to you by Operator. I have full faith in the Computer-Using Agent to translate AI news for humanity going forward. Wait, no, I spoke too soon. Apparently theres an issue with our ChatGPT Pro subscription. Im back in the saddle!But seriously, Operator is clearly a big deal. Well look back at January 2025 as a milestone in AI advancement. Computer-User Agent technology may also satisfy AI skeptics who keep asking when ChatGPT-5 is coming. The other big OpenAI story this week? Stargate. Or as Sam Altman said on X, big. beautiful. buildings. Whats Stargate? Basically a big computer brain in Texas. OpenAI detailed the initiative this week:The Stargate Project is a new company which intends to invest $500 billion over the next four years building new AI infrastructure for OpenAI in the United States. We will begin deploying $100 billion immediately. This infrastructure will secure American leadership in AI, create hundreds of thousands of American jobs, and generate massive economic benefit for the entire world. This project will not only support the re-industrialization of the United States but also provide a strategic capability to protect the national security of America and its allies.The initial equity funders in Stargate are SoftBank, OpenAI, Oracle, and MGX. SoftBank and OpenAI are the lead partners for Stargate, with SoftBank having financial responsibility and OpenAI having operational responsibility. Masayoshi Son will be the chairman.Arm, Microsoft, NVIDIA, Oracle, and OpenAI are the key initial technology partners. The buildout is currently underway, starting in Texas, and we are evaluating potential sites across the country for more campuses as we finalize definitive agreements.As part of Stargate, Oracle, NVIDIA, and OpenAI will closely collaborate to build and operate this computing system.Behind every ambitious AI firm is an ambitious billionaire, of course, and the billionaires are fighting on X over Stargate finances.Elon Musk, whose xAI firm has no involvement in Stargate, responded to the announcement on X, saying they dont actually have the money. Musk added that he has it on good authority that SoftBank has well under $10B secured.Altman, on the other hand, is confident the parties involved have funding secured.Meanwhile, the OpenAI boss says he fell into the non-playable character trap regarding Trump (now that Trump has made his character playable, referring to Stargate).Frankly, Im much more bullish on the prospects of ChatGPT Operator than I am on the relationship complexities of the billionaires. Claude brings receipts with CitationsMeanwhile, Anthropic, which has always had a more measured approach to AI safety, is launching a promising new tool for its Claude chatbot called Citations. Today, were launching Citations, a new API feature that lets Claude ground its answers in source documents. Claude can now provide detailed references to the exact sentences and passages it uses to generate responses, leading to more verifiable, trustworthy outputs. []Previously, developers relied on complex prompts that instruct Claude to include source information, often resulting in inconsistent performance and significant time investment in prompt engineering and testing. With Citations, users can now add source documents to the context window, and when querying the model, Claude automatically cites claims in its output that are inferred from those sources.Our internal evaluations show that Claudes built-in citation capabilities outperform most custom implementations, increasing recall accuracy by up to 15%.Anthropic points to relevant use cases including customer support queriers and document summarization tasks. Best take? Kyle B. Russel on X, no citations needed:Claude 3.5 Sonnet and Claude 3.5 Haiku are ready for Citations starting today, and Anthropic has documentation ready for your exploration.New AI EO trumps last AI EOFollowing that brief break from presidential politics, lets return to the American policy on AI.President Trump continued his marathon executive order signing race on Thursday, revoking the Biden administrations executive order on AI policy with the Trump administrations executive order on AI policy. In case youve forgotten, Bidens EO on AI focused on artificial intelligence safety, infrastructure standards, mitigating job disruption, and watermarking AI content for transparency. In sum, Bidens executive order:Emphasized the safe, secure, and trustworthy development of artificial intelligence (AI).Mandated standards for critical infrastructure, cybersecurity enhancements, and oversight of federally funded projects.Addressed societal challenges, including mitigating job disruptions, advancing equity, and protecting civil rights.Required AI-generated content to include watermarks for transparency and to distinguish it from human-created material.Per the AP report, Trumps AI executive order revokes past government policies that act as barriers to American AI innovation, adding that the U.S. must develop AI systems that are free from ideological bias or engineered social agendas, per the executive order.Aside from the broad policy directive, President Trumps AI EO authorizes the development of an AI action plan within 180 days, per the AP, which will be headed by Special Advisor for AI and Crypto David Sacks, the ex-PayPal executive appointed by Trump.Going forward, tech companies will no longer need to disclose with the government the development of AI models that cross a certain power threshold.Deep competition from DeepSeek R1Meanwhile, AI competition isnt just happening among American firms. This week, Chinese AI firm DeepSeek released its R1 model family into the wild.Whats unique about R1 is that the model can run locally with performance comparable to OpenAIs ChatGPT 4o model. Local models tend to trail models that operate off-machine, making this developmental model and DeepSeek worth watching.The catch? R1 naturally has a state-approved view of world history when it comes to topics like the 1989 Tiananmen Square protest and massacre or Taiwans independence. You know, just in case the stakes for who wins the AI race werent clear already.More on the latest in AI developments in the next edition of 9to5Neural only on 9to5Mac!Top iPhone accessoriesAdd 9to5Mac to your Google News feed. FTC: We use income earning auto affiliate links. More.Youre reading 9to5Mac experts who break news about Apple and its surrounding ecosystem, day after day. Be sure to check out our homepage for all the latest news, and follow 9to5Mac on Twitter, Facebook, and LinkedIn to stay in the loop. Dont know where to start? Check out our exclusive stories, reviews, how-tos, and subscribe to our YouTube channel

0 Comments ·0 Shares ·61 Views

Upgrade to Pro