0 Commentarios
0 Acciones
242 Views
Directorio
Directorio
-
Please log in to like, share and comment!
-
BLOGS.NVIDIA.COMOracle Cloud Infrastructure Expands NVIDIA GPU-Accelerated Instances for AI, Digital Twins and MoreEnterprises are rapidly adopting generative AI, large language models (LLMs), advanced graphics and digital twins to increase operational efficiencies, reduce costs and drive innovation.However, to adopt these technologies effectively, enterprises need access to state-of-the-art, full-stack accelerated computing platforms. To meet this demand, Oracle Cloud Infrastructure (OCI) today announced NVIDIA L40S GPU bare-metal instances available to order and the upcoming availability of a new virtual machine accelerated by a single NVIDIA H100 Tensor Core GPU. This new VM expands OCIs existing H100 portfolio, which includes an NVIDIA HGX H100 8-GPU bare-metal instance.Paired with NVIDIA networking and running the NVIDIA software stack, these platforms deliver powerful performance and efficiency, enabling enterprises to advance generative AI.NVIDIA L40S Now Available to Order on OCIThe NVIDIA L40S is a universal data center GPU designed to deliver breakthrough multi-workload acceleration for generative AI, graphics and video applications. Equipped with fourth-generation Tensor Cores and support for the FP8 data format, the L40S GPU excels in training and fine-tuning small- to mid-size LLMs and in inference across a wide range of generative AI use cases.For example, a single L40S GPU (FP8) can generate up to 1.4x more tokens per second than a single NVIDIA A100 Tensor Core GPU (FP16) for Llama 3 8B with NVIDIA TensorRT-LLM at an input and output sequence length of 128.The L40S GPU also has best-in-class graphics and media acceleration. Its third-generation NVIDIA Ray Tracing Cores (RT Cores) and multiple encode/decode engines make it ideal for advanced visualization and digital twin applications.The L40S GPU delivers up to 3.8x the real-time ray-tracing performance of its predecessor, and supports NVIDIA DLSS 3 for faster rendering and smoother frame rates. This makes the GPU ideal for developing applications on the NVIDIA Omniverse platform, enabling real-time, photorealistic 3D simulations and AI-enabled digital twins. With Omniverse on the L40S GPU, enterprises can develop advanced 3D applications and workflows for industrial digitalization that will allow them to design, simulate and optimize products, processes and facilities in real time before going into production.OCI will offer the L40S GPU in its BM.GPU.L40S.4 bare-metal compute shape, featuring four NVIDIA L40S GPUs, each with 48GB of GDDR6 memory. This shape includes local NVMe drives with 7.38TB capacity, 4th Generation Intel Xeon CPUs with 112 cores and 1TB of system memory.These shapes eliminate the overhead of any virtualization for high-throughput and latency-sensitive AI or machine learning workloads with OCIs bare-metal compute architecture. The accelerated compute shape features the NVIDIA BlueField-3 DPU for improved server efficiency, offloading data center tasks from CPUs to accelerate networking, storage and security workloads. The use of BlueField-3 DPUs furthers OCIs strategy of off-box virtualization across its entire fleet.OCI Supercluster with NVIDIA L40S enables ultra-high performance with 800Gbps of internode bandwidth and low latency for up to 3,840 GPUs. OCIs cluster network uses NVIDIA ConnectX-7 NICs over RoCE v2 to support high-throughput and latency-sensitive workloads, including AI training.We chose OCI AI infrastructure with bare-metal instances and NVIDIA L40S GPUs for 30% more efficient video encoding, said Sharon Carmel, CEO of Beamr Cloud. Videos processed with Beamr Cloud on OCI will have up to 50% reduced storage and network bandwidth consumption, speeding up file transfers by 2x and increasing productivity for end users. Beamr will provide OCI customers video AI workflows, preparing them for the future of video.Single-GPU H100 VMs Coming Soon on OCIThe VM.GPU.H100.1 compute virtual machine shape, accelerated by a single NVIDIA H100 Tensor Core GPU, is coming soon to OCI. This will provide cost-effective, on-demand access for enterprises looking to use the power of NVIDIA H100 GPUs for their generative AI and HPC workloads.A single H100 provides a good platform for smaller workloads and LLM inference. For example, one H100 GPU can generate more than 27,000 tokens per second for Llama 3 8B (up to 4x more throughput than a single A100 GPU at FP16 precision) with NVIDIA TensorRT-LLM at an input and output sequence length of 128 and FP8 precision.The VM.GPU.H100.1 shape includes 23.4TB of NVMe drive capacity, 13 cores of 4th Gen Intel Xeon processors and 246GB of system memory, making it well-suited for a range of AI tasks.Oracle Clouds bare-metal compute with NVIDIA H100 and A100 GPUs, low-latency Supercluster and high-performance storage delivers up to 20% better price-performance for Altairs computational fluid dynamics and structural mechanics solvers, said Yeshwant Mummaneni, chief engineer of data management analytics at Altair. We look forward to leveraging these GPUs with virtual machines for the Altair Unlimited virtual appliance.GH200 Bare-Metal Instances Available for ValidationOCI has also made available the BM.GPU.GH200 compute shape for customer testing. It features the NVIDIA Grace Hopper Superchip and NVLink-C2C, a high-bandwidth, cache-coherent 900GB/s connection between the NVIDIA Grace CPU and NVIDIA Hopper GPU. This provides over 600GB of accessible memory, enabling up to 10x higher performance for applications running terabytes of data compared to the NVIDIA A100 GPU.Optimized Software for Enterprise AIEnterprises have a wide variety of NVIDIA GPUs to accelerate their AI, HPC and data analytics workloads on OCI. However, maximizing the full potential of these GPU-accelerated compute instances requires an optimized software layer.NVIDIA NIM, part of the NVIDIA AI Enterprise software platform available on the OCI Marketplace, is a set of easy-to-use microservices designed for secure, reliable deployment of high-performance AI model inference to deploy world-class generative AI applications.Optimized for NVIDIA GPUs, NIM pre-built containers offer developers improved cost of ownership, faster time to market and security. NIM microservices for popular community models, found on the NVIDIA API Catalog, can be deployed easily on OCI.Performance will continue to improve over time with upcoming GPU-accelerated instances, including NVIDIA H200 Tensor Core GPUs and NVIDIA Blackwell GPUs.Order the L40S GPU and test the GH200 Superchip by reaching out to OCI. To learn more, join Oracle and NVIDIA at SIGGRAPH, the worlds premier graphics conference, running through Aug. 1.See notice regarding software product information.0 Commentarios 0 Acciones 239 Views
-
BLOGS.NVIDIA.COMTaking AI to Warp Speed: Decoding How NVIDIAs Latest RTX-Powered Tools and Apps Help Developers Accelerate AI on PCs and WorkstationsEditors note: This post is part of the AI Decoded series, which demystifies AI by making the technology more accessible, and showcases new hardware, software, tools and accelerations for RTX PC users.NVIDIA is spotlighting the latest NVIDIA RTX-powered tools and apps at SIGGRAPH, an annual trade show at the intersection of graphics and AI.These AI technologies provide advanced ray-tracing and rendering techniques, enabling highly realistic graphics and immersive experiences in gaming, virtual reality, animation and cinematic special effects. RTX AI PCs and workstations are helping drive the future of interactive digital media, content creation, productivity and development.ACEs AI MagicDuring a SIGGRAPH fireside chat, NVIDIA founder and CEO Jensen Huang introduced James an interactive digital human built on NVIDIA NIM microservices that showcases the potential of AI-driven customer interactions.Using NVIDIA ACE technology and based on a customer-service workflow, James is a virtual assistant that can connect with people using emotions, humor and contextually accurate responses. Soon, users will be able to interact with James in real time at ai.nvidia.com.James is a virtual assistant in NVIDIA ACE.NVIDIA also introduced the latest advancements in the NVIDIA Maxine AI platform for telepresence, as well as companies adopting NVIDIA ACE, a suite of technologies for bringing digital humans to life with generative AI. These technologies enable digital human development with AI models for speech and translation, vision, intelligence, realistic animation and behavior, and lifelike appearance.Maxine features two AI technologies that enhance the digital human experience in telepresence scenarios: Maxine 3D and Audio2Face-2D.Developers can harness Maxine and ACE technologies to drive more engaging and natural interactions for people using digital interfaces across customer service, gaming and other interactive experiences.Tapping advanced AI, NVIDIA ACE technologies allow developers to design avatars that can respond to users in real time with lifelike animations, speech and emotions. RTX GPUs provide the necessary computational power and graphical fidelity to render ACE avatars with stunning detail and fluidity.With ongoing advancements and increasing adoption, ACE is setting new benchmarks for building virtual worlds and sparking innovation across industries. Developers tapping into the power of ACE with RTX GPUs can build more immersive applications and advanced, AI-based, interactive digital media experiences.RTX Updates Unleash AI-rtistry for CreatorsNVIDIA GeForce RTX PCs and NVIDIA RTX workstations are getting an upgrade with GPU accelerations that provide users with enhanced AI content-creation experiences.For video editors, RTX Video HDR is now available through Wondershare Filmora and DaVinci Resolve. With this technology, users can transform any content into high dynamic range video with richer colors and greater detail in light and dark scenes making it ideal for gaming videos, travel vlogs or event filmmaking. Combining RTX Video HDR with RTX Video Super Resolution further improves visual quality by removing encoding artifacts and enhancing details.RTX Video HDR requires an RTX GPU connected to an HDR10-compatible monitor or TV. Users with an RTX GPU-powered PC can send files to the Filmora desktop app and continue to edit with local RTX acceleration, doubling the speed of the export process with dual encoders on GeForce RTX 4070 Ti or above GPUs. Popular media player VLC in June added support for RTX Video Super Resolution and RTX Video HDR, adding AI-enhanced video playback.Read this blog on RTX-powered video editing and the RTX Video FAQ for more information. Learn more about Wondershare Filmoras AI-powered features.In addition, 3D artists are gaining more AI applications and tools that simplify and enhance workflows, including Replikant, Adobe, Topaz and Getty Images.Replikant, an AI-assisted 3D animation platform, is integrating NVIDIA Audio2Face, an ACE technology, to enable improved lip sync and facial animation. By taking advantage of NVIDIA-accelerated generative models, users can enjoy real-time visuals enhanced by RTX and NVIDIA DLSS technology. Replikant is now available on Steam.Adobe Substance 3D Modeler has added Search Asset Library by Shape, an AI-powered feature designed to streamline the replacement and enhancement of complex shapes using existing 3D models. This new capability significantly accelerates prototyping and enhances design workflows.New AI features in Adobe Substance 3D integrate advanced generative AI capabilities, enhancing its texturing and material-creation tools. Adobe has launched the first integration of its Firefly generative AI capabilities into Substance 3D Sampler and Stager, making 3D workflows more seamless and productive for industrial designers, game developers and visual effects professionals.For tasks like text-to-texture generation and prompt descriptions, Substance 3D users can generate photorealistic or stylized textures. These textures can then be applied directly to 3D models. The new Text to Texture and Generative Background features significantly accelerate traditionally time-consuming and intricate 3D texturing and staging tasks.Powered by NVIDIA RTX Tensor Cores, Substance 3D can significantly accelerate computations and allows for more intuitive and creative design processes. This development builds on Adobes innovation with Firefly-powered Creative Cloud upgrades in Substance 3D workflows.Topaz AI has added NVIDIA TensorRT acceleration for multi-GPU workflows, enabling parallelization across multiple GPUs for supercharged rendering speeds up to 2x faster with two GPUs over a single GPU system, and scaling further with additional GPUs.Getty Images has updated its Generative AI by iStock service with new features to enhance image generation and quality. Powered by NVIDIA Edify models, the latest enhancement delivers generation speeds set to reach around six seconds for four images, doubling the performance of the previous model, with speeds at the forefront of the industry. The improved Text-2-Image and Image-2-Image functionalities provide higher-quality results and greater adherence to user prompts.Generative AI by iStock users can now also designate camera settings such as focal length (narrow, standard or wide) and depth of field (near or far). Improvements to generative AI super-resolution enhances image quality by using AI to create new pixels, significantly improving resolution without over-sharpening the image.LLM-azing AIChatRTX a tech demo that connects a large language model (LLM), like Metas Llama, to a users data for quickly querying notes, documents or images is getting a user interface (UI) makeover, offering a cleaner, more polished experience.ChatRTX also serves as an open-source reference project that shows developers how to build powerful, local, retrieval-augmented applications (RAG) applications accelerated by RTX.ChatRTX is getting a interface (UI) makeover.The latest version of ChatRTX, released today, uses the Electron + Material UI framework, which lets developers more easily add their own UI elements or extend the technologys functionality. The update also includes a new architecture that simplifies the integration of different UIs and streamlines the building of new chat and RAG applications on top of the ChatRTX backend application programming interface.End users can download the latest version of ChatRTX from the ChatRTX web page. Developers can find the source code for the new release on the ChatRTX GitHub repository.Meta Llama 3.1-8B models are now optimized for inference on NVIDIA GeForce RTX PCs and NVIDIA RTX workstations. These models are natively supported with NVIDIA TensorRT-LLM, open-source software that accelerates LLM inference performance.Dells AI Chatbots: Harnessing RTX Rocket FuelDell is presenting how enterprises can boost AI development with an optimized RAG chatbot using NVIDIA AI Workbench and an NVIDIA NIM microservice for Llama 3. Using the NVIDIA AI Workbench Hybrid RAG Project, Dell is demonstrating how the chatbot can be used to converse with enterprise data thats embedded in a local vector database, with inference running in one of three ways:Locally on a Hugging Face TGI serverIn the cloud using NVIDIA inference endpointsOn self-hosted NVIDIA NIM microservicesLearn more about the AI Workbench Hybrid RAG Project. SIGGRAPH attendees can experience this technology firsthand at Dell Technologies booth 301.HP AI Studio: Innovate Faster With CUDA-X and GalileoAt SIGGRAPH, HP is presenting the Z by HP AI Studio, a centralized data science platform. Announced in October 2023, AI Studio has now been enhanced with the latest NVIDIA CUDA-X libraries as well as HPs recent partnership with Galileo, a generative AI trust-layer company. Key benefits include:Deploy projects faster: Configure, connect and share local and remote projects quickly.Collaborate with ease: Access and share data, templates and experiments effortlessly.Work your way: Choose where to work on your data, easily switching between online and offline modes.Designed to enhance productivity and streamline AI development, AI Studio allows data science teams to focus on innovation. Visit HPs booth 501 to see how AI Studio with RAPIDS cuDF can boost data preprocessing to accelerate AI pipelines. Apply for early access to AI Studio.An RTX Speed Surge for Stable DiffusionStable Diffusion 3.0, the latest model from Stability AI, has been optimized with TensorRT to provide a 60% speedup.A NIM microservice for Stable Diffusion 3 with optimized performance is available for preview on ai.nvidia.com.Theres still time to join NVIDIA at SIGGRAPH to see how RTX AI is transforming the future of content creation and visual media experiences. The conference runs through Aug. 1.Generative AI is transforming graphics and interactive experiences of all kinds. Make sense of whats new and whats next by subscribing to the AI Decoded newsletter.0 Commentarios 0 Acciones 259 Views
-
BLOG.PLAYSTATION.COMPlayStation Plus Monthly Games for August: LEGO Star Wars The Skywalker Saga, FNAF Security Breach, Ender Lilies: Quietus of the KnightsRelive all nine Star Wars films in Lego form, survive an overnight in Freddy Fazbears Mega Pizzaplex and journey through a destroyed kingdom with Augusts PlayStation Plus Monthly Games lineup. From August 6*, all PlayStation Plus members** can add Lego Star Wars The Skywalker Saga, Five Nights at Freddys Security Breach and Ender Lilies: Quietus of the Knights to their game libraries.Lets take a closer look at each game.View and download imageDownload the imagecloseCloseDownload this imageLEGO Star Wars The Skywalker Saga | PS4, PS5Play through all nine Star Wars saga films in a brand-new LEGO video game unlike any other. Experience fun-filled adventures, whimsical humor, and the freedom to fully immerse yourself in the LEGO Star Wars galaxy. Want to play as a Jedi? A Sith? Rebel, bounty hunter, or droid? LEGO Star Wars: The Skywalker Saga features hundreds of playable characters from throughout the galaxy. Whether on land or in space, a variety of vehicles are yours to command. Jump to lightspeed in the Millennium Falcon, fly the T-47 Airspeeder and battle TIE fighters in Resistance X-wings its the ultimate LEGO Star Wars experience.View and download imageDownload the imagecloseCloseDownload this imageFive Nights at Freddys Security Breach | PS4, PS5Five Nights at Freddys: Security Breach is the latest installment of the family-friendly horror games from Steel Wool Games. Play as Gregory, a young boy trapped overnight in Freddy Fazbears Mega Pizzaplex. With the help of Freddy Fazbear himself, Gregory must survive the near-unstoppable hunt of reimagined Five Nights at Freddys characters as well as new, horrific threats.View and download imageDownload the imagecloseCloseDownload this imageEnder Lilies: Quietus of the Knights | PS4Unravel the mysteries of a destroyed kingdom in this dark fantasy 2D action RPG. Journey through the sprawling and hauntingly beautiful Lands End, traversing a submerged forest, a sealed-off contaminated underground cavern, and a grand castle. Formidable bosses await that will gladly claim your life given even the slightest chance. Defeat these powerful foes and release them from their unending curse to recruit them as allies. Overcome the challenges before you and search for the truth with powerful knights at your side.Last chance to download Julys games*PlayStation Plus members have until August 6 to add Borderlands 3, NHL 24 and Among Us to their game libraries.*All three games will be available to PlayStation Plus members on August 6 until September 2.**PlayStation Plus Game Catalog lineup may differ in certain regions. Please check the PlayStation Store on launch day for your regions lineup.0 Commentarios 0 Acciones 259 Views
-
WWW.POLYGON.COMSquid Game season 2 has a release date and a season 3 renewalImage: Netflix Season 2 of Squid Game will take us (and Lee Jung-jaes Gi-hun) back to the game Continue reading0 Commentarios 0 Acciones 224 Views
-
WWW.POLYGON.COMHeres where you can pre-order the Lego Super Mario & Yoshi setImage: Lego The latest addition to the Lego Super Mario catalog arrives Oct. 1 Continue reading0 Commentarios 0 Acciones 244 Views
-
WWW.POLYGON.COMHonkai: Star Rail Xianzhou Luofu treasure chest locations and mapImage: Hoyoverse via Polygon Where to find all the treasure, puzzles, formidable foes, and Warp Trotters on the Xianzhou Luofu Continue reading0 Commentarios 0 Acciones 225 Views
-
WWW.POLYGON.COM10 tips for recruiting in EA Sports College Football 25Image: EA Tiburon/Electronic Arts How to land those five-star recruits youve been chasing Continue reading0 Commentarios 0 Acciones 215 Views
-
WWW.POLYGON.COMHumble Games restructure puts several console updates in limboImage: Stairway Games/Humble Games A PR rep for Humble Games maintains that developer concerns will be addressed Continue reading0 Commentarios 0 Acciones 212 Views
-
WWW.POLYGON.COMHugh Jackmans performance is the one sincere thing in Deadpool & WolverinePhoto: Jay Maidment/20th Century Studios Nothing can stop Hugh from being great Continue reading0 Commentarios 0 Acciones 223 Views