• AI Search
  • Cryptocurrency
  • Earnings
  • Enterprise
  • About TechBooky
  • Submit Article
  • Advertise Here
  • Contact Us
TechBooky
  • African
  • AI
  • Metaverse
  • Gadgets
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
  • African
  • AI
  • Metaverse
  • Gadgets
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
TechBooky
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
Home Artificial Intelligence

NVIDIA Pushes Local AI Agents With New RTX Spark PCs and OpenShell on Windows

Paul Balo by Paul Balo
June 1, 2026
in Artificial Intelligence
Share on FacebookShare on Twitter

NVIDIA is making an aggressive push to bring powerful, privacy-focused AI agents directly onto personal computers, announcing a new class of RTX-powered Windows PCs, expanded developer tools and deep integrations with partners including Microsoft and Adobe.

At GTC Taipei during Computex, the company introduced NVIDIA RTX Spark systems, new security-focused agent infrastructure for Windows called OpenShell, and a series of software optimisations aimed at making locally running AI agents faster and more capable across RTX and DGX hardware.

NVIDIA positions RTX Spark as a new class of Windows PC built specifically for personal AI agents. Each system is designed to run agents locally, with up to 1 petaflop of AI compute and 128GB of unified memory to handle the intensive workloads involved in multi-step reasoning, content generation and automated workflows.

The systems target slim laptops with all-day battery life as well as ultra-efficient desktops, combining AI, creation and gaming capabilities. NVIDIA says RTX Spark is meant to turn a PC from a conventional tool into more of a “teammate” for everyday tasks.

Alongside the hardware, NVIDIA and Microsoft are collaborating to build out a secure platform for on-device agents in Windows. That stack combines new Windows security primitives with NVIDIA’s OpenShell runtime to tackle long-standing concerns about running powerful agents directly on a user’s primary PC.

According to NVIDIA, the new Windows-level primitives cover identity, containment, policy and end-to-end security for building and running native agents. OpenShell then adds:

  • Policy controls that let users specify what agents are permitted to do
  • Intelligent routing of queries to local models based on user privacy preferences
  • Mechanisms to disguise personal information in queries that must go to cloud models

This combined security and privacy layer is being adopted by leading agent developers such as Hermes Agent and OpenClaw in their new Windows applications. These apps aim to expose on-device agents that can operate across Windows software, manage cross-app workflows, generate images and video, build plug-ins and apps, and semantically search local files, all under user-defined constraints.

For professionals who need data-center-class performance in a desktop footprint, NVIDIA also introduced DGX Station for Windows. The company describes it as a deskside AI supercomputer that pairs a data-center-class GPU with a high-end CPU in a Windows-based system designed for manageability, security and compatibility in enterprise environments.

NVIDIA is also working on the software side to speed up the open models and agent frameworks that power local assistants.

Through collaboration with the llama.cpp community, NVIDIA has enabled features such as multi-token prediction (MTP), a speculative decoding technique that uses a smaller draft model to propose multiple tokens at once, which the target model then validates in a single pass. Combined with optimisations like programmatic dependent launch, NVIDIA reports:

  • 2x performance on Qwen 3.6 and 3.5 27B models
  • 1.6x performance on Qwen 3.6 and 3.5 35B models

These improvements are available through the llama.cpp web UI and LM Studio.

For Linux-focused developers and always-on local agents, NVIDIA is promoting DGX Spark as its most capable personal agent system in a Linux environment. DGX Spark combines large unified memory, high compute performance and compatibility with the CUDA ecosystem.

The latest DGX Spark OS release introduces a streamlined out-of-the-box experience with an updated NemoClaw installer and faster inference on leading agentic models. NemoClaw is now available for NVIDIA RTX and DGX PCs on Linux and via Windows Subsystem for Linux (WSL), offering automatic sandboxing and added support for Hermes Agent when deploying local agents.

NVIDIA has also worked with the vLLM community to further accelerate inference for agents. New optimisations in vLLM, along with new NVFP4 checkpoints for Qwen 3.6 35B, deliver a reported 2.6x performance increase on DGX Spark compared with previously available NVFP4 checkpoints from Unsloth. The changes include kernel-level improvements, mixed-precision execution and CUDA Graph support for MTP. A vLLM blog provides a walkthrough of serving NVFP4 mixture-of-experts models on DGX Spark, covering unified memory tuning and a Nemotron 3 Super reference setup.

NVIDIA is also expanding agent capabilities through a partnership with H Company. H Company’s “computer-use harness” allows agents to interact with a PC similarly to a human user, by seeing the screen and controlling the mouse and keyboard, including in applications that don’t expose APIs. NVIDIA says this capability is coming soon to RTX and DGX PCs with local model support.

To make that practical on consumer and developer hardware, NVIDIA has worked with H Company to quantise its Holo Computer Use models and to accelerate the harness, achieving a 2x speedup on NVIDIA GPUs while cutting memory usage by 35%. The models are already available for download, and the Holo Desktop app is described as coming soon.

On the creative side, NVIDIA is partnering with Adobe to rearchitect key Creative Cloud applications for RTX Spark. Adobe Premiere Pro and Photoshop are being tuned to take advantage of RTX Spark’s unified memory, next-generation GPU and NVIDIA TensorRT software.

  • In Photoshop, Firefly-powered Generative Fill is among hundreds of accelerated tools, and the next-generation engine will be optimised for GPU-accelerated compositing with live filters, HDR and modern natural brushing.
  • In Premiere Pro, a new video pipeline is designed to use unified memory, a Blackwell GPU and TensorRT for real-time editing and colour correction, accelerated AI effects and more efficient rendering of complex timelines.
  • Adobe Substance 3D Painter and Stager will run natively on RTX Spark for smoother, more responsive 3D texturing and scene creation workflows.

NVIDIA says these changes can deliver up to 2x faster AI, editing, colouring and effects across creative workflows. Adobe also plans to extend Premiere Pro and Photoshop so users can work with Windows agents inside the apps, treating AI as a collaborative teammate in creative projects. Updates for these Adobe tools are expected to roll out in step with RTX Spark availability.

Beyond core agents and creative apps, NVIDIA is updating several pieces of its broader RTX software stack:

  • NVIDIA Broadcast 2.2 graduates the Studio Voice feature from beta. Studio Voice applies AI processing to make virtually any microphone sound studio-grade, now running on GeForce RTX 3060 GPUs and above with better performance. Broadcast adds Elgato Stream Deck integration and configurable keyboard shortcuts.
  • Project G-Assist gains Stream Deck support via the Elgato MCP Server, enabling AI assistant controls in streaming setups.
  • Blender Cycles is integrating DLSS 4.5 Ray Reconstruction as a new denoiser. This transforms the path-tracing viewport into a near-final, interactive viewer so artists can navigate scenes with close-to-final render quality. The feature is slated to ship with Blender 5.3 this fall, alongside RTX Spark.
  • RTX Video Frame Generation will launch with RTX Spark as an AI effect that can double or quadruple video frame rates in real time. This is aimed at smoothing out typical 15–20 fps outputs from AI video models. It will be available as a Python wheel and a ComfyUI node, letting AI artists generate at low fps and then interpolate up to higher frame rates.

NVIDIA is also highlighting partner and ecosystem moves around RTX Spark:

  • ASUS ProArt creator laptops will ship with Black Forest Labs’ FLUX.2 Klein 4B image model preinstalled via the MuseTree app. The model is optimised in NVFP4 format with NVIDIA TensorRT for RTX, delivering up to a 2.5x speedup and a 560% reduction in memory usage. Creators can start generating images locally immediately after unboxing, without separate model downloads or ComfyUI configuration.
  • NVIDIA AI for Media SDK is adding updates including LipSync NIM microservices tuned for French, German and Spanish, and an enhanced Active Speaker Detection NIM with multicamera support and cross-video speaker correlation.
  • NVIDIA is directing developers to its latest RTX AI Garage blog post covering Hermes Agent and self-improving AI on RTX PCs and DGX Spark.

The company is promoting RTX Spark and its AI ecosystem across social channels including Facebook, Instagram, TikTok and X, and through an RTX Spark newsletter.

Related Posts:

  • vera-cpu-rack
    NVIDIA Unveils Vera CPU Built for the Age of Agentic AI
  • NVIDIA-open-model-families-agentic-physical-healthcare-ai (1)
    Nvidia Expands Open AI Models for Robotics and Healthcare
  • dynamo-1-0
    Nvidia Debuts Dynamo 1.0 as Operating System for AI…
  • nvidia-physical-ai-data-factory-blueprint
    Nvidia Unveils AI Data Factory Blueprint for Robotics
  • GettyImages-2183848501 (1)
    Nvidia Sees $1 Trillion AI Chip Market by 2027
  • skynews-google-chrome-logo_6753346
    Google Chrome To Debut Support for ARM64 Linux This Spring
  • gemini-3.1-pro_deep-research-and.width-1200.format-webp
    Google Launches Deep Research and Deep Research Max…
  • genai-ai-on-crop-blog-why-nemotron-4300971-blog-1280x680-1-960x510
    Nvidia Invests in Open Models to Fuel AI Agent Development

Discover more from TechBooky

Subscribe to get the latest posts sent to your email.

Tags: nvidiaPCRTXwindows
Paul Balo

Paul Balo

Paul Balo is the founder of TechBooky and a highly skilled wireless communications professional with a strong background in cloud computing, offering extensive experience in designing, implementing, and managing wireless communication systems.

BROWSE BY CATEGORIES

Receive top tech news directly in your inbox

subscription from
Loading

Freshly Squeezed

  • NVIDIA Pushes Local AI Agents With New RTX Spark PCs and OpenShell on Windows June 1, 2026
  • Apple Reportedly Targets Late 2027 Launch for Camera-Equipped Smart Glasses June 1, 2026
  • DataHub Turns SQL Query History into Context Layer to Cut AI Data Errors May 29, 2026
  • DeepSeek Locks in 75% Price Cut on V4 Pro, Undercutting Western AI Models by up to 25x May 29, 2026
  • Mistral AI Targets Enterprise with Industrial Push, New Data Center and Assistant Rebrand May 29, 2026
  • Microsoft 365 Copilot Receives Faster Performance Features & Redesigned Look May 29, 2026
  • Anthropic Surges To A $965 Billion Valuation, Overtaking OpenAI May 29, 2026
  • Google Rolls Out Media App Switcher For Android Auto May 29, 2026
  • Bluesky Adopts Long-Form Content To Rival X Articles May 29, 2026
  • Meta Rolls Out Subscriptions For Instagram, Facebook, & WhatsApp, With AI Plans May 28, 2026
  • TELCOs (Airtel & Glo) Resumes Airtime Borrowing To Customers May 27, 2026
  • Over 185,000 Affected By 7-Eleven Data Breach May 26, 2026

Browse Archives

June 2026
MTWTFSS
1234567
891011121314
15161718192021
22232425262728
2930 
« May    

Quick Links

  • About TechBooky
  • Advertise Here
  • Contact us
  • Submit Article
  • Privacy Policy
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
  • African
  • Artificial Intelligence
  • Gadgets
  • Metaverse
  • Tips
  • AI Search
  • About TechBooky
  • Advertise Here
  • Submit Article
  • Contact us

© 2025 Designed By TechBooky Elite

Discover more from TechBooky

Subscribe now to keep reading and get access to the full archive.

Continue reading

Chat with TechBooky AI
💬
TechBooky AI ✕
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.