NVIDIA is collaborating with Google Cloud to bring agentic AI to enterprises seeking to locally harness the Google Gemini family of AI models using the NVIDIA Blackwell HGX and DGX platforms and NVIDIA Confidential Computing for data security.
With the NVIDIA Blackwell platform on Google Distributed Cloud, on-premises data centers can stay aligned with regulatory requirements and data sovereignty laws by locking down access to sensitive information, such as patient records, financial transactions and classified government information. NVIDIA Confidential Computing also secures sensitive code in the Gemini models from unauthorized access and data leaks.
“By bringing our Gemini models on premises with NVIDIA Blackwell’s breakthrough performance and confidential computing capabilities, we’re enabling enterprises to unlock the full potential of agentic AI,” said Sachin Gupta, vice president and general manager of infrastructure and solutions at Google Cloud. “This collaboration helps ensure customers can innovate securely without compromising on performance or operational ease.”
Confidential computing with NVIDIA Blackwell provides enterprises with the technical assurance that their user prompts to the Gemini models’ application programming interface, as well as the data they used for fine-tuning, remain secure and cannot be viewed or modified.
At the same time, model owners can protect against unauthorized access or tampering, providing dual-layer protection that enables enterprises to innovate with Gemini models while maintaining data privacy.
AI Agents Driving New Enterprise Applications
This new offering arrives as agentic AI is transforming enterprise technology, offering more advanced problem-solving capabilities.
Unlike AI models that perceive or generate based on learned knowledge, agentic AI systems can reason, adapt and make decisions in dynamic environments. For example, in enterprise IT support, while a knowledge-based AI model can retrieve and present troubleshooting guides, an agentic AI system can diagnose issues, execute fixes and escalate complex problems autonomously.
Similarly, in finance, a traditional AI model could flag potentially fraudulent transactions based on patterns, but an agentic AI system could go even further by investigating anomalies and taking proactive measures such as blocking transactions before they occur or adjusting fraud detection rules in real time.
The On-Premises Dilemma
While many can already use the models with multimodal reasoning (integrating text, images, code and other data types to solve complex problems and build cloud-based agentic AI applications), those with stringent security or data sovereignty requirements have so far been unable to do so.
With this announcement, Google Cloud will be one of the first cloud service providers to offer confidential computing capabilities to secure agentic AI workloads across every environment, whether cloud or hybrid.
Powered by the NVIDIA HGX B200 platform with Blackwell GPUs and NVIDIA Confidential Computing, this solution will enable customers to safeguard AI models and data. This lets users achieve breakthrough performance and energy efficiency without compromising data security or model integrity.
AI Observability and Security for Agentic AI
Scaling agentic AI in production requires robust observability and security to ensure reliable performance and compliance.
Google Cloud today announced a new GKE Inference Gateway built to optimize the deployment of AI inference workloads with advanced routing and scalability. Integrating with NVIDIA Triton Inference Server and NVIDIA NeMo Guardrails, it offers intelligent load balancing that improves performance and reduces serving costs while enabling centralized model security and governance.
Looking ahead, Google Cloud is working to enhance observability for agentic AI workloads by integrating NVIDIA Dynamo, an open-source library built to serve and scale reasoning AI models across AI factories.
At Google Cloud Next, attend NVIDIA’s special address, explore sessions, view demos and talk to NVIDIA experts.