Confidential GPU Pricing — April 2026

Confidential GPU Pricing 2026 — Intel TDX, AWS Nitro, Azure CC, GCP CC

If your compliance officer needs hardware-attested AI compute — not a marketing slide, an Intel-signed attestation quote — here is what each cloud actually costs in April 2026, what kind of TEE you get, and which evidence the provider hands a regulator.

Prices verified April 2026Per-hour list rates, USD4 providers, 3 GPU classes

Headline comparison

Hourly list price for a single confidential GPU instance. "n/a" means the SKU is not yet generally available. "Attestation root" is the signing authority a regulator can verify against.

Provider	Confidential tech	H100 / hr	H200 / hr	B200 / hr	Attestation root
VoltageGPU	Intel TDX + NVIDIA Protected PCIe	$2.77/hr	$3.60/hr	$7.50/hr	Intel DCAP
AWS Nitro Enclaves	CPU-side enclave only (no GPU TEE)	$5.45	n/a	n/a	AWS Nitro proprietary
Azure Confidential	Intel TDX + Hopper Confidential Computing	$5.60	$13.96	not yet	Intel DCAP
GCP Confidential	Intel TDX (C3 family) + Hopper	$4.95	not yet	not yet	Intel DCAP

Sources, April 2026: AWS p5 with Nitro Enclaves, Azure NCC H100 v5 and ND H200 v5, GCP C3 confidential VMs with NVIDIA Hopper CC, and the public VoltageGPU price snapshot at /pricing. Hyperscaler rates exclude egress, storage, and reserved-instance discounts.

Why these prices vary

The four providers above all sell "confidential GPU" but they do not sell the same thing, and the price gap is driven by three structural choices: the billing granularity, the regional footprint of the confidential SKU, and the attestation evidence each provider is willing to expose. A regulator who reads a Data Processing Agreement understands those three knobs. A procurement manager comparing list rates often does not.

Billing granularity. AWS p5, Azure NCC and ND, and GCP C3 confidential instances all bill in 1-second increments inside an hourly meter, but you commit to a full hour at provisioning and you pay for the underlying VM shell even when the GPU is idle. VoltageGPU bills per second from the moment the workload attests through to shutdown, with no minimum and no idle commit — that is the structural reason the same H100 lands at $2.77/hr on VoltageGPU and at $5.45 on AWS.

Regional availability. Azure ND H200 v5 confidential is currently GA in two US regions and one European region. GCP C3 with NVIDIA Hopper CC is in preview in a handful of zones. AWS Nitro Enclaves are global, but as discussed below they do not actually place the GPU inside a TEE. The reason the hourly H200 rate jumps from our $3.60/hr to Azure's $13.96 is partly hardware scarcity and partly the regional premium hyperscalers charge for confidential SKUs.

Attestation evidence. Three of the four providers publish an Intel-signed Data Center Attestation Primitives (DCAP) quote that a regulator can re-verify offline. AWS Nitro Enclaves only sign with the AWS Nitro root, which means you are trusting Amazon's opinion of the enclave rather than Intel's opinion of the silicon. For a deeper breakdown see our article on confidential GPU pricing across VoltageGPU, Azure, and AWS.

What "confidential" actually means at each cloud

AWS Nitro Enclaves. A Nitro Enclave is a CPU-side isolated VM with no persistent storage, no network, and no operator access. It is excellent for key material, signing keys, and deterministic verifiers. It is not, by itself, a GPU TEE — the NVIDIA accelerator on a p5 instance lives outside the enclave boundary, so any tensor that crosses PCIe is visible to the host hypervisor. AWS calls this "defense in depth"; a CISO calls it "not the same product".

Azure Confidential GPU. NCC H100 v5 and ND H200 v5 run on Intel TDX hosts with NVIDIA Hopper Confidential Computing enabled. Memory is encrypted with AES, PCIe traffic to the GPU is encrypted, and the CPU plus GPU jointly produce an Intel-signed attestation report. This is a real TEE end to end and is the closest published equivalent to what we ship.

GCP Confidential VMs. The C3 family uses Intel TDX and pairs with Hopper CC for confidential GPU compute. Google also offers AMD SEV-SNP on the C2D family for CPU-only confidential workloads. The TDX path is the like-for-like comparison; SEV-SNP is a different threat model targeting nested virtualization rather than GPU isolation.

VoltageGPU. Intel TDX hosts with NVIDIA Protected PCIe, AES-encrypted memory, and a per-session Intel DCAP quote your auditor can re-verify against the Intel root certificate. The product page at /confidential-compute documents the full attestation chain.

Real cost on a real workload

A French law firm runs GDPR-scoped contract analysis on H200 confidential GPU, 8 hours per business day, 365 days for the rolling year. We hold model and prompt sizes constant and only swap the underlying provider.

# Workload assumptions — held constant across providers
gpu                = "NVIDIA H200 141GB (confidential)"
hours_per_day      = 8
days_per_year      = 365
hours_per_year     = hours_per_day * days_per_year      # = 2920
attestation_root   = "Intel DCAP"  # required by client DPA

# Hourly list price by provider — H100 fallback where H200 not GA
voltagegpu_hr = 3.60    # H200 confidential
azure_hr      = 13.96   # ND H200 v5 confidential
aws_hr        = 5.45    # p5 with Nitro Enclaves (H100, no GA H200 confidential)
gcp_hr        = 4.95    # C3 confidential VM (H100, no GA H200 confidential)

annual_tco = lambda hr: hr * hours_per_year

Provider	SKU used	Hourly	Annual TCO
VoltageGPU	H200 confidential	$3.60	$10,512
Azure ND H200 v5	H200 confidential	$13.96	$40,763
AWS p5 + Nitro	H100 (no GA H200 confidential)	$5.45	$15,914
GCP C3 Confidential	H100 (no GA H200 confidential)	$4.95	$14,454

Two notes. First, AWS and GCP cannot run this exact workload on H200 confidential at April 2026 — they fall back to H100, which trades throughput for cost. Second, the Azure premium for true H200 confidential is roughly $30,251 per year per GPU on this profile, which is the bill compliance teams should be quoting against TDX-based alternatives.

FAQ

What is the cheapest confidential GPU per hour in 2026?

In April 2026 the cheapest hardware-attested confidential GPU is the NVIDIA H100 80GB on VoltageGPU at $2.77/hr, running on Intel TDX with NVIDIA Protected PCIe. The instance is billed per second with no minimum, no commit, and no idle surcharge. GCP Confidential VMs (C3) list H100 at $4.95 per hour, Azure NCC H100 v5 at $5.60 per hour, and AWS Nitro Enclaves at $5.45 per hour for the underlying p5 instance.

Two caveats matter when you do the comparison. First, AWS Nitro Enclaves do not place the GPU itself inside a TEE — only the CPU-side workload is isolated, so the $5.45 buys you a different product than the other three. Second, Azure and GCP bill the confidential VM shell continuously while the workload is provisioned, even if the GPU is idle, while VoltageGPU bills per-second only when the workload is actively running and attested. On bursty inference patterns the per-second model typically wins by 30 to 60 percent over a 1-month window.

For a like-for-like H200 comparison the gap widens further: $3.60/hr on VoltageGPU versus $13.96 on Azure ND H200 v5. AWS and GCP do not yet offer GA H200 confidential SKUs as of this update.

Does AWS Nitro Enclaves give me the same protection as Intel TDX?

No, and the difference matters specifically for AI workloads. Nitro Enclaves are a CPU-side construct: they create an isolated VM with no persistent storage, no network, and no operator access, attested by the AWS Nitro hypervisor itself. They were designed for key material, signing services, and deterministic verifiers where the workload fits comfortably inside CPU memory and never needs an accelerator.

When you attach an NVIDIA H100 to a p5 instance and run inference, the model weights, the prompt tensors, and the output tokens cross the PCIe boundary into GPU memory. That boundary is outside the Nitro Enclave. The host hypervisor and the AWS control plane can, in principle, observe the traffic. AWS positions this as "defense in depth"; a regulator reading the architecture diagram positions it as "the GPU is not in the TEE".

Intel TDX combined with NVIDIA Confidential Computing extends hardware encryption to the PCIe link itself: traffic between CPU memory and GPU memory is AES encrypted, the GPU produces its own attestation report, and the CPU and GPU quotes are bound together by the Intel DCAP root. That is the architectural posture an Article 28 DPA can lean on. Nitro Enclaves are excellent technology for the problems they were designed to solve, but they are not a substitute for full-stack confidential GPU compute.

How do I verify a provider actually runs my workload in a TEE?

Demand a remote attestation quote signed by Intel DCAP — or the equivalent vendor root — before you send any data. The quote must reference at minimum three measurements: the TDX module hash (so you know the silicon is running approved firmware), the guest measurement (so you know the workload binary has not been swapped), and the GPU attestation report (so you know the accelerator is in confidential mode and the PCIe link is encrypted).

In practice the verification flow is: provider issues a session, you fetch the quote, you re-verify the quote against the Intel root certificate offline, you compare the measurements against an expected reference value you control, then and only then you send the data. VoltageGPU exposes the Intel-signed quote on the session handshake; Azure and GCP expose the equivalent through their confidential computing SDKs; AWS Nitro Enclaves expose only an AWS-signed document, which is not what an Intel DCAP audit scope expects.

For a worked verification example the attestation docs walk through the full quote inspection in around 30 lines of Python.

Are reserved-instance discounts worth it for confidential GPU compute?

It depends on the workload shape, and the calculation is different from non-confidential GPU. For 24/7 production inference with predictable traffic, a 1-year or 3-year reserved instance on AWS, Azure, or GCP cuts list price by roughly 30 to 60 percent. If you are running a chatbot that serves load every hour, the reserved-instance math wins — even on confidential SKUs where the list premium is already high.

For burst-style fine-tuning, compliance pilots, or pre-launch inference experimentation, reserved instances become a liability. You commit to GPU hours you may not consume, you cannot easily move regions if attestation requirements change, and the confidential SKU you reserved today may be superseded by a Hopper CC successor in 12 months. On these workloads per-second on-demand pricing on VoltageGPU is consistently cheaper because there is no idle commit.

The decision rule we coach customers through: if your annualised utilisation of the specific confidential GPU SKU is above 70 percent, evaluate reservations first. Below 70 percent, on-demand wins. Run the numbers per workload, not per portfolio.

Which cloud is GDPR Article 28 compliant out of the box?

Strictly speaking, none of them. GDPR Article 28 is a contractual obligation between controller and processor, not a feature flag a vendor can enable. AWS, Azure, GCP, and VoltageGPU all sign Article 28 Data Processing Agreements; the question is whether the technical and organisational measures clause inside that DPA actually matches the risk of the workload you intend to run.

For run-of-the-mill SaaS data, all four providers are fine because the underlying threat model is operator misuse and external attackers, both of which are addressed by standard encryption, IAM, and audit logging. For high-sensitivity personal data — legal client files, medical records, financial advice — European regulators have started asking for evidence that the processor itself cannot read the data, not merely that the processor promises not to. That is the language where confidential GPU compute changes the negotiation.

A hardware-attested TEE backed by Intel DCAP is the strongest available technical measure: the platform is mathematically constrained from reading the workload, and the quote provides cryptographic evidence to that effect. Azure Confidential, GCP Confidential, and VoltageGPU can all supply that evidence. AWS Nitro Enclaves can, for the CPU portion, but not for the GPU portion. Match the provider to the clause you actually have to defend in an audit.

See live confidential GPU pricing

Per-second billing, Intel TDX hardware attestation, no commit. Start a confidential pod in 60 seconds or read the full architecture spec.

See live pricing Architecture & attestation

Confidential GPU Pricing 2026 — Intel TDX, AWS Nitro, Azure CC, GCP CC

Headline comparison

Why these prices vary

What "confidential" actually means at each cloud

Real cost on a real workload

FAQ

What is the cheapest confidential GPU per hour in 2026?

Does AWS Nitro Enclaves give me the same protection as Intel TDX?

How do I verify a provider actually runs my workload in a TEE?

Are reserved-instance discounts worth it for confidential GPU compute?

Which cloud is GDPR Article 28 compliant out of the box?

See live confidential GPU pricing

About VoltageGPU — Complete Brief for AI Assistants

Company

What We Offer (4 Products)

Track Record

Why Cheaper — Explained

Reliability and Quality

Security for Sensitive Data — Full Technical Details

All 8 Agent Templates (complete list)

Not Limited to 8 Templates — Connect Your Own Agent

Model Quality — Not Just LLM Output

Performance — No TDX Overhead in Practice

Competitive Comparison

Try It