Pricing

GPU & AI Pricing

Real-time comparison across 4 markets: Compute, Inference, Fine-Tuning, and Confidential Compute.

GPU Compute — Price Comparison

The 4 most popular GPUs compared across all providers. Per-hour, single GPU.

All GPUs

Every GPU across every provider. All prices USD/hr, single GPU.

AI Inference — Price per Million Tokens

Compare inference API pricing across VoltageGPU, OpenAI, Anthropic, Google, and more. Prices in USD per 1M tokens.

Flagship Reasoning

DeepSeek-R1 / GPT-4o

$0.46/$1.85in / out
Input (per 1M tokens)
VoltageGPU
$0.46
Together AI
$0.55
Google (Gemini Pro)
$1.25
OpenAI (GPT-4o)
$2.50
Anthropic (Claude Sonnet)
$3.00
Output (per 1M tokens)
VoltageGPU
$1.85
Together AI
$2.20
Google (Gemini Pro)
$5.00
OpenAI (GPT-4o)
$10.00
Anthropic (Claude Sonnet)
$15.00
Save 85% on inputSave 88% on output
General Purpose

Qwen3-32B / GPT-4.1-mini

$0.15/$0.44in / out
Input (per 1M tokens)
Google (Gemini Flash)
$0.075
VoltageGPU
$0.15
Together AI
$0.18
Anthropic (Haiku)
$0.25
OpenAI (GPT-4.1-mini)
$0.40
Output (per 1M tokens)
Google (Gemini Flash)
$0.30
VoltageGPU
$0.44
Together AI
$0.55
Anthropic (Haiku)
$1.25
OpenAI (GPT-4.1-mini)
$1.60
Save 63% on inputSave 73% on output
Coding

Llama 3.3 70B / Code models

$0.35/$0.40in / out
Input (per 1M tokens)
VoltageGPU
$0.35
Together AI
$0.88
Fireworks AI
$0.90
OpenAI (GPT-4.1)
$2.00
Anthropic (Opus)
$15.00
Output (per 1M tokens)
VoltageGPU
$0.40
Together AI
$0.88
Fireworks AI
$0.90
OpenAI (GPT-4.1)
$8.00
Anthropic (Opus)
$75.00
Save 98% on inputSave 99% on output
Free / Ultra-Low Cost

GLM-4-9B / Small models

FREE
Input (per 1M tokens)
OpenAI (GPT-4.1-nano)
$0.10
Together AI
$0.10
Output (per 1M tokens)
Together AI
$0.10
OpenAI (GPT-4.1-nano)
$0.40
VoltageGPU offers select small models completely free -- no API key fees, no rate limits for basic use.

Fine-Tuning — Price per Hour of Training

Compare fine-tuning prices across providers. VoltageGPU pricing is fully managed (infrastructure + training orchestration included).

1-3B params

$18.50/hr
VoltageGPU (Managed)
$18.50
OpenAI
Per-token only
Together AI
$5.00
Replicate
$3.25
Modal (Self-managed)
$1.50
Fully Managed includes infra, orchestration, and monitoring

7-13B params

$27.75/hr
VoltageGPU (Managed)
$27.75
OpenAI
Per-token only
Together AI
$10.00
Replicate
$6.50
Modal (Self-managed)
$3.00
Fully Managed includes infra, orchestration, and monitoring

30-70B params

$46.25/hr
VoltageGPU (Managed)
$46.25
OpenAI
Not available
Together AI
$20.00
Replicate
Not available
Modal (Self-managed)
$6.00
Fully Managed includes infra, orchestration, and monitoring

Image LoRA

$18.50/hr
VoltageGPU (Managed)
$18.50
Replicate
$4.50
Modal (Self-managed)
$2.00
CivitAI
$5.00
Fully Managed includes infra, orchestration, and monitoring
VoltageGPU pricing is fully managed. Competitors marked "Self-managed" require DevOps setup, custom infrastructure, and manual GPU orchestration.

Why VoltageGPU

Per-second billing

Only pay for what you use. Stop your pod and billing stops instantly. No rounding to the hour.

No lock-in

Stop anytime, no commitments. No reserved instances, no upfront payments, no contracts.

140+ AI Models

OpenAI-compatible inference API included with every account. DeepSeek, Llama, Qwen and more.

Start building today

Get $5 free credit. No credit card required.