OpenAI-compatible API running inside Intel TDX enclaves. One line to switch.
Create an account: sign up at voltagegpu.com and get $5 free credit.
Go to your dashboard and create an API key.
Base URL: https://api.voltagegpu.com/v1/confidential
Use any OpenAI SDK; just change the base URL.
```bash
curl https://api.voltagegpu.com/v1/confidential/chat/completions \
  -H "Authorization: Bearer vgpu_YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "contract-analyst",
    "messages": [
      {"role": "user", "content": "Review this NDA clause: The Receiving Party shall not disclose any Confidential Information for 5 years..."}
    ],
    "max_tokens": 2048,
    "stream": true
  }'
```

```python
from openai import OpenAI

# One line to change: same SDK, same code, hardware-encrypted
client = OpenAI(
    base_url="https://api.voltagegpu.com/v1/confidential",
    api_key="vgpu_YOUR_API_KEY",
)

nda_text = "..."  # placeholder: replace with the NDA text to review

# Use any of the 8 agents as model ID
response = client.chat.completions.create(
    model="contract-analyst",  # or: financial-analyst, compliance-officer, etc.
    messages=[
        {"role": "user", "content": "Review this NDA and flag non-standard terms:\n\n" + nda_text}
    ],
    max_tokens=2048,
)
print(response.choices[0].message.content)
```

```javascript
import OpenAI from 'openai';

const client = new OpenAI({
  baseURL: 'https://api.voltagegpu.com/v1/confidential',
  apiKey: 'vgpu_YOUR_API_KEY',
});

const financialData = '...'; // placeholder: replace with the P&L data to analyze

const response = await client.chat.completions.create({
  model: 'financial-analyst',
  messages: [
    { role: 'user', content: 'Analyze this P&L for red flags:\n\n' + financialData }
  ],
  stream: true,
});

// Print each streamed chunk as it arrives
for await (const chunk of response) {
  process.stdout.write(chunk.choices[0]?.delta?.content || '');
}
```

```python
from langchain_openai import ChatOpenAI

llm = ChatOpenAI(
    base_url="https://api.voltagegpu.com/v1/confidential",
    api_key="vgpu_YOUR_API_KEY",
    model="compliance-officer",
)
response = llm.invoke("Assess GDPR compliance gaps in our AI usage policy")
print(response.content)
```

Pre-built agents accessible via API for contracts, audits, and compliance.
| Model ID | Agent | Industry | Best For |
|---|---|---|---|
| contract-analyst | Contract Analyst | Legal | NDA review, clause risk, liability analysis |
| financial-analyst | Financial Analyst | Finance | P&L analysis, fraud detection, audit findings |
| compliance-officer | Compliance Officer | GRC | GDPR gaps, policy review, regulatory risk |
| medical-analyst | Medical Records Analyst | Healthcare | Patient records, drug interactions, clinical trials |
| due-diligence | Due Diligence Analyst | M&A | Target assessment, concentration risk, valuation |
| cybersecurity-analyst | Cybersecurity Analyst | Security | Incident triage, threat analysis, response plans |
| hr-analyst | HR & Workplace Analyst | HR | Investigation analysis, compliance, policy review |
| tax-analyst | Tax & Transfer Pricing | Tax | Transfer pricing review, tax exposure, structure analysis |
You can also list agents programmatically: GET /v1/confidential/models
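The programmatic listing mentioned above can be sketched with the Python standard library alone. This is a minimal sketch, not the service's official client: it assumes the endpoint returns the OpenAI-style list shape (`{"data": [{"id": ...}]}`), which this page does not confirm, and the helper names `build_models_request`, `extract_model_ids`, and `list_models` are hypothetical.

```python
import json
import urllib.request

BASE_URL = "https://api.voltagegpu.com/v1/confidential"


def build_models_request(api_key: str) -> urllib.request.Request:
    """Build (but do not send) an authenticated GET request for the model list."""
    return urllib.request.Request(
        f"{BASE_URL}/models",
        headers={"Authorization": f"Bearer {api_key}"},
        method="GET",
    )


def extract_model_ids(body: str) -> list:
    """Pull model IDs out of a response body.

    Assumption: the body follows the OpenAI list format {"data": [{"id": ...}]}.
    """
    payload = json.loads(body)
    return [item["id"] for item in payload.get("data", [])]


def list_models(api_key: str) -> list:
    """Send the request and return the model IDs (performs a network call)."""
    request = build_models_request(api_key)
    with urllib.request.urlopen(request, timeout=30) as response:
        return extract_model_ids(response.read().decode("utf-8"))
```

Calling `list_models("vgpu_YOUR_API_KEY")` would then return the available agent slugs, provided the response shape matches the assumption above.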
POST /v1/confidential/chat/completions
Chat completions, OpenAI-compatible
| Parameter | Type | Required | Description |
|---|---|---|---|
| model | string | Yes | Agent slug (e.g. contract-analyst) or model ID |
| messages | array | Yes | Array of {role, content} objects. Max 100 messages, 200K chars total. |
| max_tokens | integer | No | Max response tokens. Default: agent-specific (typically 4096). |
| temperature | float | No | 0.0-1.0. Default: agent-specific (0.08-0.12 for precision). |
| stream | boolean | No | If true, response is streamed as SSE. Default: false. |
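The limits in the table above can be checked on the client before a request is sent. The following is a rough sketch under stated assumptions: it treats "200K chars" as 200,000 characters summed over message contents (the server may count differently), it assumes an empty messages array is rejected, and `validate_chat_request` is a hypothetical helper, not part of any SDK.

```python
MAX_MESSAGES = 100          # documented limit
MAX_TOTAL_CHARS = 200_000   # assumption: "200K chars" summed over message contents


def validate_chat_request(payload: dict) -> list:
    """Return a list of problems; an empty list means these checks passed."""
    problems = []

    model = payload.get("model")
    if not isinstance(model, str) or not model:
        problems.append("model is required and must be a non-empty string")

    messages = payload.get("messages")
    if not isinstance(messages, list) or not messages:
        problems.append("messages is required and must be a non-empty array")
    else:
        if len(messages) > MAX_MESSAGES:
            problems.append(f"too many messages: {len(messages)} > {MAX_MESSAGES}")
        total_chars = sum(
            len(str(m.get("content", ""))) for m in messages if isinstance(m, dict)
        )
        if total_chars > MAX_TOTAL_CHARS:
            problems.append(f"messages too long: {total_chars} > {MAX_TOTAL_CHARS} chars")

    temperature = payload.get("temperature")
    if temperature is not None:
        if not isinstance(temperature, (int, float)) or not 0.0 <= temperature <= 1.0:
            problems.append("temperature must be a number between 0.0 and 1.0")

    return problems
```

Running the check locally avoids spending a request (and rate-limit budget) on a payload the API would reject anyway.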
| Response header | Description |
|---|---|
| X-Confidential | true if processed in TDX enclave |
| X-Agent | Agent slug used |
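A client that must not accept non-enclave responses could verify these headers on every call. The sketch below is illustrative only: the helper names `ran_in_enclave` and `agent_used` are hypothetical, and the case-insensitive comparison of the header value is an assumption about how strictly the value is formatted.

```python
from typing import Mapping, Optional


def _normalize(headers: Mapping[str, str]) -> dict:
    """Lower-case header names, since HTTP header names are case-insensitive."""
    return {name.lower(): value for name, value in headers.items()}


def ran_in_enclave(headers: Mapping[str, str]) -> bool:
    """True only when X-Confidential is present and equals "true"."""
    value = _normalize(headers).get("x-confidential", "")
    return value.strip().lower() == "true"


def agent_used(headers: Mapping[str, str]) -> Optional[str]:
    """Return the agent slug reported in X-Agent, or None if the header is absent."""
    return _normalize(headers).get("x-agent")
```

With the OpenAI Python SDK (v1 and later), response headers should be reachable through `client.chat.completions.with_raw_response.create(...)`, whose result exposes `.headers` alongside `.parse()` for the usual completion object.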
GET /v1/confidential/models
List available TEE models
All API requests require a Bearer token in the Authorization header.
Authorization: Bearer vgpu_YOUR_API_KEY
API keys start with vgpu_.

| Code | Meaning | Fix |
|---|---|---|
| 401 | Invalid or missing API key | Check your Authorization header |
| 402 | Insufficient balance | Top up at voltagegpu.com/billing |
| 429 | Rate limit exceeded | Wait or upgrade plan (Developer: 60/min, Team: 300/min) |
| 503 | TDX enclave starting up | Retry after 30-60s. We never fall back to non-encrypted infrastructure. |
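The error table above maps naturally onto a small retry wrapper: 401 and 402 fail immediately, while 429 and 503 are retried after a pause. This is a sketch under assumptions, not recommended production code: the 5-second wait for 429 is an arbitrary choice (the page only says "wait"), the 30-second wait for 503 is the lower bound of the documented 30-60s range, and `call_with_retry` and its `send` callback are hypothetical names.

```python
import time
from typing import Callable, Tuple

# Seconds to wait before retrying. 429 delay is an assumption; 503 uses the
# lower bound of the documented 30-60s window.
RETRY_DELAYS = {429: 5.0, 503: 30.0}

# Errors that retrying cannot fix, with the fix suggested in the table.
FATAL = {
    401: "Check your Authorization header",
    402: "Top up at voltagegpu.com/billing",
}


def call_with_retry(
    send: Callable[[], Tuple[int, str]],
    max_attempts: int = 3,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Call send() until it returns status 200 or retries are exhausted.

    send() must return a (status_code, body) tuple.
    """
    for attempt in range(1, max_attempts + 1):
        status, body = send()
        if status == 200:
            return body
        if status in FATAL:
            raise RuntimeError(f"{status}: {FATAL[status]}")
        if status in RETRY_DELAYS and attempt < max_attempts:
            sleep(RETRY_DELAYS[status])
            continue
        raise RuntimeError(f"{status}: giving up after {attempt} attempt(s)")
    raise RuntimeError("max_attempts must be at least 1")
```

Passing `sleep` as a parameter keeps the wrapper easy to test without real delays; in normal use the default `time.sleep` applies.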
100 requests/day, all models
Unlimited requests, priority support
Custom SLA, dedicated support, DPA
All inference is billed per token. See the pricing page for details.
TLS 1.3 encryption for all API calls
Intel TDX hardware enclaves — we cannot access your data
Zero data retention — prompts and outputs are never stored
GDPR Art. 28, DPA available, hosted in France
Get $5 free credit. No credit card required.