Is the Privatemode API compatible with OpenAI SDKs and tools?

Yes. The Privatemode API follows the OpenAI API standard, so any tool or application already using the OpenAI SDK can switch with minimal code changes. No data is transferred to OpenAI — Privatemode only adopts the same interface for convenience and portability.

What is the Privatemode proxy and why do I need it?

The proxy runs locally and has two responsibilities: it verifies the integrity of the Privatemode backend before any data is sent, and it handles end-to-end encryption of all requests and responses. It's a lightweight Docker container and is required to establish the confidential channel between your application and the AI.

Can I use Privatemode with my IDE or coding assistant?

Yes. Privatemode works with Claude Code, OpenCode, VS Code (GitHub Copilot, Cline, Continue), and JetBrains IDEs, making it suitable for teams working with sensitive codebases they can't share with traditional AI services.

E2E confidential computing

E2E verifiability

Zero-access architecture

Easy-to-use

E2E confidential computing

E2E verifiability

Zero-access architecture

Easy-to-use

E2E confidential computing

E2E verifiability

Zero-access architecture

Easy-to-use

E2E confidential computing

E2E verifiability

Zero-access architecture

Easy-to-use

The AI API with zero
data exposure

With Privatemode, you get on-prem like privacy combined
with the convenience and flexibility of the cloud.

Get your free API key

Trusted by security‑critical teams at enterprises and public institutions

Product

Overview

Use cloud‑grade AI with
on‑prem‑level confidentiality.

Models

Use frontier‑grade LLMs
without giving up privacy.

Privatemode gives you access to leading open-weight models through one secure endpoint, so you can choose the best model for each workload without exposing sensitive data.

Explore models

Performance

Get cloud‑level speed on
sensitive workloads.

The inference API runs on modern GPU infrastructure to deliver low‑latency responses and high token throughput, even under heavy load. You keep real‑time performance for agents and user‑facing apps without managing your own GPU clusters.

Visit documentation

Integration

Keep your OpenAI and Anthropic integrations,
change only the endpoint.

The Privatemode Encryption Proxy is fully compatible with the OpenAI and Anthropic API, including existing SDKs and client libraries. Point your apps to the proxy URL and reuse your code while all traffic is transparently encrypted end‑to‑end.

See all integrations

Pricing

Pay only for tokens you
actually use.

Predictable usage-based pricing with no hidden fees, from free tier to production. Enterprise plans with custom rate limits available on request.

Explore pricing

Security and encryption

Privatemode protects
your data end-to-end.

Privatemode layers end‑to‑end encryption with confidential computing and verifiable attestation, so you can verify where and how your data is processed. Explore the security architecture to see how key management and attestation defend against both external attacks and insider access.

View security architecture

Introduction
Confidential inference API

Use cloud‑based AI without losing control of your data.

Drop‑in, OpenAI‑compatible API

Run the Privatemode Encryption Proxy and keep your existing OpenAI clients, SDKs, and request formats—just point them to the new endpoint. Your apps talk to a familiar API while the proxy transparently encrypts prompts and decrypts responses for you.

Your data never leaves your control

Prompts and responses are encrypted before they leave your infrastructure and stay encrypted in transit and at rest. Only the AI inside Privatemode’s confidential‑computing environment can access plaintext, never operators or cloud providers.

Confidential computing, end‑to‑end

Privatemode uses hardware‑based confidential computing to keep data encrypted even while it is processed in main memory. Remote attestation verifies the runtime and models before any request is decrypted, so only approved code ever sees plaintext.

How to get started
In 3 simple steps

As easy as 1-2-3. 
As secure as it gets.

Run the Privatemode Encryption Proxy.

The proxy verifies the integrity of the Privatemode service using confindential computing-based remote attestation. The proxy also encrypts all data before sending and decrypts data it receives.

Send your prompts to the proxy.

You now have programmatic access to an end-to-end secure AI. The proxy is compatible with the OpenAI Chat API. Process your own sensitive data or provide trustworthy services to others – the possibilities are endless.

Start using the Privatemode API

Ready in 5min

Documentation

See the quick-start guide

Comparison

The scalability of the cloud.
The security of on-prem.

Data Privacy
Compliance
Setup time
Costs
Maintenance
Scalability

Privatemode

Cloud-grade AI with end-to-end encryption and confidential computing. Built for industries where compliance isn't optional.

Confidential computing
Compliance by design
Ready within minutes
No infrastructure costs
Fully managed
Automatic

Get started for free

ChatGPT

Powerful AI, but prompts and data are processed unencrypted on OpenAI's servers, with no verifiable privacy guarantees

Contractual
Limited
Usable within minutes
No infrastructure costs
Fully managed
Automatic

Self-hosted AI

Full data control, but requires dedicated hardware, in-house ML ops, and ongoing maintenance.

Full control
Full control
Weeks to months
High upfront
Self-managed
Manual

Joint case study

How Privatemode delivers secure AI with confidential computing

Read the report

FAQ

Technical Details

Frequently asked questions about using Privatemode API

Install Docker, run the Privatemode proxy with your API key, and start sending requests to localhost:8080. The proxy handles all encryption and attestation automatically. Full instructions are in the quickstart guide — setup takes around 10 minutes.

Want to see Privatemode in action?

We're happy to show you around and give an overview of what's possible.

Book a demo

The AI API with zero data exposure

Use cloud‑grade AI with on‑prem‑level confidentiality.

Models

Use frontier‑grade LLMs without giving up privacy.

Performance

Get cloud‑level speed on sensitive workloads.

Integration

Keep your OpenAI and Anthropic integrations, change only the endpoint.

Pricing

Pay only for tokens you actually use.

Security and encryption

Privatemode protects your data end-to-end.

Use cloud‑based AI without losing control of your data.

Drop‑in, OpenAI‑compatible API

Your data never leaves your control

Confidential computing, end‑to‑end

As easy as 1-2-3. As secure as it gets.

Run the Privatemode Encryption Proxy.

Send your prompts to the proxy.

Start using the Privatemode API

Documentation

The scalability of the cloud. The security of on-prem.

Privatemode

ChatGPT

Self-hosted AI

How Privatemode delivers secure AI with confidential computing

Frequently asked questions about using Privatemode API

How do I get started with the Privatemode API?

Is the Privatemode API compatible with OpenAI SDKs and tools?

What is the Privatemode proxy and why do I need it?

Can I use Privatemode with my IDE or coding assistant?

Want to see Privatemode in action?

The AI API with zero
data exposure

Use cloud‑grade AI with
on‑prem‑level confidentiality.

Use frontier‑grade LLMs
without giving up privacy.

Get cloud‑level speed on
sensitive workloads.

Keep your OpenAI and Anthropic integrations,
change only the endpoint.

Pay only for tokens you
actually use.

Privatemode protects
your data end-to-end.

As easy as 1-2-3. 
As secure as it gets.

The scalability of the cloud.
The security of on-prem.