Browse chat, coding, embedding, and speech models, all served through one confidential API.
Model
Kimi K2.5
The most capable model available on Privatemode. Combines advanced reasoning with support for vision.

Model
Voxtral Mini 3B
Transcribe multilingual audio to text with Voxtral Mini 3B’s fast ASR preview, ideal for low‑latency speech input in fully private workflows.
Model
Qwen3 Coder
Power coding assistants with Qwen3‑Coder 30B‑A3B for refactors, explanations, and tool‑driven edits, all processed end‑to‑end inside secure enclaves.
Model
Gemma 3 27B
Run multimodal chat and vision agents with a 128k context window, served through an encrypted /v1/chat/completions endpoint.
Model
Qwen3 Embedding
Generate multilingual embeddings for search and RAG using Qwen3‑Embedding 4B over an OpenAI‑compatible /v1/embeddings endpoint.
Model
gpt-oss-120b
Use OpenAI’s reasoning‑focused gpt‑oss‑120b for long‑context analysis and agent orchestration, while prompts and outputs stay fully encrypted in Privatemode.
Model
Whisper Large-v3
Use Whisper large‑v3 when accuracy matters most, combining state‑of‑the‑art speech recognition and translation with Privatemode’s encrypted audio processing.
Would you like to use Privatemode as the AI provider in a specific tool that we do not yet support?
Let us know!