Create a chat completion via a Verlon gate
Use when: caller wants a chat completion (multi-turn or single-prompt) routed through a Verlon gate with automatic provider failover, quality scoring, and usage tracking
Routes a chat completion request through the named Verlon gate. The gate decides which underlying model+provider serves the request based on its current configuration (primary model, fallback chain, parameter overrides). Supports both streaming and non-streaming responses. Verlon handles provider auth, billing, observability, and fallback on failure.
Documentation Index
Fetch the complete documentation index at: https://docs.verlon.ai/llms.txt
Use this file to discover all available pages before exploring further.
Authorizations
Verlon API key (prefixed verlon_; legacy keys prefixed layer_ from before the 2026-05 rename also validate). Generated from the dashboard under Settings → API Keys, or via verlon key create in the CLI (legacy verlon key create still works as a bin alias until 2026-06-20).
Body
Response
Chat completion response. Shape matches the underlying provider format selected by the gate (OpenAI-compatible by default). For streaming requests, returns SSE chunks instead of a single JSON body.

