Smart Inference
Route every LLM call to the cheapest endpoint that meets your quality bar. Drop-in OpenAI-compatible.
Documentation
Guides, references and quickstarts for Smart Inference, Guard and Architect. One SDK surface, one place to find everything.
Route every LLM call to the cheapest endpoint that meets your quality bar. Drop-in OpenAI-compatible.
Policy-as-code for prompts and outputs. Author-time + CI + runtime enforcement.
Visual designer for multi-model pipelines and agent graphs. Export to code.