AI Gateway

One API. All AI Models.

Stop managing five different AI vendor APIs. Synaplan routes requests to OpenAI, Claude, Gemini, Groq, and local Ollama models through a single endpoint — with fallbacks, cost controls, and full observability.

View on GitHub Test via WhatsApp

Model Flexibility: Switch models per use case — fast Groq for chat, powerful GPT-5.5 for reasoning, local Ollama for GDPR-strict environments.
Cost Control: Route simple queries to cheaper models, complex ones to more powerful. Define rules by cost, latency, or capability.
No Vendor Lock-in: Open-source and self-hosted. Migrate between providers without rewriting application code.
Full Observability: Every request logged with model, tokens, latency, and cost. Audit trails ready for compliance reviews.
Local Models via Ollama: Run Llama 4, Mistral, Qwen 3, or any Ollama-compatible model on your hardware. No data leaves your server.
OpenAI-Compatible API: Synaplan speaks the OpenAI API format. Drop it in as an OpenAI proxy — no SDK changes needed.

Models & providers

Listing specific model names here would be pointless — they get outdated every few weeks. Instead: we cover the big three commercial providers and a growing list of niche specialists, run any model you like locally via Ollama or NVIDIA Triton, plug straight into HuggingFace, and partner with Groq for ridiculously fast inference. The full, current catalogue lives in our API documentation.

OpenAI · Anthropic · Google — the big three
Niche & specialist providers
Ollama — local open-source models
NVIDIA Triton — GPU self-hosting
HuggingFace — direct integration
Groq — ultra-fast inference partner

See all supported models in the API docs

Start routing AI models in minutes

View on GitHub Test via WhatsApp