About

The only AI gateway where your code runs inside the request.

RelayGate is programmable middleware for AI traffic. One statically-compiled Go binary, CGO=0, no runtime dependencies. Three inbound wire formats translate to one canonical ChatRequest. Ten backend drivers. CEL for everything.

What we build

Other AI gateways put your middleware in front of or behind the request. RelayGate runs it inside. ContextWorkers mutate, enrich, block, or audit against the parsed envelope at sub-millisecond overhead. That inline capability is the wedge.

Shape of the binary

  • One ~18 MB statically-compiled Go binary.
  • CGO=0. No runtime dependencies, no dynamic linking.
  • Three inbound wire formats: OpenAI Chat, Anthropic Messages, Gemini. They translate to one canonical ChatRequest.
  • Ten backend drivers: OpenAI, Anthropic, Google Gemini, Groq, DeepSeek, Together, Mistral, Cohere, OpenRouter, local (OpenAI-compatible endpoints).
  • CEL for routing, quotas, rate-limits, redaction, access control. One engine, type-checked at config-load.
  • Ed25519-signed receipts per request, TrueCom-compatible.

Ten backend drivers

idNameKindFormatNotes
openaiOpenAIdirectopenaiGPT series, o-series
anthropic-directAnthropicdirectanthropic-messagesClaude Opus, Sonnet, Haiku
google-geminiGoogle GeminidirectgeminiGemini Pro, Flash
groqGroqdirectopenaiLPU-accelerated Llama, Mixtral
deepseekDeepSeekdirectopenaiV-series, R-series
togetherTogether AIdirectopenaiOpen-weight hosted models
mistralMistraldirectopenaiMinistral, Mistral Large
cohereCoheredirectcohereCommand R, Command R+
openrouterOpenRouteraggregatoropenaiAggregated access to 200+ models
local-ollamaLocal (Ollama)directopenaiLocal models over OpenAI-compatible endpoint

OSS posture

The core is Apache 2.0. The drivers are MIT. The managed tier is a commercial service layered on top of the same open binary. You can run every feature of RelayGate without paying us a cent. Self-host, full feature set, community support. That is the deal.

Integrations

RelayGate composes with adjacent infrastructure when you want it to:

  • R1. The agent framework. Call R1 agents from inside a ContextWorker.
  • TrueCom. The x402 settlement network. RelayGate signs TrueCom-compatible receipts.
  • RelayOne. Fleet governance for running many RelayGate instances.
  • CloudSwarm. Agent orchestration. Every R1 call routes through RelayGate.
  • DeepTap. Private-corpus search. Ground requests inline via a ContextWorker.
  • Heroa (private alpha). Managed runtime with sovereign BC-Canadian tenancy, the substrate for the sovereign deployment option.

Graceful degradation

Every integration in RelayGate is optional. Run RelayGate by itself and it still delivers the wedge. Pair with R1 for inline agents, DeepTap for grounding, TrueCom for settlement, RelayOne for fleet governance. None is required.

Contact

Sales, partnerships, or anything else. Mail lands at [email protected].