What we build

Other AI gateways put your middleware in front of or behind the request. RelayGate runs it inside. ContextWorkers mutate, enrich, block, or audit against the parsed envelope at sub-millisecond overhead. That inline capability is the wedge.

Shape of the binary

One ~18 MB statically-compiled Go binary.
CGO=0. No runtime dependencies, no dynamic linking.
Three inbound wire formats: OpenAI Chat, Anthropic Messages, Gemini. They translate to one canonical ChatRequest.
Ten backend drivers: OpenAI, Anthropic, Google Gemini, Groq, DeepSeek, Together, Mistral, Cohere, OpenRouter, local (OpenAI-compatible endpoints).
CEL for routing, quotas, rate-limits, redaction, access control. One engine, type-checked at config-load.
Ed25519-signed receipts per request, TrueCom-compatible.

Ten backend drivers

id	Name	Kind	Format	Notes
`openai`	OpenAI	direct	openai	GPT series, o-series
`anthropic-direct`	Anthropic	direct	anthropic-messages	Claude Opus, Sonnet, Haiku
`google-gemini`	Google Gemini	direct	gemini	Gemini Pro, Flash
`groq`	Groq	direct	openai	LPU-accelerated Llama, Mixtral
`deepseek`	DeepSeek	direct	openai	V-series, R-series
`together`	Together AI	direct	openai	Open-weight hosted models
`mistral`	Mistral	direct	openai	Ministral, Mistral Large
`cohere`	Cohere	direct	cohere	Command R, Command R+
`openrouter`	OpenRouter	aggregator	openai	Aggregated access to 200+ models
`local-ollama`	Local (Ollama)	direct	openai	Local models over OpenAI-compatible endpoint

OSS posture

The core is Apache 2.0. The drivers are MIT. The managed tier is a commercial service layered on top of the same open binary. You can run every feature of RelayGate without paying us a cent. Self-host, full feature set, community support. That is the deal.

Integrations

RelayGate composes with adjacent infrastructure when you want it to:

R1. The agent framework. Call R1 agents from inside a ContextWorker.
TrueCom. The x402 settlement network. RelayGate signs TrueCom-compatible receipts.
RelayOne. Fleet governance for running many RelayGate instances.
CloudSwarm. Agent orchestration. Every R1 call routes through RelayGate.
DeepTap. Private-corpus search. Ground requests inline via a ContextWorker.
Heroa (private alpha). Managed runtime with sovereign BC-Canadian tenancy, the substrate for the sovereign deployment option.

Graceful degradation

Every integration in RelayGate is optional. Run RelayGate by itself and it still delivers the wedge. Pair with R1 for inline agents, DeepTap for grounding, TrueCom for settlement, RelayOne for fleet governance. None is required.

Contact

Sales, partnerships, or anything else. Mail lands at [email protected].