Self-hosted architecture

Eight containers, one compose file, two Postgres databases. This page hands you the mental model for what each container does, where data lives on disk, and which secrets matter at first boot.


A Tale instance is eight containers on one host: a Caddy proxy at the edge, the platform and the Convex backend behind it, two Postgres databases (one operational, one for the knowledge corpus), an LLM gateway, and two sandbox containers off to the side for code execution. The compose file is the contract: it defines what runs, what is exposed, and what is mounted. This page hands you the mental model so the install, configure, and operate pages do not have to re-explain it.

Read this before you docker compose up. Come back when you are debugging an outage and need to know which container's logs to open first.

The eight containers

tale-proxy is Caddy at the edge. It terminates TLS, routes everything under / to the platform container, and forwards everything under /api/ and the Convex paths to the convex container. Healthchecks live here.
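The routing split can be sketched as a prefix check. This is an illustration of the decision, not the shipped Caddy configuration: the page names /api/ explicitly but does not enumerate the Convex paths, so CONVEX_PREFIXES is a placeholder and upstream_for is a hypothetical helper.

```python
# Placeholder list: only /api/ is named on this page. The other "Convex paths"
# are not enumerated here, so add them from the real proxy config.
CONVEX_PREFIXES = ("/api/",)


def upstream_for(path: str) -> str:
    """Return the container the proxy would hand this request path to."""
    if any(path.startswith(prefix) for prefix in CONVEX_PREFIXES):
        return "tale-convex"
    # Everything else under / goes to the UI server.
    return "tale-platform"
```

When a request fails, running the path through this kind of check tells you whether to open the tale-platform or the tale-convex logs after the proxy's own.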

tale-platform is the React + TanStack Start server. It renders the UI, serves static assets, and is the browser-facing container behind the proxy. It does not hold business state: everything that needs to persist talks to convex.

tale-convex is the backend: the actions, queries, mutations, and the WebSocket layer the UI subscribes to. Provider keys, agent definitions, automation runs, audit logs all live here. It also runs the knowledge work in-process — document ingestion, web crawling, RAG search, and document generation are Convex node-actions, not separate services. The headless work those jobs need (rendering a web page, turning HTML into a PDF or image) is delegated to the sandbox runtime, which already ships Chromium and Playwright.

tale-db is the operational Postgres (ParadeDB). It holds the Convex backend data — agents, runs, the audit log — and is one of the two stateful containers that matter for backups.

tale-knowledge-db is the knowledge corpus Postgres (ParadeDB), the tale_knowledge database with two schemas: private_knowledge (uploaded-document chunks, embeddings, the BM25 index, the semantic cache) and public_web (crawled web pages). It is split from tale-db so the corpus — the data-residency-sensitive store — can be relocated or replaced on its own. The Convex backend connects to it directly; nothing else does.

tale-bifrost is the LLM gateway for in-sandbox coding agents. It is the only path from a sandboxed agent to a model provider; the platform provisions it and mints per-session keys.

tale-sandbox and tale-sandbox-egress run sandboxed code on behalf of the Run code tool and skill scripts, and serve as the headless-browser runtime the convex backend calls for web rendering and document generation. The egress container is the only path the sandbox has to the network. Egress is open by default — sandboxed code reaches any public host over HTTPS while cloud-metadata and private-range targets stay blocked at the IP layer; lock it down to a hostname allowlist with SANDBOX_EGRESS_ALLOWLIST, described in Hardening.
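The egress policy described above can be modelled in a few lines. This is a sketch of the decision logic only, under two assumptions that this page does not confirm: that SANDBOX_EGRESS_ALLOWLIST is a comma-separated list of hostnames, and that an empty list means the open default. The function names are hypothetical.

```python
import ipaddress


def parse_allowlist(raw: str) -> set[str]:
    """Split a comma-separated hostname list (format assumed, not documented here)."""
    return {host.strip().lower() for host in raw.split(",") if host.strip()}


def egress_allowed(hostname: str, resolved_ip: str, allowlist: set[str]) -> bool:
    """Decide whether a sandbox request to hostname/resolved_ip may leave."""
    ip = ipaddress.ip_address(resolved_ip)
    # Private, loopback, and link-local targets (link-local covers the
    # 169.254.169.254 cloud-metadata address) are refused in every mode.
    if ip.is_private or ip.is_loopback or ip.is_link_local:
        return False
    # An empty allowlist models the open default: any public host passes.
    if not allowlist:
        return True
    return hostname.lower() in allowlist
```

Checking the resolved IP before the hostname mirrors the ordering in the text: the IP-layer block holds even when a hostname is on the allowlist.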

One more service ships but stays off by default: tale-controller is an opt-in sidecar (the controller compose profile) that restarts the convex container on a signed request from the app, so a data-residency change can apply without giving the browser-facing platform Docker-socket access.
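To make "signed request" concrete, here is one common way such a check is built: an HMAC-SHA256 over the request body with a shared secret. The actual scheme tale-controller uses is not described on this page, so treat the algorithm, the hex encoding, and both function names as assumptions.

```python
import hashlib
import hmac


def sign(secret: bytes, body: bytes) -> str:
    """Return a hex HMAC-SHA256 signature of the request body."""
    return hmac.new(secret, body, hashlib.sha256).hexdigest()


def is_authentic(secret: bytes, body: bytes, signature: str) -> bool:
    """Recompute the signature and compare in constant time."""
    expected = sign(secret, body)
    return hmac.compare_digest(expected, signature)
```

The point of the pattern is that the sidecar, not the platform, holds the Docker socket, and it acts only on requests it can verify.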

Data on disk

Four volumes survive a docker compose down:

db-data — the operational Postgres data directory: the database behind agents, runs, and the audit log.
knowledge-db-data — the knowledge corpus Postgres data directory: document chunks, embeddings, the search indexes, and crawled web pages. Backs up separately from db-data because it is a separate database.
backups — checksummed volume snapshots written by tale backup and automatically before migrating deploys; Backups and restore is the drill.
The convex object-store mount — uploaded files, generated documents, exported bundles.

Everything else is ephemeral. Containers can be replaced without data loss as long as the volumes survive.
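A quick pre-flight check for the named volumes might look like the sketch below. It covers only the three volumes this page names; the object-store mount has no name given here, so it is left out. The helper is hypothetical, and the tale_ prefix in the comment is an example of how Compose typically prefixes volume names with the project name.

```python
PERSISTENT_VOLUMES = {
    "db-data": "operational Postgres data directory",
    "knowledge-db-data": "knowledge corpus Postgres data directory",
    "backups": "checksummed volume snapshots",
}


def missing_volumes(existing: list[str]) -> list[str]:
    """Return the named volumes that have no match in `existing`.

    Compose usually prefixes volume names with the project name
    (for example tale_db-data), so match the exact name or a _<name> suffix.
    """
    def present(name: str) -> bool:
        return any(v == name or v.endswith("_" + name) for v in existing)

    return [name for name in PERSISTENT_VOLUMES if not present(name)]
```

Feed it the names reported by docker volume ls before replacing containers; an empty result means the three named volumes are all still there.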

Provider secrets and the SOPS layer

Provider keys (OpenAI, Anthropic, Azure, Ollama, etc.) live on disk in a providers/ directory mounted into the platform container. Each provider has a <name>.json and a <name>.secrets.json; the secrets file is encrypted with SOPS, keyed by the age key in the SOPS_AGE_KEY variable.

This split exists for two reasons. Rotating a provider key means editing one file, not re-running the platform, and the encrypted secrets file is safe to back up and to commit alongside infrastructure code. The plaintext mode (no SOPS, secrets in cleartext) is supported for tightly controlled environments where the disk itself is encrypted at rest.
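One way to audit which mode each provider is in is to look for the top-level "sops" metadata key that SOPS writes into files it encrypts. The sketch below assumes the <name>.secrets.json naming from this page; provider_report is a hypothetical helper, not part of Tale.

```python
import json
from pathlib import Path


def provider_report(providers_dir: Path) -> dict[str, str]:
    """Map each provider name to "sops" or "plaintext" based on its secrets file."""
    suffix = ".secrets.json"
    report: dict[str, str] = {}
    for secrets_path in sorted(providers_dir.glob("*" + suffix)):
        name = secrets_path.name[: -len(suffix)]
        data = json.loads(secrets_path.read_text())
        # SOPS-encrypted JSON carries a top-level "sops" metadata object.
        encrypted = isinstance(data, dict) and "sops" in data
        report[name] = "sops" if encrypted else "plaintext"
    return report
```

Any provider reported as plaintext is a file you should not commit unless the environment matches the plaintext-mode conditions above.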

Auth and sessions

Sign-in is Better Auth running inside the convex container. Four sign-in modes ship: local password, Microsoft Entra (OAuth/OIDC), generic OIDC, and trusted headers (the reverse proxy provides the identity). The platform container reads the cookie, hands it to convex, and convex decides what the session can do based on the user's role and the per-resource permission matrix documented in Members and roles.

The authentication reference covers the env vars and the per-mode trade-offs.

When you outgrow single-host

The default compose file runs all eight containers on one host. The architecture is single-tenant and single-host: nothing in the design splits work across hosts. The first thing you can move off the box without re-architecting is the knowledge corpus. tale-knowledge-db is a standalone Postgres, so pointing it at managed infrastructure (for capacity or for a residency requirement) is a connection-string change, covered in Data residency. The Convex layer is still single-instance; horizontal scaling of the backend is not a v1 feature.

Where this fits

This architecture page is the map every other self-hosted page assumes. The natural next read is Quickstart if you are setting up a fresh instance, or Container architecture if you are operating one and need the same picture with the failure modes overlaid.
