How Open-Inspect Works

Open-Inspect is a background coding agent system. Unlike interactive coding assistants where you watch the AI work in real-time, Open-Inspect runs sessions in the cloud independently of your connection. You send a prompt, optionally close your laptop, and check the results later.

This guide covers the core architecture, how sessions work, and what happens when you send a prompt. For deployment instructions, see GETTING_STARTED.md.

The Background Model

The key insight behind Open-Inspect is that coding sessions don't need your constant attention.

Traditional coding assistants require you to stay connected:

You type → AI responds → You watch → You respond → Repeat

Open-Inspect decouples your presence from the work:

You send prompt → Session runs in background → You check results when ready

This enables workflows that aren't possible with interactive tools:

Fire and forget: Notice a bug before bed, kick off a session, review the PR in the morning
Parallel sessions: Run multiple approaches simultaneously without tying up your machine
Multiplayer: Share a session URL with a colleague and collaborate in real-time
Unlimited concurrency: Your laptop isn't the bottleneck—spin up as many sessions as you need

Sessions

A session is the core unit of work in Open-Inspect. Each session is:

Tied to a repository: The agent works in a clone of your repo
Persistent: State survives across connections—close the browser, come back later
Multiplayer: Multiple users can join, send prompts, and see events in real-time
Stateful: Contains messages, events, artifacts, and sandbox state

Session Lifecycle

Created → Active → Archived
            ↑
            └── Can be restored from archive

Sessions start when you create one (via web or Slack). They remain active as long as there's work happening or recent activity. You can archive sessions to clean up your list, and restore them later if needed.

What's Stored in a Session

Data	Description
Messages	Prompts you've sent and their metadata
Events	Tool calls, token streams, status updates
Artifacts	PRs created, screenshots captured
Participants	Users who have joined the session
Sandbox state	Reference to the current sandbox and its snapshot

Each session gets its own SQLite database in a Cloudflare Durable Object, ensuring isolation and high performance even with hundreds of concurrent sessions.

Architecture

Open-Inspect uses a three-tier architecture spanning multiple cloud providers:

┌─────────────────────────────────────────────────────────────────────────┐
│                              Clients                                     │
│                    ┌───────────┬───────────┐                            │
│                    │    Web    │   Slack   │                            │
│                    └─────┬─────┴─────┬─────┘                            │
│                          │           │                                   │
└──────────────────────────┼───────────┼───────────────────────────────────┘
                           │           │
                           ▼           ▼
┌─────────────────────────────────────────────────────────────────────────┐
│                    Control Plane (Cloudflare)                            │
│  ┌────────────────────────────────────────────────────────────────────┐ │
│  │                    Durable Objects (per session)                    │ │
│  │  ┌──────────┐  ┌───────────┐  ┌────────────┐  ┌────────────────┐  │ │
│  │  │  SQLite  │  │ WebSocket │  │   Event    │  │    Sandbox     │  │ │
│  │  │   State  │  │    Hub    │  │   Stream   │  │   Lifecycle    │  │ │
│  │  └──────────┘  └───────────┘  └────────────┘  └────────────────┘  │ │
│  └────────────────────────────────────────────────────────────────────┘ │
│  ┌────────────────────────────────────────────────────────────────────┐ │
│  │                   D1 Database (shared state)                        │ │
│  │           Sessions index, repo metadata, encrypted secrets          │ │
│  └────────────────────────────────────────────────────────────────────┘ │
└───────────────────────────────────┬─────────────────────────────────────┘
                                    │
                                    ▼
┌─────────────────────────────────────────────────────────────────────────┐
│                       Data Plane (Modal)                                 │
│  ┌────────────────────────────────────────────────────────────────────┐ │
│  │                        Session Sandbox                              │ │
│  │  ┌────────────┐    ┌────────────┐    ┌────────────┐               │ │
│  │  │ Supervisor │───▶│  OpenCode  │───▶│   Bridge   │───────────────┼─┼──▶ Control Plane
│  │  └────────────┘    └────────────┘    └────────────┘               │ │
│  │                           │                                        │ │
│  │                    Full Dev Environment                            │ │
│  │              (Node.js, Python, git, Playwright)                    │ │
│  └────────────────────────────────────────────────────────────────────┘ │
└─────────────────────────────────────────────────────────────────────────┘

Control Plane (Cloudflare Workers)

The control plane is the coordinator. It doesn't execute code—it manages state and routes messages.

Responsibilities:

Session state management (SQLite in Durable Objects)
WebSocket connections for real-time streaming
Sandbox lifecycle orchestration (spawn, snapshot, restore)
GitHub integration (repo listing, PR creation)
Authentication and access control

Why Cloudflare? Durable Objects provide per-session isolation with SQLite storage. Each session gets its own lightweight database that can handle hundreds of events per second without impacting other sessions. The WebSocket Hibernation API keeps connections alive during idle periods without incurring compute costs.

Data Plane (Modal Sandboxes)

The data plane is where code actually runs. Each session gets an isolated sandbox with a full development environment.

What's in a sandbox:

Debian Linux with common dev tools
Node.js 22, Python 3.12, git, curl
Package managers: npm, pnpm, pip, uv
agent-browser CLI + headless Chrome (for browser automation)
OpenCode (the coding agent)

Why Modal? Modal sandboxes start near-instantly and support filesystem snapshots. This lets us freeze a sandbox's state after setup, then restore it later in seconds instead of re-cloning and reinstalling dependencies.

Clients

Clients are how users interact with sessions. The architecture is client-agnostic—any client that can make HTTP requests and maintain WebSocket connections can participate.

Current clients:

Web: Next.js app with real-time streaming, session management, and settings
Slack: Bot that responds to @mentions, classifies repos, and posts results

All clients see the same session state. Send a prompt from Slack, watch the results on web. This works because state lives in the control plane, not the client.

The Sandbox Lifecycle

Understanding the sandbox lifecycle explains why Open-Inspect can be fast despite running in the cloud.

Fresh Start (No Snapshot)

When you create a session for a repo without an existing snapshot:

┌─────────┐    ┌──────────┐    ┌─────────────┐    ┌─────────────┐    ┌─────────────┐    ┌───────┐
│ Sandbox │───▶│ Git Sync │───▶│ Setup Script│───▶│ Start Script│───▶│ Agent Start │───▶│ Ready │
│ Created │    │ (clone)  │    │ (optional)  │    │ (optional)  │    │ (OpenCode)  │    │       │
└─────────┘    └──────────┘    └─────────────┘    └─────────────┘    └─────────────┘    └───────┘
                                     │                    │
                                     ▼                    ▼
                            .openinspect/setup.sh   .openinspect/start.sh

Sandbox created: Modal spins up a new container from the base image
Git sync: Clones your repository using GitHub App credentials
Setup script: Runs .openinspect/setup.sh for provisioning (if present)
Start script: Runs .openinspect/start.sh for runtime startup (if present)
Agent start: OpenCode server starts and connects back to the control plane
Ready: Sandbox accepts prompts

Restore (From Snapshot)

When restoring from a previous snapshot:

┌─────────────┐    ┌────────────┐    ┌─────────────┐    ┌───────┐
│  Restore    │───▶│ Quick Sync │───▶│ Start Script│───▶│ Ready │
│  Snapshot   │    │ (git pull) │    │ (optional)  │    │       │
└─────────────┘    └────────────┘    └─────────────┘    └───────┘

Restore snapshot: Modal restores the filesystem from a saved image
Quick sync: Pulls latest changes (usually just a few commits)
Start script: Runs .openinspect/start.sh for runtime startup (if present)
Ready: Sandbox is ready almost instantly

Snapshots include installed dependencies, built artifacts, and workspace state. This is why follow-up prompts in an existing session are much faster than the first prompt.

Repo Image Start

When starting from a pre-built repo image:

Incremental git sync: Fast fetch + hard reset to latest branch head
Setup skipped: .openinspect/setup.sh already ran when the image was built
Start script runs: .openinspect/start.sh executes for per-session runtime startup
Ready: Agent starts once runtime hook succeeds

If start.sh exists and fails, startup fails fast instead of continuing with a broken runtime.

When Snapshots Are Taken

After successful prompt completion: Preserves the workspace state
Before sandbox timeout: Saves state before the sandbox shuts down due to inactivity
On explicit save: Can be triggered by the control plane

Sandbox Warming

To minimize perceived latency, sandboxes warm proactively:

When you start typing a prompt, the control plane begins warming a sandbox
By the time you hit enter, the sandbox may already be ready
If restore is fast enough, you won't notice any delay

How Prompts Flow Through the System

Here's what happens when you send a prompt:

┌──────┐   ┌────────┐   ┌───────────────┐   ┌─────────┐   ┌──────────┐
│ User │──▶│ Client │──▶│ Control Plane │──▶│ Sandbox │──▶│ OpenCode │
└──────┘   └────────┘   └───────────────┘   └─────────┘   └──────────┘
              │                 │                              │
              │                 │         Events stream back   │
              │◀────────────────┼◀─────────────────────────────┘
              │                 │
              ▼                 ▼
         Display to        Broadcast to
           user           all clients

Step by Step

You send a prompt via web or Slack
Control plane queues it: The prompt goes to the session's Durable Object and is added to the message queue. If a sandbox isn't running, one is spawned or restored.
Sandbox receives the prompt: Via WebSocket, the control plane sends the prompt to the sandbox along with author information (for commit attribution).
OpenCode processes it: The agent reads files, makes edits, runs commands—whatever the task requires. Each action generates events.
Events stream back: Tool calls, token streams, and status updates flow back through the WebSocket to the control plane.
Control plane broadcasts: Events are stored in the session database and broadcast to all connected clients in real-time.
Artifacts are created: If the agent creates a PR or captures a screenshot, these are stored as artifacts and announced to clients.

Prompt Queuing

If you send a prompt while the agent is still working on a previous one, it's queued:

Prompt 1 (processing) ──▶ Prompt 2 (queued) ──▶ Prompt 3 (queued)

This lets you send follow-up thoughts while the agent works. Prompts are processed in order.

You can also stop the current execution if the agent is going down the wrong path.

The Agent

Open-Inspect uses OpenCode as its coding agent. OpenCode is an open-source agent designed to run as a server, making it ideal for background execution.

What the Agent Can Do

Capability	Description
Read files	Explore the codebase, understand context
Edit files	Make changes, refactor code
Run commands	Execute tests, builds, scripts
Git operations	Commit changes, create branches
Web browsing	Look up documentation, research errors
Visual verification	Use Playwright to check UI changes

How Changes Are Attributed

When the agent makes commits, they're attributed to the user who sent the prompt:

Author: Jane Developer <jane@example.com>
Committer: Open-Inspect <bot@open-inspect.dev>

This ensures your contributions are properly credited in git history.

Creating Pull Requests

When you ask the agent to create a PR:

Agent pushes the branch using GitHub App credentials
Control plane receives the branch name
Control plane creates the PR using your GitHub OAuth token
PR appears as created by you, not a bot

This maintains proper code review workflows—you can't approve your own PRs.

Real-time Events

Sessions stream events to all connected clients via WebSocket.

Event Types

Event	Description
`sandbox_spawning`	Sandbox is being created
`sandbox_ready`	Sandbox is ready to accept prompts
`sandbox_event`	Tool call, token stream, or other agent event
`artifact_created`	PR created, screenshot captured
`presence_update`	User joined or left the session
`session_status`	Session state changed

Multiplayer

Multiple users can connect to the same session:

Presence: See who else is watching
Shared stream: Everyone sees the same events
Attributed prompts: Each prompt is tagged with who sent it
Collaborative: One person can start a task, another can refine it

This makes sessions useful for pair programming, live debugging, or teaching.

Snapshots and Performance

Speed is critical for background agents. If sessions are slow, people won't use them.

The Cold Start Problem

Without optimization, starting a session would require:

Spinning up a container (~5-10s)
Cloning the repository (~10-30s for large repos)
Installing dependencies (~30s-5min)
Starting the agent (~5s)

That's potentially minutes before the agent can start working.

How Snapshots Solve This

Modal's filesystem snapshots let us capture a sandbox's state after setup:

First session:  Clone ─▶ Install/Build ─▶ Start Runtime ─▶ [Snapshot] ─▶ Work
                              (slow)

Later sessions: [Restore Snapshot] ─▶ Quick sync ─▶ Start Runtime ─▶ Work
                     (fast)

The first session for a repo pays the setup cost. Subsequent sessions restore in seconds.

Image Prebuilding

For frequently-used repositories, images can be prebuilt on a schedule:

Clone repo, install dependencies, run initial build
Save as a snapshot
Sessions start from this snapshot, only syncing recent changes

This means even "cold" sessions (no previous snapshot) start from a recent baseline.

Security Model

Open-Inspect is designed for single-tenant deployment where all users are trusted members of the same organization.

Why Single-Tenant?

The system uses a shared GitHub App installation for all git operations. This means:

Any user can access any repository the GitHub App is installed on
There's no per-user repository access validation
The trust boundary is your organization, not individual users

This follows Ramp's original design, which was built for internal use where all employees have access to company repositories.

Token Architecture

Token	Purpose	Scope
GitHub App Token	Clone repos, push commits	All repos where App is installed
User OAuth Token	Create PRs, identify users	Repos the user has access to
Sandbox Auth Token	Authenticate sandbox → control plane	Single session
WebSocket Token	Authenticate client connections	Single session

Repo-Scoped Secrets

You can configure environment variables (API keys, credentials) per repository:

Stored encrypted (AES-256-GCM) in D1 database
Injected into sandboxes at startup
Never exposed to clients (only key names are visible)

Deployment Recommendations

Deploy behind SSO/VPN: Control who can access the web interface
Limit GitHub App scope: Only install on repositories you want accessible
Use "Select repositories": Don't give the App access to all org repos

What's Next

Getting Started: Deploy your own instance
Debugging Playbook: Troubleshoot issues with structured logs

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How Open-Inspect Works

The Background Model

Sessions

Session Lifecycle

What's Stored in a Session

Architecture

Control Plane (Cloudflare Workers)

Data Plane (Modal Sandboxes)

Clients

The Sandbox Lifecycle

Fresh Start (No Snapshot)

Restore (From Snapshot)

Repo Image Start

When Snapshots Are Taken

Sandbox Warming

How Prompts Flow Through the System

Step by Step

Prompt Queuing

The Agent

What the Agent Can Do

How Changes Are Attributed

Creating Pull Requests

Real-time Events

Event Types

Multiplayer

Snapshots and Performance

The Cold Start Problem

How Snapshots Solve This

Image Prebuilding

Security Model

Why Single-Tenant?

Token Architecture

Repo-Scoped Secrets

Deployment Recommendations

What's Next

FilesExpand file tree

HOW_IT_WORKS.md

Latest commit

History

HOW_IT_WORKS.md

File metadata and controls

How Open-Inspect Works

The Background Model

Sessions

Session Lifecycle

What's Stored in a Session

Architecture

Control Plane (Cloudflare Workers)

Data Plane (Modal Sandboxes)

Clients

The Sandbox Lifecycle

Fresh Start (No Snapshot)

Restore (From Snapshot)

Repo Image Start

When Snapshots Are Taken

Sandbox Warming

How Prompts Flow Through the System

Step by Step

Prompt Queuing

The Agent

What the Agent Can Do

How Changes Are Attributed

Creating Pull Requests

Real-time Events

Event Types

Multiplayer

Snapshots and Performance

The Cold Start Problem

How Snapshots Solve This

Image Prebuilding

Security Model

Why Single-Tenant?

Token Architecture

Repo-Scoped Secrets

Deployment Recommendations

What's Next