Pome is a digital-twin testing platform for AI agents. Run your agent against a local GitHub-shaped twin with real REST routes and 35 MCP tools, backed by SQLite state you can reset between every run. No GitHub rate limits, no shared sandbox, no flaky network — just deterministic, repeatable evaluations.Documentation Index
Fetch the complete documentation index at: https://docs.pome.sh/llms.txt
Use this file to discover all available pages before exploring further.
Quick Start
Boot your first twin and run an agent against it in under 60 seconds.
Write Scenarios
Define your agent’s task, seed state, and pass/fail criteria in a single Markdown file.
CLI Reference
Full reference for all
pome commands — run, inspect, export, and more.API Reference
Authenticate and interact with the Pome REST API to manage sessions and runs.
How Pome works
Boot a twin
Start a GitHub-shaped twin locally with
docker compose up or provision one via the hosted API. The twin exposes real REST endpoints and MCP tools backed by SQLite.Write a scenario
Describe your agent’s task, seed initial state (repos, issues, labels), and define success criteria — all in a single Markdown file.
Run your agent
Point your agent at the twin’s URL. Pome records every tool call and API request automatically.
Core Concepts
Understand twins, scenarios, sessions, and evaluation before you build.
CI Integration
Drop Pome into GitHub Actions with one step using the official action.
Self-Host
Run the full Pome stack locally with Docker Compose — no account required.
Pricing
Free self-host tier, hosted free tier, and Pro plans starting at $99/mo.