Resources
Everything behind a Throne verdict.
How to run Throne, how a verdict is made, the security checks it applies, a real report, and why the tools your agents call need to be vetted. One place, in the order you would read it.
Quickstart
Run a public package, repo, or local config. The first run returns the client matrix, security findings, verdict, and a report link.
npx throne run @modelcontextprotocol/server-everything
npx throne run github.com/acme/files-mcp
npx throne run ./mcp.config.json
Accepted targets
| Type | Example | Best for |
|---|---|---|
| npm | @scope/server | published Node MCP servers |
| uvx | uvx mcp-server | Python packages |
| GitHub | github.com/acme/server | source scans before publish |
| config | ./mcp.config.json | custom launch commands |
Minimal config
{
"name": "files-mcp",
"command": "node",
"args": ["dist/server.js"],
"cwd": ".",
"security": {
"root": "./fixtures/workspace",
"failOn": ["high"]
}
}
CI gate
Use the GitHub Action to block merges when the verdict regresses.
- uses: usethrone/throne-ci@v1
with:
target: ./mcp.config.json
api-key: ${{ secrets.THRONE_API_KEY }}
Replay a failing cell
npx throne replay run_4821 \
--client cursor \
--step streaming \
--trace traces/cursor-streaming-timeout.json
Exit codes
| Code | Verdict | Meaning |
|---|---|---|
0 | pass | All requested release gates passed. |
1 | review | Warnings were found and strict mode is enabled. |
2 | fail | A client failure, blocked launch, or high security finding exists. |
how it works
Every verdict comes from a run you can inspect.
Throne does not grade README claims. It starts the artifact, replays calibrated client behavior, records raw traces, scans risk, and seals the evidence. Vendor neutral: no hosting, no IDE, no framework lock-in.
GitHub repo, npm package, uvx command, or JSON config.
Install and launch the server in a single-use sandbox.
Claude Code and Cursor run live today with recorded behavior profiles.
Connect, discover, schema, tools, errors, streaming, resources, parallel, reconnect.
Path, shell, prompt, secret, rate, dependency, and launch heuristics.
Publish scan id, raw trace, timestamp, verdict, and evidence hash.
Client coverage
Throne only shows verdicts from emulators that exist. Planned clients are visible so you understand the roadmap, but they never produce a false pass or fail.
nine compatibility checks
nine compatibility checks
profile pending real-traffic capture
emulation profile not yet calibrated
emulation profile not yet calibrated
Verdict model
Compatibility and security stay separate until the release call. They are never blended into one number.
Ship
Live client checks pass and no release-blocking security finding is present.
Fix or approve
The server runs, but needs credentials, launch arguments, or security review before trust.
Do not release
A real client fails, the server never launches, or a high severity finding is present.
security
The nine checks.
Each scan runs in a disposable Firecracker microVM created for your server and destroyed after the verdict. Heuristic rules are labeled as heuristics; high severity findings are treated as release-blocking until reviewed.
Path traversal
Tool inputs escape declared roots through traversal, symlink, or normalization gaps.
THR-PATH-01Shell boundary
User input reaches spawned processes as shell strings instead of isolated arguments.
THR-EXEC-04Prompt sink
Untrusted tool output enters model context without delimiters or provenance.
THR-PROMPT-07Secret exposure
Tokens or environment values leak into descriptions, logs, traces, or outputs.
THR-SECRET-02Mutation rate
Write, delete, browser, or network tools can be called repeatedly without guardrails.
THR-RATE-09Dependency risk
Shipped packages include known advisories or unsupported dependency majors.
THR-VER-11Handshake failure
The server starts but never completes MCP initialization inside the sandbox.
THR-HS-03Package launch heuristic
Package metadata points to a command unlikely to expose an MCP server entrypoint.
THR-PKG-05Capability scope heuristic
Tool names imply write, shell, or network access without confirmation boundaries.
THR-SCOPE-08Anatomy of a finding
Each finding ships with the offending surface, a reproduction, the observed result, and the fix pattern. Enough evidence to fix it, not just fear it.
evidence
A sample report.
One artifact for engineering, security, and release: client matrix, raw trace, security findings, and a sealed receipt. Open the live, interactive sample to click through the matrix, trace, security, and receipt tabs.
The public registry
Every server Throne executes is recorded publicly: verdict, security state, sealed date, and evidence hash. Browse the records, filter by verdict, and open any one to inspect the proof.
Why this matters
AI agents now act through MCP servers that can read files, call APIs, move data, and touch credentials, and almost none of them have been independently run before someone trusted them. A broken server fails silently for everyone who installed it; an unsafe one becomes a breach vector the moment an agent drives it.