Developers & cloud

Debug CI runners & build servers live

The build is green on your laptop and red in CI. Instead of pushing "debug" commits and squinting at log artifacts, AI Commander lets Claude open a real shell on the actual runner — inspect the workspace, re-run the failing step, clear a poisoned cache — and tell you what differs.

The job

Self-hosted runners and build servers are often the hardest machines to reach: ephemeral, locked down, no inbound SSH. Yet "works on my machine" bugs live precisely in their environment — toolchain versions, env vars, caches, disk. AI Commander gives an AI client a shell right there:

What it looks like

Claude Code — MCP
You: the build on runner-3 fails at the test step — reproduce it and find why Claude: > cd /actions-runner/_work/app && npm ci npm warn: lockfile out of sync > node -v v18.19.0 (laptop: v20) Claude: The runner is on Node 18 while you build on Node 20, and the lockfile is out of sync — that's the test failure. Bump the runner's Node or pin it in CI.

Why AI Commander for CI debugging

AI CommanderDebug commitsSSH-into-runnerAI sandbox
The real runner stateslow loopfresh env
No inbound port
Interactive✗ commit→wait
AI client drives it✓ MCPSDK

Set it up

On the runner / build host (Linux), install the agent:

curl -fsSL https://aicommander.dev/install | sudo bash

Connect your AI client, quote the session code (or alias runner-3), and start debugging on the real thing.

FAQ

How do I debug a flaky build on a self-hosted runner?
Install the agent on the runner and ask your AI client to inspect the workspace, re-run the failing step, and read the output by quoting the runner's session code. You debug on the exact machine the build ran on, not from log artifacts.
Can I get a shell on a build server with no inbound SSH?
Yes. The agent connects outbound, so the runner needs no open ports. Your AI client runs commands through the relay — ideal for ephemeral or locked-down runners.
How is this different from an AI cloud sandbox?
A sandbox gives the AI a fresh, throwaway environment. AI Commander gives it a shell on your real runner — with your caches, toolchain, and the failing state — which is what reproducing a build problem requires.

Debug where the build actually ran

Install the agent on your runner and let Claude reproduce the failure on the real machine.