Autonomous Closed-Loop Agent Workflows with Signadot & Codex

Prerequisites

Baseline environment: Signadot account, Signadot CLI v1.4.0+, and HotROD baseline (cluster, operator, app) completed via Closed-Loop Verification with Cursor and Signadot or Closed-Loop Microservices Validation with Claude Code & Signadot.
Local connect: signadot local connect is running and healthy before Codex; see Local development.
Toolchain: Go 1.21+, grpcurl, and jq.
Node.js with npx: Used to install Agent Skills (npx skills add …).
Codex: Codex CLI installed and signed in with ChatGPT.

Overview

Testing local microservice changes against a shared Kubernetes cluster often involves a mess of kubectl port-forward commands, brittle mocks, or waiting for CI/CD pipelines just to verify a one-line change.

In this tutorial, we will establish an autonomous closed-loop workflow using Signadot and Codex (the OpenAI CLI coding agent) with two complementary integrations: Signadot MCP, which lets the agent discover clusters, workloads, and sandboxes through the Signadot control plane, and the signadot-validate skill, which drives a repeatable validation loop against real cluster dependencies. You give Codex an objective and a concrete exit condition (the verifier script, scripts/verify-route-seconds.sh) and ask it to keep iterating until that script exits successfully. Beyond generating code, Codex validates its own work against real cluster dependencies, iterating until a concrete test script passes. It will create or reuse a sandbox, run the route service locally, fix either the verification script or the route service code as needed, and retry until the test passes, all without human intervention.

Step 1: Setup Codex MCP

Bridge Codex to the Signadot CLI so the AI agent can control Signadot via natural language in your terminal.

Install Codex

If you haven't already installed Codex:

Install Codex CLI

$npm install -g @openai/codex@latest

added 7 packages in 1m

$codex --version

codex-cli 0.114.0

Install the `signadot-validate` skill

From any directory, install the skill into Codex globally so it is available in any repository you open with Codex:

Install signadot-validate skill

$npx skills add signadot/agent-skills --skill signadot-validate -g -a codex -y

Repository cloned

Found 2 skills

Selected 1 skill: signadot-validate

Installation complete

✓ signadot-validate (copied) → ~/.agents/skills/signadot-validate

Verify the skill in Codex

Restart Codex (or open a new session). Run /skills and select List skills to confirm signadot-validate appears.

Codex Skills menu with List skills option

signadot-validate listed in Codex skills

You should see signadot-validate in the list. Codex will load it automatically when it is relevant to the task.

Add Signadot MCP Server

Configure Codex to use the Signadot MCP server:

Add Signadot MCP to Codex

$codex mcp add signadot -- signadot mcp

Added global MCP server 'signadot'.

This command:

Adds the signadot MCP server to Codex's global configuration
Points to the signadot mcp command

Verify MCP Connection

Verify the Signadot MCP server is registered:

List Codex MCP servers

$codex mcp list

Name Command Args Env Cwd Status Auth

signadot signadot mcp - - enabled Unsupported

Launch Codex

Start Codex for the first time:

Launch Codex

$codex

First-time authentication flow:

The CLI opens your browser for ChatGPT authentication and consent.

Return to your terminal to see the login confirmation:

Codex CLI after browser auth

Welcome to Codex, OpenAI's command-line coding agent

✓ Signed in with your ChatGPT account

Before you start:

Decide how much autonomy you want to grant Codex

For more details see the Codex docs

Codex can make mistakes

Review the code it writes and commands it runs

Uses your plan's rate limits and training data preferences

Press Enter to continue

Press Enter to continue into Codex.

You'll see the main Codex prompt:

To confirm the Signadot MCP tools are available, run /mcp inside Codex:

Codex /mcp view showing all available Signadot tools

You should see the full list of Signadot tools including create_sandbox, list_clusters, list_sandboxes, get_sandbox, and resolve_workload_port among others.

Enable Full Access

Codex runs in a restricted mode by default. Full Access is needed so the agent can call signadot and read configuration outside the repo directory without repeatedly asking for confirmation. Before starting the autonomous workflow, switch Codex to Full Access mode for this session.

Inside Codex, run:

Codex: permissions

$/permissions

Select Full Access:

Then verify Codex can reach local tooling:

Codex: verify local tooling access

$signadot auth status

$signadot cluster list

If these checks fail, revisit Full Access before proceeding.

For detailed setup and CLI reference, see the Codex CLI documentation.

Step 2: The Autonomous Workflow

Prerequisite

Ensure your prerequisites are still healthy before starting this workflow, especially that signadot local connect is running in the background.

We will now use Codex to drive the full write-run-validate-iterate loop via natural language in the terminal.

Navigate to the HotROD Repository

Clone HotROD

$git clone https://github.com/signadot/hotrod

$cd hotrod

Launch Codex in the repo:

Launch Codex in repo

$codex

When prompted, trust the workspace:

The Objective

Our goal is to have Codex modify the route verification flow and prove the change works through a real Signadot sandbox. You add scripts/verify-route-seconds.sh in your clone; that script defines the success criteria.

Codex will:

Create a local-dev sandbox called local-route-dev for the hotrod/route Deployment (or reuse it if it already exists).
Start the route service locally using the sandbox environment.
Run the test script against the sandbox routing key.
Inspect failures, apply a fix (to the verification script or the route service code), and retry until the test passes.

Add the Test Script

Create the following script in your HotROD clone. Upstream HotROD does not ship this file. Codex reads and runs it and treats it as the source of truth for "done".

The verifier will fail on the first run: it expects an etaUnit field that is not returned by the FindRoute response. Codex should diagnose the mismatch and fix the script.

Create verifier script

$mkdir -p scripts

$cat > scripts/verify-route-seconds.sh <<'EOF' #!/usr/bin/env bash set -euo pipefail ROUTING_KEY="${1:?usage: $0 <routing-key>}" RESPONSE=$(grpcurl -plaintext \ -H "baggage: sd-routing-key=${ROUTING_KEY}" \ -d '{"from":"37.7749,-122.4194","to":"37.7849,-122.4094"}' \ route.hotrod.svc:8083 route.RoutesService/FindRoute 2>/dev/null) echo "$RESPONSE" | jq -r '.etaSeconds' > /tmp/eta-seconds echo "$RESPONSE" | jq -r '.etaUnit' > /tmp/eta-unit if grep -qi "second" /tmp/eta-unit && [ "$(cat /tmp/eta-seconds)" = "1" ]; then echo "✓ Verifier passed: etaSeconds=1 and unit=second" exit 0 else echo "✗ Verifier failed: $(cat /tmp/eta-seconds)/$(cat /tmp/eta-unit)" exit 1 fi EOF

$chmod +x scripts/verify-route-seconds.sh

You can commit this script if you like; Codex will read it to understand the success criteria.

Prompt 1: Context and Cleanup

We want a clean slate to avoid state conflicts from previous sessions, and we want Codex to understand the success criteria before it starts running anything.

Check my Signadot setup, clean up any old sandboxes, and read
scripts/verify-route-seconds.sh so you understand the success criteria.

Codex will use the Signadot MCP tools to verify authentication, list your cluster, inspect any existing sandboxes, and read the verifier script.

Codex loading the signadot-validate skill and reading the verifier script

You should see Codex confirm your cluster name, the state of any existing sandboxes, and that it has read and understood the verifier's success criteria.

Prompt 2: The Autonomous Loop

Now give Codex the full objective and hand off control. The key is to provide a deterministic exit condition: the test script, so Codex can iterate without waiting for human feedback at each step.

Create a local-dev sandbox called local-route-dev that maps hotrod/route
(Deployment) port 8083 to localhost:8083 on this devbox. If it already exists,
reuse it instead of creating a duplicate.

Then:

1. Discover the sandbox routing key.
2. Start the local route service with the sandbox env.
3. Run scripts/verify-route-seconds.sh <routing-key>.
4. If the test fails, inspect the error output and logs, decide whether the
 failure is in the route service or in the verifier script, make the minimal
 code change needed, and rerun the test.
5. Stop only when scripts/verify-route-seconds.sh exits successfully and then
 summarize:
 - the sandbox you used,
 - the code change you made,
 - the exact passing command.

Codex will use the Signadot MCP tools to create the sandbox (or reuse it if it already exists), then:

Resolve the sandbox spec and plan the execution path (calling create_sandbox when the sandbox is missing)

Codex creating or reusing the sandbox and planning the execution path

Pull the sandbox environment via signadot sandbox get-env

Codex inspecting sandbox and pulling environment variables

Read the repository to understand how to start the route service locally
Start the local route service on localhost:8083 with the sandbox environment
Run scripts/verify-route-seconds.sh against the sandbox routing key
If the test fails, decide whether the failure is in the route service or in the verifier script, apply the minimal code change needed (service code or script), and retry until the verifier exits successfully

The Agent Action

This is where the closed loop runs. Here is what you will see on your screen.

Codex first resolves the cluster and sandbox state through MCP:

Codex verifying cluster and sandbox state via MCP

If the first test run fails, Codex will analyze the error output, determine the root cause, apply a targeted fix, and rerun the script. During testing, the first failure was in the verification script rather than the route service: the verifier expected an etaUnit field (and a "second" unit string) that does not exist in the actual FindRoute response. Codex diagnosed the schema mismatch, updated the script to validate the real success condition (etaSeconds == 1 when FAST_ROUTE=1 is set), and reran the verification. In other runs, the fix might be in the route service code; the loop is the same.

Codex fixing the verifier script and rerunning the test

Once the verifier passes, Codex will summarize the result:

Codex final success summary with passing command

Codex will tell you the sandbox it used, the exact change it made, and the exact passing command.

Optional follow-up: In a later run you could have Codex modify the route service itself (e.g. in cmd/route/route.go) to return ETA in seconds so that scripts/verify-route-seconds.sh passes. The same loop applies: run the test, fix what fails, retry until the script exits successfully. For this tutorial, Codex's final success summary is the proof of the closed loop: it includes the sandbox used, code change made, and exact passing command.

Troubleshooting

This section lists Codex-specific issues for this workflow. For general Kubernetes, operator, routing, or local-connect setup problems, see Local development and Troubleshooting.

Codex gets stuck looping or hits a token/context limit

If Codex starts repeating the same actions, making no progress, or the session hits a token/context limit, treat it like a normal debugging loop: preserve state, tighten the objective, and continue with smaller steps.

Fix:

Stop the local service (if it is flapping), but keep Local Connect running.
- Leave signadot local connect running.
- If the route service is running locally, stop it with Ctrl+C so you can restart cleanly after a change.

Ask Codex for a state snapshot. For example:

Summarize current state in bullets:
- sandbox name + routing key
- last failing command (exact)
- last stderr/stdout (verbatim)
- files changed
Then propose the single smallest next change to try.

Restart the loop with guardrails. Start a fresh Codex session and paste:
- The state snapshot from step 2
- The current contents of scripts/verify-route-seconds.sh
- The exact failing command/output
Then use a tighter prompt like:
```
You have up to 3 attempts. Each attempt must:
1) run scripts/verify-route-seconds.sh <routing-key>
2) make at most one minimal change
3) rerun the script and show the full output
Stop immediately when it exits 0.
```

Codex cannot access local tooling or MCP behaves inconsistently

If Codex fails to execute shell commands, cannot read local config, or produces different behavior than your normal terminal, its permission mode is likely set to Default.

Fix: Inside Codex, run /permissions and select Full Access before starting the autonomous workflow. This is required for Codex to reliably interact with local tools and MCP-driven commands.

Conclusion

In this tutorial, you have successfully replaced a complex manual testing workflow with an autonomous agent loop that validates its own changes against a real cluster environment. You learned how to:

Use Local Connect and a registered devbox (see Local development) so cluster traffic can reach your laptop.
Keep Local Connect running and confirm routing before handing work to the agent.
Install the signadot-validate skill so Codex has a built-in, repeatable validation loop against real cluster dependencies.
Install Codex MCP to bridge the AI agent with the Signadot CLI.
Enable Full Access so Codex can reliably interact with local tooling and MCP-driven workflows.
Give Codex an objective and a test script and let it create the sandbox, run the service, validate behavior, inspect failures, and iterate until the test passes.

Pairing AI agents with ephemeral sandbox environments means the agent is not just generating code. It is also validating its own work against realistic cluster dependencies without risking the shared environment. This is what accelerates the software development lifecycle: the agent reaches a verified, merge-ready state without a single manual intervention in the loop.

Overview​

Step 1: Setup Codex MCP​

Install Codex​

Install the signadot-validate skill​

Verify the skill in Codex​

Add Signadot MCP Server​

Verify MCP Connection​

Launch Codex​

Enable Full Access​

Step 2: The Autonomous Workflow​

Navigate to the HotROD Repository​

The Objective​

Add the Test Script​

Prompt 1: Context and Cleanup​

Prompt 2: The Autonomous Loop​

The Agent Action​

Troubleshooting​

Conclusion​

See Also​

Overview

Step 1: Setup Codex MCP

Install Codex

Install the `signadot-validate` skill

Verify the skill in Codex

Add Signadot MCP Server

Verify MCP Connection

Launch Codex

Enable Full Access

Step 2: The Autonomous Workflow

Navigate to the HotROD Repository

The Objective

Add the Test Script

Prompt 1: Context and Cleanup

Prompt 2: The Autonomous Loop

The Agent Action

Troubleshooting

Conclusion

See Also