BS1 active: Docker runtime work resumed under the amended packet.
BS0 passed its independent review and is approved. BS1 is active under an amended sequence packet: the Docker engine has been installed on the BS1 fixture, and installing and testing the OpenClaw container-instance runtime is the next step, pending the reviewer's approval of the amended packet.
The team has just finished installing the core container engine on the test system; the next phase of work proceeds once the reviewer approves the amended packet. The previous phase, which set up a basic test environment, was reviewed and approved. The coming phase builds out the control system that will keep everything running smoothly and monitor the overall health of the fleet.
This build is roughly one-third of the way through its planned sequence. Before this, the team set up the initial test environment from scratch. After this phase wraps up, they will let operators add new customers and manage their security credentials, run a controlled test of the data ingestion system, expand to support more connection types, and finally prepare everything for upgrades, backups, and disaster recovery. The next major milestone is reviewer approval to continue the control-system work.
Roadmap
build-sequence-0 Fresh Provisioning Approved
Fresh-provision one OpenClaw Instance for one synthetic Org (a test organization).
build-sequence-1 Stable Control Path And Fleet Watchdog Active
Durable control path and fleet health model: keep control operations running reliably through disruptions, and monitor the condition of the fleet to catch problems early.
Objective
Replace or augment the slice-0 managed SSH local-forward control path with a stable reverse-connectivity path (AW Gateway, Cloudflare's cloudflared, or equivalent), then strengthen recurring Instance health checks across the fleet model.
What it will deliver
- AW Gateway/cloudflared or equivalent stable reverse connector, linking the internal network to external services without opening inbound firewall ports.
- Gateway WebSocket reachability through the intended steady-state path.
- Status freshness model: current (up to date), stale (outdated), unreachable (cannot connect), degraded (working with reduced capability), unknown (no data).
- Recurring Watchdog checks that catch problems early.
- Connection loss classification, so the team can quickly tell whether a drop was a client issue, a server problem, or a network infrastructure failure.
- Alert/event hooks that fire on status changes, keeping the team informed without manual polling.
Won't do (yet)
- No Spawn ingestion.
- No real customer systems: the fixture runs on separate infrastructure with no customer data and no production connectivity.
- No full Works Agent.
- No channel onboarding.
- No billing: usage is not recorded or charged.
- No production multi-region work.
build-sequence-2 Org Onboarding And Credential Collection Draft
Operator-driven onboarding and credential collection, using synthetic data only; no real customers.
Objective
Turn synthetic Org setup into an operator-driven onboarding workflow for fresh provisioning, including credential collection design and first-run validation, without touching real customers. Operators follow a guided checklist that collects required credentials through a secure form, validates each one against a sandbox fixture that mirrors production but holds no real customer data, and ends with an operator checkpoint confirming that all validation passed before the Org is marked ready.
What it will deliver
- Operator creates a new Org onboarding record to begin setup.
- Onboarding state machine: draft, credentials pending, validation pending, ready to provision, provisioning, verified, blocked.
- Credential collection UX that stores only SecretRefs (references to secrets held elsewhere) or fixture equivalents, keeping raw secrets out of the application's own database.
- Fixture-to-real credential transition design: a clear, safe plan for the eventual handoff from test credentials to production credentials.
- Org handoff checklist.
- Policy/baseline selection: choosing which standard settings and rules each Org is provisioned with.
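The onboarding state machine above can be sketched as a transition table. The packet names only the states, so the allowed transitions here are assumptions for illustration:

```python
from enum import Enum, auto

class OnboardingState(Enum):
    DRAFT = auto()
    CREDENTIALS_PENDING = auto()
    VALIDATION_PENDING = auto()
    READY_TO_PROVISION = auto()
    PROVISIONING = auto()
    VERIFIED = auto()
    BLOCKED = auto()

# Assumed forward transitions; BLOCKED is reachable from any in-flight state.
TRANSITIONS = {
    OnboardingState.DRAFT: {OnboardingState.CREDENTIALS_PENDING},
    OnboardingState.CREDENTIALS_PENDING: {OnboardingState.VALIDATION_PENDING, OnboardingState.BLOCKED},
    OnboardingState.VALIDATION_PENDING: {OnboardingState.READY_TO_PROVISION, OnboardingState.BLOCKED},
    OnboardingState.READY_TO_PROVISION: {OnboardingState.PROVISIONING, OnboardingState.BLOCKED},
    OnboardingState.PROVISIONING: {OnboardingState.VERIFIED, OnboardingState.BLOCKED},
    OnboardingState.VERIFIED: set(),
    OnboardingState.BLOCKED: set(),
}

def advance(state: OnboardingState, target: OnboardingState) -> OnboardingState:
    """Move an Org onboarding record to `target`, rejecting illegal jumps."""
    if target not in TRANSITIONS[state]:
        raise ValueError(f"illegal transition {state.name} -> {target.name}")
    return target
```

Encoding the table explicitly keeps illegal jumps (e.g. draft straight to verified) out of the workflow by construction.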
Won't do (yet)
- No real customer onboarding.
- No real provider API keys unless separately approved for a controlled non-production environment.
- No Spawn ingestion.
- No full billing.
- No Works Agent.
- No broad channel parity.
build-sequence-3 Spawn Ingestion Dry Run Draft
Controlled Spawn ingestion dry run with fixture systems only: proving the pipeline on sample data before any real information is processed.
Objective
Ingest a controlled Spawn-like fixture and prove discovery, classification, unwind planning, and evidence generation without touching real customer systems.
What it will deliver
- Controlled Spawn-like fixture inventory.
- Existing OpenClaw discovery.
- Agent/binding/session/channel/plugin/task inventory, showing what is running and how resources are used.
- Tenant/Org classification workflow.
- Observe -> Managed -> Authoritative state model: first observe what exists, then reconcile it toward a desired configuration, then enforce that configuration as the source of truth.
- Unwind plan with no destructive execution: changes can be reversed with data and configuration intact.
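The Observe -> Managed -> Authoritative progression can be sketched as a one-way promotion. The no-skipping rule is an assumption the model implies but this excerpt does not state:

```python
from enum import IntEnum

class ControlState(IntEnum):
    OBSERVE = 1        # read-only: record what currently exists
    MANAGED = 2        # reconcile toward a desired configuration
    AUTHORITATIVE = 3  # desired configuration is the source of truth

def promote(state: ControlState) -> ControlState:
    """Advance one level at a time; never jump straight from OBSERVE to AUTHORITATIVE."""
    if state is ControlState.AUTHORITATIVE:
        raise ValueError("already authoritative")
    return ControlState(state + 1)
```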
Won't do (yet)
- No real customer Spawn systems; fixtures only.
- No destructive migration.
- No automatic cutover.
- No credential reuse outside approved fixture/vault paths.
- No assumption that legacy state is trustworthy.
- No channel parity beyond fixture classification.
build-sequence-4 Channel Onboarding Parity Draft
Controlled channel onboarding workflows.
Objective
Add controlled channel setup workflows for prioritized channels while preserving Org-owned credentials, auditability, and redaction.
What it will deliver
- Prioritized channel onboarding flows, likely split by channel.
- Fixture credential/token handling per channel, with each channel's credentials managed and refreshed independently.
- Webhook endpoint registration or fixture equivalent: registering the URL where the system delivers event notifications.
- Channel status probes that regularly verify each channel is responding.
- Channel-specific audit events recording who did what, and when, per channel.
- Channel-specific redaction rules.
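A channel status probe of the kind listed above might take roughly this shape; the `Channel` record and the ok/failed report format are illustrative assumptions:

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class Channel:
    name: str
    probe: Callable[[], bool]  # returns True when a synthetic message round-trips

def probe_channels(channels: list[Channel]) -> dict[str, str]:
    """Send a synthetic probe through each channel and report ok/failed per channel."""
    report: dict[str, str] = {}
    for ch in channels:
        try:
            report[ch.name] = "ok" if ch.probe() else "failed"
        except Exception:
            # A probe that raises counts as a failure, not a crash of the watchdog.
            report[ch.name] = "failed"
    return report
```

The report feeds the same status/alert hooks as Instance health checks, so a silent channel surfaces the same way as an unreachable Instance.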
Won't do (yet)
- No all-channel parity in one slice unless explicitly split and approved.
- No real production customer channels.
- No platform-wide channel credentials; each channel carries its own.
- No unsupported channel automation hidden behind manual steps.
- No Works Agent behavior expansion.
build-sequence-5 Operations, Upgrades, And DR Hooks Draft
Upgrade, rollback, backup, disaster recovery (DR), and operations readiness.
Objective
Formalize AW operational readiness: roles, upgrade policy, rollback, backup/restore hooks, disaster recovery runbooks, and incident evidence.
What it will deliver
- MSP role matrix documenting who performs, owns, and approves each operational task.
- Upgrade canary flow: changes roll out to a small slice of the fleet before expanding.
- OpenClaw compatibility check before an upgrade proceeds.
- Rollback policy and dry-run: simulate a deployment without making changes, and define how to revert quickly if a real deployment fails.
- Backup inventory and restore hook definitions.
- DR runbook for control-plane failure and Instance-host failure.
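The rollback dry-run in the list above could look like this sketch; the step shape and report format are hypothetical, not the locked policy:

```python
from dataclasses import dataclass
from typing import Callable, Optional

@dataclass
class UpgradeStep:
    description: str
    revert: Optional[Callable[[], None]] = None  # undo action for this step

def plan_rollback(steps: list[UpgradeStep], dry_run: bool = True) -> list[str]:
    """Walk the upgrade steps in reverse order. In dry-run mode, only report
    what would be reverted; nothing executes."""
    actions = [f"revert: {s.description}" for s in reversed(steps)]
    if not dry_run:
        for s in reversed(steps):
            if s.revert is not None:
                s.revert()
    return actions
```

Running the dry run as part of the canary flow yields reviewable evidence of the revert order before anything touches a real Instance.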
Won't do (yet)
- No full production launch unless separately approved.
- No real customer DR exercise.
- No multi-region implementation unless separately scoped.
- No billing or marketing.
- No automatic upgrade across a real fleet.
Decisions locked, in writing, in the repo
Locked architectural questions
Foundational answers agreed during the initial design dialogue. Not up for renegotiation unless new evidence forces it.
Q1 OpenClaw Deployment Topology
OpenClaw runs on a variety of systems—from individual computers to company-owned servers in data centers—and typically operates multiple separate instances at larger organizations to keep different agents from interfering with each other, which is a core principle built into the system from the start.
Q2 Capabilities Manifest for Implementing Agent (Agent A)
Agent A (Claude Code with full capabilities) will set up OpenClaw on the local system, run its complete startup process, gather documentation, and catalog the repository, then report only findings it can trace back to actual files or archived records.
Q3 Repository Context and Rebuild Scope
The team will rebuild Spawn's foundation from scratch while keeping all current features intact, releasing pieces incrementally as backend and interface improvements land together. The original creator will verify that features remain complete while the interface design improves where it makes sense. This pre-production work must set the system up to capture operational data as a side effect, so future products (billing, analytics, marketing tools) can build on that foundation later.
Q4 Follow-on Receiver (Who Implements)
Each build sequence item gets its own dedicated coding session: one approver reviews and approves the plan before implementation begins, a developer writes and tests the code, and the finished work merges into the main codebase.
Q5 Budget and Stopping Conditions
The work will use up to 500 tool calls (requests to access files or systems) spread across the brief construction. Key documents get up to 100 calls each, supporting materials 50, and informational files 25. The team will run a quality checklist three times, compress the context to 70% of its original size, and bring in an independent reviewer from a fresh session to approve early completion, with results saved to a verdict file and at most three review requests allowed.
Q6 Tenancy Model (went through several revisions before locking)
The system organizes users and infrastructure into a hierarchy: one Service Provider manages multiple Customers (Orgs), each Customer may have multiple Tenants, each Instance (a physical gateway) belongs to exactly one Customer, and Agents run within each Instance. Each Instance serves only one Customer, and Tenants within the same Customer are completely isolated from each other; these rules are permanent. Users are identified globally and can work for multiple Customers; Service Provider staff can view any Customer's setup for support purposes, and when a Customer is migrated into the system, the person who performed the migration loses access to that Customer once the move completes.
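One of the locked invariants (each Instance belongs to exactly one Customer) can be expressed as a small check; the record shape here is an assumption for illustration only:

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Instance:
    instance_id: str
    org_id: str  # exactly one owning Org, fixed for the Instance's lifetime

def check_single_owner(instances: list[Instance]) -> dict[str, str]:
    """Verify the locked invariant: no Instance is claimed by two Orgs."""
    owners: dict[str, str] = {}
    for inst in instances:
        prior = owners.setdefault(inst.instance_id, inst.org_id)
        if prior != inst.org_id:
            raise ValueError(f"Instance {inst.instance_id} claimed by Orgs {prior} and {inst.org_id}")
    return owners
```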
Q7 Compliance and Security Architecture
The system includes a built-in AI assistant that helps staff manage the control plane, the central system where configuration and access decisions happen. The assistant operates separately from the main application platform, running in a different infrastructure location for each organization to maintain isolation. It respects user roles and organization boundaries, restricting what each person can see and do based on their permissions.
Recent decisions (14)
Architecture decisions logged as they're made. Newest first.
2026-04-26 Linux OpenClaw runtime is containerized architecture
OpenClaw runtime must run in containers on Linux systems, with host-native installations allowed only on macOS or for temporary administrative tools—never for production use. To validate readiness on Linux, teams must gather evidence from the containerized OpenClaw runtime running through Docker, with the Gateway exposed only through a loopback connection on the Debian host for managed SSH access.
- Decision
- Linux OpenClaw runtime Instances must run as Docker/containerized runtimes. Host-native OpenClaw runtime is allowed only as a macOS host exception. Host-native Linux OpenClaw CLI installs may exist only as exploratory/admin tooling and must not be used as production runtime, Gateway supervisor, or Linux readiness pass evidence.
- Reason
- BS1 must prove the durable Anthropy Works runtime shape for Linux hosts. Using host-native Linux OpenClaw Gateway or host-native Linux CLI doctor/status output would validate the wrong operating model and could hide container supervisor, volume/state, network binding, and cleanup risks.
- Impact
- BS1 runtime work is paused until resumed under the amended packet. Linux readiness evidence must come from the Docker/containerized OpenClaw runtime, with Gateway exposure constrained to a Debian-host loopback publish path for the managed SSH forward. The accidental exploratory host-native Linux CLI install on `s187-u007.manifest0.net` was cleaned up and documented in BS1 evidence.
- Review status
- active for BS1.
- Sources
- User architecture clarification on 2026-04-26; amended `docs/aw-handoff/06-agent-team-execution/build-sequence-queue/build-sequence-1-stable-control-path-and-fleet-watchdog/SEQUENCE-PACKET.md`.
- Rollback path
- Do not reintroduce host-native Linux OpenClaw runtime without a new architecture decision and explicit user approval. If OpenClaw container operation cannot satisfy BS1 gates, stop and amend the packet rather than falling back to host-native Linux runtime.
2026-04-26 AW deployment architecture: multi-tenant reverse-proxy (HetzClaw pattern), Traefik selected architecture
The standard setup uses a reverse-proxy server (Traefik) on each host to direct incoming requests to the correct Org based on hostname, with each Org running in its own isolated container stack joined to a shared network for discovery. This eliminates manual port management and any separate registry: the proxy automatically finds each Org's containers through configuration labels.
- Decision
- The authoritative AW deployment architecture for hosting Linux containerized OpenClaw runtime Instances is the **HetzClaw multi-tenant pattern**:
  - **One reverse-proxy stack per host** (Traefik plus a `tecnativa/docker-socket-proxy` hardening layer). Listens externally and routes to per-Org gateways by Host header.
  - **One compose stack per Org Instance.** Each Org's OpenClaw gateway is a separate compose stack at `/var/lib/aw/orgs/<orgid>/`, with its own per-Org token, per-Org config volume, per-Org workspace volume, and per-Org Traefik routing labels.
  - **Shared `aw-proxy` Docker bridge network.** Both the proxy stack and every Org stack join this network. The proxy auto-discovers each Org via Docker labels; no manual registry, no port allocation.
  - **No host-side port publish for any Or
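Under stated assumptions (the hostname scheme, service name, and volume layout below are illustrative placeholders, not the locked files), a per-Org compose stack following this pattern might look like:

```yaml
# /var/lib/aw/orgs/<orgid>/docker-compose.yml (illustrative sketch)
services:
  gateway:
    image: ghcr.io/openclaw/openclaw:<pinned-version>
    networks: [aw-proxy]
    volumes:
      - ./config:/config        # per-Org config volume
      - ./workspace:/workspace  # per-Org workspace volume
    labels:
      - "traefik.enable=true"
      # Routed by Host header; no host-side port publish on the Org stack.
      - "traefik.http.routers.<orgid>-gateway.rule=Host(`<orgid>.example.net`)"

networks:
  aw-proxy:
    external: true  # shared bridge network also joined by the proxy stack
```

Because routing is driven entirely by labels on the shared network, adding an Org is just adding a stack; nothing on the host or in the proxy needs manual editing.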
2026-04-26 AW naming convention for tenants, Instances, stacks, and artifacts architecture
A single naming standard now applies to every managed component, from customer organizations and server instances to services, containers, and configuration files. Under the standard, every component's name can be predictably recreated from a few key identifiers: the organization, the component's role, when it was set up, and which service it belongs to. Names are human-readable, collision-free, and stable across a component's lifetime. The standard takes effect immediately for the current build cycle and all future deployments unless amended by an official decision.
- Decision
- This document fixes the naming convention for every AW-managed artifact: tenants (Orgs), Instances, compose stacks, services, containers, hostnames, filesystem paths, secrets, networks, Traefik routing labels, systemd units, and Docker labels. It applies to BS1 immediately and to all subsequent Build Sequences and production deployments unless amended by a new dated decision. The core rule: **every artifact's name is deterministically derivable from a small set of identifiers** — Org slug, role, provisioning timestamp, service-within-stack — and those identifiers are themselves chosen to be human-legible, collision-free, and stable across the artifact's lifetime.
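The derivability property (every name recomputable from Org slug, role, provisioning timestamp, and service) can be illustrated with a sketch. The separator and field order here are assumptions; the locked convention's exact format lives in the decision document:

```python
import re

def derive_name(org_slug: str, role: str, provisioned: str, service: str) -> str:
    """Deterministically derive an artifact name from the four identifiers.

    Illustrative only: shows that the same inputs always yield the same name
    and that unsafe identifiers are rejected up front.
    """
    parts = [org_slug, role, provisioned, service]
    for p in parts:
        if not re.fullmatch(r"[a-z0-9-]+", p):
            raise ValueError(f"identifier {p!r} is not slug-safe")
    return "-".join(parts)
```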
2026-04-26 Debian 13 (trixie) accepted as BS1 P2 production-like fixture release governance
Debian 13 (trixie) is accepted as the BS1 P2 production-like fixture release, so teams can test software in conditions that closely match real production setups.
2026-04-26 OpenClaw Linux container runtime: `ghcr.io/openclaw/openclaw` is the authoritative artifact source architecture
When running OpenClaw on Linux in containers at Anthropy Works, always pull the official runtime from `ghcr.io/openclaw/openclaw`, maintained by the OpenClaw team; do not substitute custom-built images for production work without explicit approval. For macOS hosts or one-off command-line tasks, the `npm install openclaw` path remains the standard approach.
- Decision
- For all Linux containerized OpenClaw runtime work in Anthropy Works, the **authoritative source of the OpenClaw runtime container artifact is the OpenClaw maintainers' own GitHub Container Registry path**:

  ```
  ghcr.io/openclaw/openclaw
  ```

  Henceforth:
  - Linux OpenClaw runtime Instances pull from `ghcr.io/openclaw/openclaw:<pinned-version>`.
  - Custom-built OpenClaw runtime images are not authoritative for production evidence and must not be substituted for the upstream image without a new architecture decision.
  - The `npm install openclaw` install-cli path remains authoritative for **macOS** host-native runtime (where the container boundary is exempt by the 2026-04-26 Linux-container decision) and for any non-runtime workflow such as one-shot CLI use, but is not the authoritative path for *
2026-04-25 Run workspace isolation operational
Execute the Codex Build Sequence 0 run in a separate workspace that stores all outputs under `runs/codex-bs0/` instead of the master architecture root. This keeps Codex results from mixing with other build runs: the application code lives in `runs/codex-bs0/app/` and all review materials under `runs/codex-bs0/`.
- Decision
- Execute the Codex Build Sequence 0 run under `runs/codex-bs0/` instead of the master architecture root.
- Reason
- Keeps Codex outputs isolated from any Claude Code build-off run.
- Impact
- Implementation repo lives at `runs/codex-bs0/app/`; evidence/review paths live under `runs/codex-bs0/`.
- Review status
- pending orchestrator close review.
- Sources
- `DUAL-ORCHESTRATOR-RUNBOOK.md`, `RUN-ISOLATION-MANIFEST-CODEX.md`.
- Rollback path
- Move or archive the run root after close; do not merge outputs into master without review.
2026-04-25 Local bootstrap verification while pnpm is unavailable operational
When the package manager pnpm is not yet installed, run the initial verification checks using Node's built-in TypeScript support instead of waiting for pnpm, then install pnpm afterward so the pinned testing-framework and dependency versions in `package.json` remain authoritative. The first setup step may be declared complete only after pnpm is confirmed available or the environment gap has been resolved.
- Decision
- Preserve the pinned `pnpm`/Vitest/Playwright/Next stack in `package.json`, but use Node native TypeScript tests for the first local bootstrap check.
- Reason
- Current host has Node but no `pnpm`; switching package managers would violate the sequence pin.
- Impact
- Sequence 0 cannot close until `pnpm` path is installed and evidenced or the environment gap is resolved.
- Review status
- superseded by the local bootstrap green decision below; this was not a closure waiver.
- Sources
- `BUILD-SEQUENCE-FRESH-PROVISIONING.md` stack pins.
- Rollback path
- Install `pnpm`, run pinned test/build/e2e commands, and replace local bootstrap evidence with final evidence.
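The bootstrap gate described above amounts to a simple availability check; a minimal sketch (the fallback command named in the comment is an example, not a mandated invocation):

```shell
# Prefer the pinned pnpm path; otherwise fall back to Node's native
# test runner for the first local bootstrap check only.
if command -v pnpm >/dev/null 2>&1; then
  BOOTSTRAP_MODE="pnpm-pinned"   # run the pinned Vitest/Playwright suite
else
  BOOTSTRAP_MODE="node-native"   # e.g. `node --test` with type stripping
fi
echo "bootstrap mode: ${BOOTSTRAP_MODE}"
```

Either branch leaves the `package.json` pins untouched, which is the point: the fallback changes how the first check runs, not what the sequence requires for closure.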
2026-04-25 Local bootstrap green with patched dependency set operational
The local development environment now runs successfully with the patched dependency set and a full test pass: the build uses the pinned software versions and generates documentation that reviewers can examine. The remaining work requires access to remote deployment infrastructure (SSH, OpenClaw, and Gateway systems) rather than local tools.
- Decision
- Treat the local scaffold as bootstrap-green after installing workspace-local `pnpm@10.10.0`, resolving the patched dependency lockfile, and passing Vitest, TypeScript, static scan, and Next production build.
- Reason
- The local build no longer depends on the earlier Node-native fallback; it now exercises the pinned stack and produces reviewer-readable evidence.
- Impact
- Remaining blockers are host-level SSH/OpenClaw/Gateway evidence, not local toolchain availability.
- Review status
- accepted as bootstrap status only; Sequence 0 remains open.
- Sources
- `runs/codex-bs0/architecture-rebuild-evidence/build-sequence-0/codex-bs0/manifest.json`.
- Rollback path
- If later host evidence forces code changes, rerun the local verification bundle and update this decision with the new commit.
2026-04-25 Next.js security patch pin security
The team needs to update Next.js from version 15.3.1 to 15.3.8 in Build Sequence 0 to fix a security vulnerability in React Server Components. After making this change, the hosting preparation step must run dependency installation again to ensure the lockfile pulls in the patched version.
- Decision
- Update the Build Sequence 0 Next.js pin from `15.3.1` to `15.3.8`.
- Reason
- `next@15.3.1` is affected by the React Server Components/Next.js security advisories, including CVE-2025-66478. The Next.js advisory of December 11, 2025 lists `15.3.8` as the patched version for the 15.3.x line.
- Impact
- Host prep must rerun dependency install after this change so the lockfile resolves the patched Next.js version.
- Review status
- accepted as security patch.
- Sources
- https://nextjs.org/blog/CVE-2025-66478 and https://nextjs.org/blog/security-update-2025-12-11.
- Rollback path
- Do not roll back to vulnerable 15.3.1. Future upgrades may move to a newer patched Next line after security review.
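The pin change itself is a one-line edit. A minimal `package.json` fragment (all other fields elided; surrounding structure is illustrative):

```json
{
  "dependencies": {
    "next": "15.3.8"
  }
}
```

As the Impact bullet notes, the edit is not effective until dependency install reruns and the lockfile actually resolves `15.3.8`.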
2026-04-25 OpenClaw install evidence source operational
When installing OpenClaw, use the official installer provided by the vendor and record which version is actually installed at execution time, rather than assuming an older pinned release. Before confirming the installation is complete, check whether the commands that ran match what was expected and note any differences.
- Decision
- Use OpenClaw's official installer path for host-level Sequence 0 evidence and record the resolved installed version at execution time instead of pinning an old OpenClaw release in the app scaffold.
- Reason
- OpenClaw's own release policy is beta-first/stable-promoted, and the execution docs require Packet 5 to resolve and record the current stable version during the run.
- Impact
- Host evidence must cite the installer/version actually used and classify any command drift before claiming acceptance.
- Review status
- accepted as host-evidence policy.
- Sources
- https://docs.openclaw.ai/install and https://docs.openclaw.ai/RELEASING.
- Rollback path
- If the current installer or latest stable cannot satisfy a Sequence 0 acceptance command, stop and classify the drift under `DRIFT-CONTROL.md`.
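Recording the resolved version at execution time can be sketched as below. The `openclaw --version` invocation is an assumption about the CLI surface, not a documented command; the fallback keeps the sketch runnable on hosts where OpenClaw is absent:

```shell
# Record the version the installer actually resolved, rather than a
# pre-assumed pin. Paths are illustrative.
mkdir -p evidence
RESOLVED="$(openclaw --version 2>/dev/null || echo "unknown")"
echo "installed-version: ${RESOLVED}" >> evidence/install-record.log
```

The written record is what lets reviewers classify drift between the expected and actual install commands before acceptance is claimed.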
2026-04-25 Host fixture and Gateway evidence passed operational
The testing infrastructure confirms that a disposable Debian 12 container running a real SSH server meets the requirements for the initial build phase, and the OpenClaw Gateway should run as a foreground process rather than as a background service since the container lacks systemd support. The current test validates that SSH access, OpenClaw installation, Gateway startup, and basic communication all work correctly, but production-environment behavior still needs to be verified separately on infrastructure that matches the actual production setup.
- Decision
- Treat the disposable Debian 12 Docker SSH container as the valid Build Sequence 0 SSH fixture target and use OpenClaw foreground Gateway mode inside that container.
- Reason
- The fixture is a real SSH daemon reachable over TCP and satisfies the approved fixture class. The container does not provide systemd, and OpenClaw's own runtime output instructs container users to run the Gateway in the foreground instead of the systemd service path.
- Impact
- BS0 host evidence proves SSH execution, OpenClaw install/readiness, Gateway startup, and a WebSocket read probe. Production supervisor behavior remains outside this container fixture proof and must be covered on a production-like host in a later sequence.
- Review status
- ready for independent review.
- Sources
- `runs/codex-bs0/architecture-rebuild-evidence/build-sequence-0/codex-bs0/host-sequence0-summary.md`, `gateway/foreground-start.log`, `gateway/websocket-read-probe.log`.
- Rollback path
- Re-run host evidence against a non-container Debian SSH host with systemd when production supervisor proof is required.
2026-04-25 Gateway auth material uses SecretRef plan security
Apply the `gateway.auth.token` and `gateway.remote.token` credentials through OpenClaw's `secrets apply` command (which uses the SecretRef plan) rather than entering them as plaintext with `openclaw config set`. The secrets path stores credentials safely and projects the `OPENCLAW_GATEWAY_TOKEN` environment variable at runtime, which keeps the tokens within security requirements and passes OpenClaw's audit checks.
- Decision
- Apply `gateway.auth.token` and `gateway.remote.token` through an OpenClaw `secrets apply` SecretRef plan using `OPENCLAW_GATEWAY_TOKEN` as the fixture runtime source, rather than writing token plaintext through `openclaw config set`.
- Reason
- The first Gateway evidence attempt produced a working WebSocket path but failed `openclaw secrets audit --check` because token fields were stored as plaintext. The SecretRef plan path is the documented OpenClaw credential projection surface and satisfies the AW no-plaintext credential boundary.
- Impact
- Secrets audit is clean with plaintext `0`, unresolved `0`, shadowed `0`, and legacy `0`.
- Review status
- ready for independent review.
- Sources
- `runs/codex-bs0/host-prep/openclaw-gateway-secretref-plan.json`, `openclaw/gateway-secretref-plan-apply.log`, `openclaw/secrets-audit.log`.
- Rollback path
- Do not revert to plaintext config. If OpenClaw changes the SecretRef plan schema, update the plan and re-run the audit.
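For illustration only, a SecretRef plan projecting the gateway tokens from the fixture environment variable might take a shape like the following. The field names here are assumptions, not OpenClaw's actual schema; the authoritative plan is the `openclaw-gateway-secretref-plan.json` file cited above:

```json
{
  "secrets": [
    {
      "path": "gateway.auth.token",
      "ref": { "source": "env", "name": "OPENCLAW_GATEWAY_TOKEN" }
    },
    {
      "path": "gateway.remote.token",
      "ref": { "source": "env", "name": "OPENCLAW_GATEWAY_TOKEN" }
    }
  ]
}
```

Whatever the real schema, the invariant this decision enforces is that no token value appears in config plaintext, which is exactly what `openclaw secrets audit --check` verifies.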
2026-04-25 Build Sequence 0 evidence ready for independent review governance
The initial set of evidence has passed all automated checks (application functionality, user interface testing, infrastructure verification, encrypted communication validation, and security credential review) and is now available for an independent reviewer to examine—but the work cannot be marked as fully complete until that reviewer approves it. No subsequent work phases should proceed or be marked finished until the reviewer's decision is received and incorporated.
- Decision
- Mark the Codex BS0 evidence bundle as `ready-for-independent-review`, not complete.
- Reason
- App checks, UI E2E, host evidence, Gateway WebSocket proof, and secrets audit are green. The approved execution model still requires independent review before the slice can be marked done.
- Impact
- No further build work should be claimed complete until reviewer verdict is received. Future sequence packets remain draft/not executable until BS0 review outcome is incorporated.
- Review status
- pending independent reviewer.
- Sources
- `runs/codex-bs0/architecture-rebuild-evidence/build-sequence-0/codex-bs0/manifest.json`, `REVIEWER-HANDOFF.md`.
- Rollback path
- If reviewer finds blockers, reopen BS0, patch, re-run evidence, and update this decision.
2026-04-25 Build Sequence 0 approved and closed governance
Build Sequence 0 has been approved and closed after an independent reviewer confirmed no blocking issues were found. The seven remaining findings are scheduled for Build Sequence 1, and the team should now move forward with preparing and approving the next build packet unless a regression requires returning to this sequence.
- Decision
- Mark the Codex Build Sequence 0 run complete after independent reviewer approval.
- Reason
- The isolated reviewer returned `Verdict: approve` with no blocking findings and explicitly stated that Build Sequence 0 can be marked complete. The seven findings are non-blocking carry-forward requirements for Build Sequence 1, not BS0 rework.
- Impact
- BS0 is closed. Build agents must not continue adding BS0 scope unless a regression is discovered. The next executable work is preparing and approving the Build Sequence 1 packet.
- Review status
- approved and closed.
- Sources
- `runs/codex-bs0/architecture-rebuild-review/build-sequence-0/isolated-review-workspace/outputs/reviewer-verdict.md`, `runs/codex-bs0/architecture-rebuild-evidence/build-sequence-0/codex-bs0/BS0-CLOSEOUT.md`.
- Rollback path
- If a later reviewer or production-host proof finds a BS0 blocker, reopen BS0 with a new dated decision, patch the defect, re-run affected evidence, and request a fresh independent review.
Stable Control Path And Fleet Watchdog: 12 checkpoints recorded
Current gate
- Packet amendment review
- Pass: BS1 packet amendment approved; ready for renewed user approval before Docker runtime proof
- Amended packet committed
- Pass
- Docker / containerized runtime proof
- In progress: Docker engine installed (runs/codex-bs1/architecture-rebuild-evidence/build-sequence-1/codex-bs1/checkpoints/checkpoint-009-docker-engine-install.md); runtime container work not yet started.
Architecture warning
Evidence hygiene
- Redaction status
- applied
- No real customer data
- Pass
- No real provider credentials
- Pass
- No raw secret material exposed
- Pass