archy/CLAUDE.md
archipelago 57a013bc66 test(gate): make 5× the canonical gate, drop 20x naming
Rename run-20x.sh → run-gate.sh, default ARCHY_ITERATIONS 20→5, and scrub
20× references across CLAUDE.md, the master plan, TESTING.md, app-registry
status, the orchestrator/config doc-comments, and the bats suites. Also add
a minimal fail() helper to mempool.bats so guard failures report cleanly.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
2026-06-22 18:12:41 -04:00

2.9 KiB
Raw Blame History

Archipelago — agent guide

🚩 TOP PRIORITY (until production testing passes)

Read docs/PRODUCTION-MASTER-PLAN.md first. It is the authoritative plan and overrides ad-hoc direction until the production test gate is green. Goal: a world-class, developer-ready app platform where every app is manifest-driven, manifests ship via the signed registry (not OTA disk files), and third-party developers publish apps via an external/decentralized registry — all rootless, secure, robust, and 100%-uptime-capable.

Detailed sub-plans (all linked from the master):

  • App platform / packaging phases + security model → docs/APP-PACKAGING-MIGRATION-PLAN.md
  • Registry-distributed manifests (in progress) → docs/registry-manifest-design.md
  • External/decentralized marketplace for devs → docs/marketplace-protocol.md
  • Current per-app state → docs/app-registry-status-2026-06-21.md
  • Production test gate (exit criterion) → tests/lifecycle/TESTING.md

Invariants (never violate)

  • Rootless Podman only. No rootful, no Docker-socket mounts, no privileged containers unless explicitly approved.
  • No per-app Rust installers / no OS-level reliance. Apps are declarative; the orchestrator owns the lifecycle. install_immich_stack (hardcoded podman run + sudo chown) is the anti-pattern being deleted, not a template.
  • Secrets are manifest-declared (generated_secrets, materialised by container::secrets, 0600/rootless) — never hardcoded, per-app, or logged.
  • Migrations never destroy data — preserve /var/lib/archipelago/<app>, secrets, credentials, ports, and adoption container names; keep a rollback path.
  • Verify on the real node .228 before any tag. (Fleet-wide multinode verification is a separate plan: docs/multinode-testing-plan.md.)

Build / verify

  • Rust workspace root is core/ (no Cargo.toml at repo root). cargo from core/.
  • If a cargo test/build hits rust-lld: undefined hidden symbol, it's incremental-cache corruption — rebuild with CARGO_INCREMENTAL=0.
  • Frontend: neode-ui/npm run build outputs to web/dist/neode-ui/. Grep the built bundle for new strings before shipping (build can silently no-op).
  • App manifests load from disk on nodes at /opt/archipelago/apps/*/manifest.yml (today); the goal is to distribute them via the signed catalog instead.

Production test gate (definition of done)

tests/lifecycle/run-gate.sh green across install / UI / stop / start / restart / reinstall / reboot-survive / archipelago-restart-survive / uninstall — 5× on .228 (ARCHY_ITERATIONS=5). Run the gate ON the node (it uses local podman/systemctl/bitcoin probes), not via RPC from another host. Until green, the master plan is the priority. Multinode testing (.198 + the rest of the fleet) is a SEPARATE plandocs/multinode-testing-plan.md — not part of this single-node gate criterion.