archy

lfg2025/archy

Author	SHA1	Message	Date
archipelago	b701e125b4	fix(update): relax apply rate limit	2026-05-17 23:15:07 -04:00
archipelago	837ba63466	chore: update release lockfile v1.7.63-alpha	2026-05-17 23:03:44 -04:00
archipelago	8191d92bed	chore: release v1.7.63-alpha	2026-05-17 23:03:06 -04:00
archipelago	ae8359da4b	fix(release): rebuild backend artifacts	2026-05-17 22:54:37 -04:00
archipelago	d91b858d9b	chore: release v1.7.62-alpha v1.7.62-alpha	2026-05-17 22:40:36 -04:00
archipelago	19f2125a4d	fix(apps): repair stale nginx proxy manager ports	2026-05-17 22:38:04 -04:00
archipelago	a992abcd06	chore: release v1.7.61-alpha v1.7.61-alpha	2026-05-17 22:13:21 -04:00
archipelago	4d6b4f76af	chore: release v1.7.60-alpha v1.7.60-alpha	2026-05-17 20:45:56 -04:00
archipelago	0a94c0097f	chore: release v1.7.59-alpha v1.7.59-alpha	2026-05-17 19:44:54 -04:00
archipelago	413d50116e	fix(apps): restore mobile and website launching	2026-05-17 19:22:18 -04:00
archipelago	daad50325b	chore(release): require curated release notes	2026-05-17 18:59:12 -04:00
archipelago	e05e356d64	chore: release v1.7.58-alpha v1.7.58-alpha	2026-05-17 18:40:50 -04:00
archipelago	cfb304a001	feat(mesh): add meshtastic serial radio support	2026-05-17 18:07:40 -04:00
archipelago	7804223152	chore: release v1.7.57-alpha v1.7.57-alpha	2026-05-17 17:30:04 -04:00
archipelago	a322b04021	fix(iso): avoid polkit in live debootstrap seed	2026-05-15 18:32:14 -04:00
archipelago	645cf69ed7	chore(release): refresh v1.7.56-alpha manifest after wifi fix	2026-05-15 18:26:17 -04:00
archipelago	01ec0565a6	fix: restore wifi setup and ssh password updates	2026-05-15 18:15:06 -04:00
archipelago	30505f41ff	chore(release): refresh v1.7.56-alpha notes and artifacts	2026-05-15 17:54:32 -04:00
Dorian	5818541721	chore: release v1.7.56-alpha v1.7.56-alpha	2026-05-14 09:13:58 -04:00
Dorian	b8053c00ca	fix: clear stale health notifications	2026-05-14 08:57:54 -04:00
Dorian	f95e9a1cd0	fix: quote quadlet environment values	2026-05-14 01:15:22 -04:00
Dorian	be50dc3235	fix: avoid bootstrap bitcoin restarts	2026-05-14 00:03:16 -04:00
Dorian	2ff47f88a7	fix: harden container reconcile and launch behavior	2026-05-13 22:59:55 -04:00
Dorian	835c525218	chore(release): stage v1.7.55-alpha v1.7.55-alpha	2026-05-13 15:09:22 -04:00
Dorian	3202b79e41	chore(release): move artifacts to gitea releases	2026-05-13 14:11:42 -04:00
archipelago	c0751e2551	chore(release): stage v1.7.54-alpha v1.7.54-alpha	2026-05-06 09:23:57 -04:00
archipelago	1a0d8a432c	chore(release): stage v1.7.53-alpha v1.7.53-alpha	2026-05-05 13:59:50 -04:00
archipelago	745cb1c626	chore(release): stage v1.7.52-alpha v1.7.52-alpha	2026-05-05 11:29:18 -04:00
archipelago	10fbb8f87c	docs(testing): track Phase 3.4 race fix + drift-sync hook * L0 unit count: 630 → 631 (translate_health_check_http_does_not_double_prefix_scheme) * Phase 3 row: add TimeoutStartSec=600 race fix (44f275ed) + drift-sync hook (0889367d) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 11:53:18 -04:00
archipelago	aad0ba5234	feat(orchestrator): drift-sync existing Quadlet units on each reconcile When a Quadlet unit file already exists for an orchestrator-managed backend, sync its on-disk bytes against what the current renderer produces. write_if_changed makes this idempotent — when bytes match, no IO; when they differ (post-deploy of a renderer change), the file is rewritten and systemctl --user daemon-reload runs once. We deliberately do NOT restart the .service when the file changes: running containers keep their current config until the operator restarts them. That's the right tradeoff — file updates are cheap and non-destructive; service restarts are the SIGKILL cascade we're trying to eliminate. Why this matters: pre-this-commit, every renderer change required a fresh package.install RPC per app to take effect. Observed live on .228 2026-05-02 — the TimeoutStartSec=600 fix shipped in code but existing units stayed on the old format because nothing triggered a re-render. Combined with state.json being empty (so the reconciler's auto-install path didn't fire either), the fix was invisible until manual unit deletion. Companions (UI_APP_IDS) are skipped — companion.rs renders those units with a different shape; syncing here would clobber them. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 11:43:18 -04:00
archipelago	281e65e697	fix(quadlet): TimeoutStartSec=600 when Notify=healthy is set Bug surfaced live on .228 2026-05-02 — every backend Quadlet unit (lnd, electrumx, fedimint, btcpay-server, mempool-api, bitcoin-knots) hit systemd's default 90s start timeout because Notify=healthy makes systemctl wait for the first green health probe, but HealthInterval=30s × HealthRetries=3 = 90s minimum even on a healthy service. Race: timeout fires the moment the third probe MIGHT succeed. Result was three different post-states (inactive+running, failed+missing, inactive+stopped) depending on whether systemd's ExecStopPost ran podman rm before the orchestrator's adoption logic re-grabbed the container. Fix: when health is set, render TimeoutStartSec=600 (10 minutes) into [Service]. Long enough for slow-starting backends (electrumx index replay, lnd wallet unlock) without being so long that a truly stuck unit hangs forever. Companions stay unchanged (no health → no override, default 90s applies). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 07:14:48 -04:00
archipelago	384f12de7a	fix(quadlet): http:// double-prefix + companion migration race Two bugs surfaced by the first real-node validation of Phase 3.2-3.4 on .228 (2026-05-02), both caught before flipping the default. Bug 1 — translate_health_check double-prefixed http://. Manifests in the wild carry the scheme inside the endpoint string ("http://localhost:8175"), and we were prepending another http:// unconditionally. Result on .228: every backend HealthCmd read `curl -fsS -m 5 http://http://localhost...`, every probe failed, fedimint hit a 14-restart loop. Now we accept either form and skip appending hc.path when the endpoint already carries one. Regression test asserts no double-prefix and that an in-endpoint path is honoured. Bug 2 — Phase 3.3 migration ran for UI companions (bitcoin-ui / electrs-ui / lnd-ui) that have shipped via Quadlet since v1.7.41. Migration tore down the running companion + raced companion.rs render, producing "Phase 3.3: re-install archy-bitcoin-ui via Quadlet" reconcile errors and leaving archy-bitcoin-ui down. Companions now short-circuit out of migrate_to_quadlet_if_needed before any IO. Also: when try_exists returns Err for an unrelated reason (permissions, EIO), we now skip migration instead of treating "I can't tell" as "go ahead and migrate" — migrating on top of a possibly-existing unit is destructive. What this does not fix yet: * the orchestrator's reconciler iterating every manifest in /opt/archipelago/apps/, not just installed apps. Pre-existing behavior (also affects the legacy path) — separate scope. * fedimint /data UID mismatch surfaced when Quadlet started fedimint fresh. Likely orthogonal — defer. * no rollback when install_via_quadlet fails after a remove_container. Tracked as Phase 3.3.1 — defer. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 06:37:37 -04:00
archipelago	bd96c0475d	feat(config): ARCHIPELAGO_USE_QUADLET_BACKENDS env override Adds an env-var lever for Phase 3.2's use_quadlet_backends flag so the 20× harness can flip the path on per-node without a config.json edit (which would require an archipelago.service restart — and that triggers FM3 cgroup cascade until Phase 3.5 ships, so we can't ask anyone to reconfigure live nodes that way today). Truthy parsing centralised in `parse_truthy_env` (1, true, yes, on — case-insensitive, whitespace-trimmed). Anything else is false. The helper is unit-tested so future env-var flags can reuse the same shape. Also adds a default-off regression test for use_quadlet_backends so flipping the default ahead of the 20× verification fires immediately. TESTING.md documents the Environment= snippet for the systemd drop-in so the next operator can flip the flag on a debug node without re-deriving the recipe. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 05:44:09 -04:00
archipelago	9a89a000d4	test(lifecycle): post-condition gate for use_quadlet_backends path A six-test bats suite that validates what install_via_quadlet (Phase 3.2) is supposed to leave behind: * `.container` unit on disk in $XDG_CONFIG_HOME/containers/systemd/ with [Container] / [Service] / [Install] sections, Image= present, and Restart=on-failure (the backend invariant — companions use Always) * Phase 3.4 cross-check: any unit with HealthCmd= must also emit Notify=healthy, otherwise systemctl start won't gate on health * `systemctl --user is-active` returns 0 for the .service * podman shows the container running * the container's cgroup is under user.slice/, NOT under archipelago.service — the kernel-level proof that FM3 cgroup cascade SIGKILL is structurally fixed for this container Auto-skips on every test when no backend Quadlet units exist (today's default state, use_quadlet_backends=false) — so the suite is a no-op on current fleet boxes and turns into a hard regression gate the moment anyone flips the flag and reinstalls. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 05:34:47 -04:00
archipelago	97ce23d773	feat(quadlet): Phase 3.4 — health-gated startup via Notify=healthy QuadletUnit gains an optional HealthSpec; from_manifest translates the manifest's health_check (tcp/http/cmd) into a HealthCmd= directive and emits Notify=healthy alongside it. systemctl start <unit>.service then blocks until the container's first green probe — eliminating the "container up but RPC not ready" race the orchestrator currently papers over with post-start polling. Translation policy: * tcp, endpoint "host:port" -> nc -z host port * http, endpoint "host:port", path -> curl -fsS -m 5 http://endpoint<path> * cmd, endpoint "<shell command>" -> verbatim * unknown type / malformed endpoint -> None (skip Notify=healthy rather than emit a HealthCmd that hangs the unit start forever) Companion units leave health: None and remain byte-identical to before this PR — the renderer only emits the Health* / Notify= block when set. +4 quadlet unit tests (19 total). Dropped a never-used test setter that was generating a dead_code warning. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 05:21:57 -04:00
archipelago	65576bd755	feat(orchestrator): Phase 3.3 — in-place migration to Quadlet When use_quadlet_backends flips from off → on, existing fleet boxes have backend containers parented under archipelago.service's cgroup (the bad shape that triggers FM3 cascade SIGKILL on every archipelago restart). ensure_running now notices and corrects this: * If there's already a `<name>.container` unit on disk → no-op (subsequent reconcile ticks take this fast path). * Else if a podman container with that name exists → it's a pre-3.3 artifact. Stop+remove it (volumes survive — bind mounts are not touched by `podman rm`), then write the Quadlet unit, daemon-reload, and start the new managed service. * Else → fall through to install_fresh, which already routes through install_via_quadlet when the flag is on. The migration is idempotent and self-healing: if a fleet box is half-migrated (unit on disk but no service active, or service active but stale unit), the next reconcile tick converges. Bitcoin chain data, lnd wallet state, and electrumx index all live on host bind mounts and are unaffected by the container-record swap. Volume safety audited per backend in `uses_orchestrator_install_flow` allowlist — every entry mounts its data dir as a host bind mount. Default still off. To migrate a node: /etc/archipelago/config.toml: use_quadlet_backends = true followed by `systemctl restart archipelago` — the next reconcile tick walks every managed app and migrates each in turn. Tests: 624 passing, 0 cargo warnings. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 17:27:59 -04:00
archipelago	5b2e02bd43	feat(orchestrator): Phase 3.2 — wire Quadlet path behind feature flag prod_orchestrator::install_fresh now branches on the new Config::use_quadlet_backends flag (default false): * off (today's production behavior) — unchanged: runtime.create_container + start_container, container parented under archipelago.service's cgroup, FM3 cascade SIGKILL on every archipelago restart. * on — install_via_quadlet renders the manifest as a Quadlet unit via QuadletUnit::from_manifest, writes it atomically into ~/.config/containers/systemd/, calls daemon-reload, and starts the generated <name>.service. Container ends up under user.slice — no more cgroup parented under archipelago, so archipelago restarts don't touch the container's lifetime. Default off so this commit is structurally safe to ship: nothing changes at runtime until an operator opts in. Flip the default once tests/lifecycle/run-20x.sh has gone green against the new path on .228 + .198 (the v1.7.52 release gate). Plumbing: * config.rs — `use_quadlet_backends: bool` w/ Default false * prod_orchestrator.rs — flag stored on the struct, threaded through new(), with set_use_quadlet_backends(bool) test setter * prod_orchestrator.rs — install_via_quadlet helper * dropped the Phase-3.1 #[allow(dead_code)] markers on from_manifest / parse_memory_mib / RestartPolicy::OnFailure now that the call path exists; if a future revert removes the wiring, the warnings come back. Tests: 624 passing, cargo check clean (0 warnings). Existing companion behavior unaffected — render_skips_backend_directives_when_default still passes byte-equal to before quadlet.rs grew the new fields. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 17:22:10 -04:00
archipelago	9becafafd3	feat(quadlet): backend-manifest renderer (Phase 3.1 of v1.7.52) The QuadletUnit struct now covers everything a backend manifest needs (ports, environment, devices, add_hosts, entrypoint+command, read-only root, no_new_privileges, cpu_quota, restart policy choice). Adds QuadletUnit::from_manifest(&AppManifest, name) that translates a parsed manifest into a unit, plus parse_memory_mib for "1g"/"512m"/raw-MiB forms. The renderer skips empty/false directives so existing companion units render byte-identically — no behavior change for shipping companions; the backend renderer is dead code until Phase 3.2 wires it into the orchestrator. Eight new unit tests cover: * parse_memory_mib forms (1024, 512m, 2g, garbage) * shell_join quoting (whitespace, embedded quotes) * RestartPolicy → systemd string mapping * render emits backend directives when set * render skips them when defaulted (companion regression gate) * from_manifest happy path on a bitcoin-knots-shaped manifest * from_manifest read-only volume detection * from_manifest tmpfs filtering * end-to-end manifest → render bytes assertion Tests: 615 → 624 (+9 net; one pre-existing parse_memory_mib path was implicitly covered before but is now explicit). Cargo warnings: 0. `from_manifest`, `parse_memory_mib`, and `RestartPolicy::OnFailure` are marked allow(dead_code) with explicit references to Phase 3.2 — if 3.2 doesn't wire them, the dead-code warning resurfaces. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 17:09:50 -04:00
archipelago	5074572373	test(lifecycle): add btcpay + fedimint + mempool suites Brings L1 (RPC API) + L3 (lifecycle survival) parity coverage to the three multi-app stacks that were previously only touched by required-stack.bats. Combined with bitcoin-knots / lnd / electrumx already shipping, the six core apps now have dedicated bats files. Each suite is shaped like the existing single-container suites (bitcoin-knots / lnd / electrumx) and gates every assertion on the backing container actually being present, so a node without the stack installed gets clean skip messages instead of false fails. * btcpay.bats — 9 tests, including stack-wide presence and a "supporting containers don't cascade-restart" guard * fedimint.bats — 8 tests, single container * mempool.bats — 9 tests, mixed legacy + orchestrator-managed stack; reuses the :8999 mempool-api probe from required-stack for parity Total bats now: 88 (was 53 → +35). TESTING.md matrix advances 23 → 50 of 110 cells. UI URL coverage for these three apps already lives in ui-coverage.bats, so this PR doesn't duplicate proxy-path probes. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 16:55:31 -04:00
archipelago	ec1dce93a9	docs(testing): canonical scorecard for container subsystem testing Single source of truth for "where are we, where are we going" on the v1.7.52 container excellence work. Replaces ad-hoc tracking in chat. Sections: * Test layers L0..L6 with toolchain + per-iteration latency * Per-app × per-state coverage matrix (23 of 110 cells today; goal 110) * Layer-by-layer status (L0+L1+L2 ●; L3 ◐; L4..L6 ○) * Run commands (single suite / full suite / 20×) * LoC budget — -270 committed, ~1,616 more possible if Phase 3 ships * Performance KPIs (TBD — measure first, target second) * Release gates — 8 boxes that must tick before v1.7.52 ships The file lives in-repo so PR diffs to it answer "what did this commit improve?". If you can't tick the box, the change isn't ready. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 16:52:42 -04:00
archipelago	b9eb6eb18a	test(lifecycle): add UI surface coverage — HTTPS proxy + iframe URLs Closes the coverage gap where existing bats suites would report green on a node whose dashboard tiles 502 because the proxy upstream is dead. First pass against .198 caught real prod issues immediately: /app/lnd/ → 502 (lnd container exited) /app/mempool/ → 502 (mempool container exited) /app/fedimint/ → 502 (fedimint container exited) while existing tests reported only "container is up: false" with no 404/502 distinction. * lib/ui-probes.bash — sourced helper. probe_https_200, probe_app_url (skip-if-container-down else assert-200), probe_dashboard_shell (asserts the Vue SPA HTML, not nginx default — catches the layout regression from feedback_release_tarball_layout.md), probe_dashboard_catalog (asserts /catalog.json non-empty). * bats/ui-coverage.bats — 9 @test cases covering the dashboard + bitcoin-ui :8334 + the seven HTTPS_PROXY_PATHS most users hit (lnd, electrumx, mempool, fedimint, btcpay, filebrowser). URL list mirrors HTTPS_PROXY_PATHS in neode-ui/src/views/appSession/appSessionConfig.ts. Divergence between the two is the exact bug class we're guarding against. Loops clean under run-20x.sh. Container-state oracle is via local podman inspect, so the suite must run on the archy host (same as companion-survives-archipelago-restart.bats). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 16:49:30 -04:00
archipelago	c55a4f4e86	test(bootstrap): regression gate for the heal_podman_state socket bug Extracted the heal_podman_state cleanup list as a module-level HEAL_RUNTIME_SUBDIRS const so a unit test can structurally enforce the invariant: the list must contain "containers" + "libpod" but must NOT contain "podman" (which holds systemd's podman.sock listener and was the bug fixed in commit bb421803). If anyone re-adds "podman" — accidentally, by reverting, or by copy-paste from old plan memory — this test fires before we ship, not on the next deploy when it nukes the orchestrator's HTTP path. Total tests: 614 → 615. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 16:32:59 -04:00
archipelago	01f416ae5d	test(lifecycle): regression gate for FM3 cgroup-cascade SIGKILL Sister suite to companion-survives-archipelago-restart.bats. That one tests the same property for UI companions, which already ship via Quadlet (commit 6e716f68) and so already pass. This new suite tests the property for backend containers (bitcoin-knots / bitcoin-core / lnd / electrumx). Until v1.7.52 Phase 3 ships these under Quadlet too, the suite is EXPECTED TO FAIL on fleet boxes — it's the executable definition of "FM3 fixed". Observed live on .198 on 2026-05-01: `sudo systemctl stop archipelago` killed every container in archipelago.service's cgroup. The dedicated "backends survive archipelago restart" test catches exactly that, and also verifies the SAME container instance survives (compares pre/post .Id), so an orchestrator that recreates a fresh container after the SIGKILL doesn't read as pass. Three @test cases: * destructive gate (skip-marker for the suite) * baseline: at least one backend installed + running * backends survive: same .Id pre + post archipelago restart Don't gate releases on this passing until Phase 3 lands; before then treat it as a "expected to fail / shows progress" indicator. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 16:17:27 -04:00
archipelago	f80daff8ba	test(lifecycle): add dedicated electrumx.bats suite Same shape as bitcoin-knots.bats and lnd.bats so the 20× release-gate exercises electrumx through the same state matrix it uses for the other two core apps. electrumx previously had a single TCP-port check inside required-stack.bats; this adds destructive + cascade-destructive tiers. 10 @test cases: * read-only: presence, valid state, TCP port (50001) reachable, no orphan containers beyond {electrumx, archy-electrs-ui} * destructive: stop, start, restart, TCP port recovers within 120s of cold restart (longer than bitcoind because electrumx replays its index against bitcoind on start) * cascade: uninstall, reinstall (240s timeout for index rebuild) With this suite, the three single-container core apps (bitcoin-knots, lnd, electrumx) now have parity coverage. Multi-container stacks (btcpay, mempool, fedimint) come next. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 16:11:02 -04:00
archipelago	1103c2c710	test(lifecycle): add dedicated lnd.bats suite Mirrors bitcoin-knots.bats so the 20× release-gate run exercises lnd through the same state matrix. lnd previously had only a single read-only check inside required-stack.bats; this adds the destructive and cascade-destructive tiers that match what we already test for bitcoin-knots. 10 @test cases: * read-only: presence, valid state, lncli getinfo, no orphan containers * destructive (ARCHY_ALLOW_DESTRUCTIVE=1): stop, start, restart, RPC recovers within 90s of cold restart (longer than bitcoind because the wallet has to unlock first) * cascade (ARCHY_ALLOW_CASCADE_DESTRUCTIVE=1): uninstall, reinstall Reuses the same lncli invocation as required-stack.bats so divergence shows up clearly if either test breaks. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 16:09:43 -04:00
archipelago	1b6c500657	test(lifecycle): add setup-teardown + run-20x harness scaffolding Phase 4 of the v1.7.52 container excellence plan: a release-gate harness that loops the bats suite N times in a row, with teardown between iterations, and reports a pass/fail tally. * setup-teardown.sh — clears /tmp/archy-rpc-session-* between runs so iteration N+1 doesn't reuse a logged-out cookie from iteration N. Idempotent; safe to run anytime. Designed to grow as we add suites that leave other transient state. * run-20x.sh — wraps run.sh in a loop of ARCHY_ITERATIONS (default 20). Tracks per-iteration pass/fail with wall-clock timing, prints a results block, exits non-zero on any failure. Honors ARCHY_FAIL_FAST for short-circuit during dev. Suggested release-gate command: ARCHY_PASSWORD=password123 ARCHY_ALLOW_DESTRUCTIVE=1 \ tests/lifecycle/run-20x.sh Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 16:06:09 -04:00
archipelago	5be2febe13	fix(bootstrap): don't nuke podman socket dir during runtime self-heal Observed live on .198: heal_podman_state was removing $XDG_RUNTIME_DIR/podman/ alongside containers/ and libpod/. That dir holds the systemd-bound podman.sock — the listener systemd creates for socket-activated podman.service. Removing it broke every libpod HTTP call from the orchestrator until `systemctl --user restart podman.socket` ran. Far worse than any wedge it was trying to repair. Drop podman/ from the cleanup list. The runtime state we actually want to clean for FM6 (bolt_state.db drift) lives in containers/ and libpod/ only. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 15:57:15 -04:00
archipelago	6bbe1b96cf	refactor: drop dead code surfaced by cargo cargo check was showing five real warnings, all genuinely dead: * container/mod.rs — re-exports compute_container_name, AdoptionReport, ReconcileAction, ReconcileReport were unused outside prod_orchestrator. Drop from the pub use line. * prod_orchestrator — with_runtime + insert_manifest_for_test only exist for the test module in the same file. Mark them #[cfg(test)] so they don't appear in release builds. * async_lifecycle — remove_package_entry has no callers; doc claims "used for install-failure cleanup" but nothing cleans up. Delete (10 lines). * registry.rs — `use tracing::{debug, info};` had no consumers. * fips.rs — unused-assignment chain on last_status. The poll loop always sets it on every break path, so the initial `None` and the unwrap_or_else fallback were both dead. Refactored to `let after = loop { ...; break s; };`. cargo check is now clean. cargo test --workspace --bins: 614 passed. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 15:34:02 -04:00
archipelago	8f13298805	fix(bootstrap): self-heal wedged podman runtime state at startup Closes FM6 (podman bolt_state.db / runtime drift) — observed live on .198 today: bitcoind was running for several minutes, but podman's state DB reported the container as Exited. The reconciler then tried to "restart" it, racing the still-bound port 8332 and failing in a loop. heal_podman_state() runs as the last bootstrap stage, BEFORE the orchestrator's reconcile loop ticks. It probes `podman info` with a 5s timeout; on failure it removes the runtime-state dirs under $XDG_RUNTIME_DIR and re-probes. Persistent storage under ~/.local/share/containers/storage/ is never touched, so containers re-discover from manifests on next call. Cleanup never includes `podman system reset` or `system renumber` — those are destructive and must stay operator-only. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 15:23:36 -04:00
archipelago	ba2eece9aa	refactor(container): drop unused dependency_resolver module DependencyResolver had zero call sites in prod or tests outside the module itself. The actual install-time dependency check lives in install.rs::detect_running_deps + check_install_deps; this DAG-walk solver was never wired up. -268 LoC. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 15:22:07 -04:00

1 2 3 4 5 ...

1055 Commits