archy

lfg2025/archy

Author	SHA1	Message	Date
archipelago	02b6b52a8c	feat(mesh): Meshtastic RSSI/SNR + peer-location map wiring (backlog #14/#15, part 1) Backend: parse_mesh_packet now decodes MeshPacket.rx_snr (field 8, float) and rx_rssi (field 12, int32), and a new POSITION_APP branch decodes Position. latitude_i/longitude_i (fields 1/2, sfixed32) -- all field numbers confirmed against the canonical meshtastic/protobufs mesh.proto, not guessed. Threaded through ParsedContact -> refresh_contacts -> MeshPeer (mirroring how pkc_capable was wired for #17), so mesh.peers now surfaces real rssi/snr/lat/ lon instead of always-null. Fixed a real bug found along the way: update_node_info's unconditional contact replace would have silently wiped any already-tracked signal/position data on the next NodeInfo packet -- now preserves it. Frontend: mesh.ts's updateNodePositionsFromPeers() feeds real position data into the SAME nodePositions map MeshMap.vue already renders from (parallel to the existing Coordinate/Alert-message path) -- MeshMap.vue itself needed zero changes, it was already built for this. 105/105 mesh tests pass (4 new: rx_snr/rx_rssi decode, position decode + incomplete-field handling, full packet_to_inbound_frame integration). Co-Authored-By: Claude Sonnet 5 <noreply@anthropic.com>	2026-06-30 22:52:42 -04:00
archipelago	dfca007949	wip(mesh): parse MeshPacket rx_snr/rx_rssi fields (Meshtastic backlog #14 , part 1/many) Field numbers confirmed against the canonical meshtastic/protobufs mesh.proto (rx_snr=8 float, rx_rssi=12 int32), not guessed. Not yet threaded through to ParsedContact/MeshPeer/mesh.peers — that's the next step. Part of the Meshtastic 1.8.0 backlog plan (RSSI/SNR indicator, peer-location map, Device tab, provisioning robustness, onboarding modal) — see .claude/plans/floofy-riding-seahorse.md for the full plan. Co-Authored-By: Claude Sonnet 5 <noreply@anthropic.com>	2026-06-30 22:27:54 -04:00
archipelago	0eb5c258f5	fix(mesh): Meshtastic 3ccc pkc_capable pill + Sideband image interop + critical CBOR wire-bloat fix Merges in the meshtastic agent's now-finished work alongside this session's continuation: stock-peer (3ccc) PKI-capability is now stamped through get_contacts -> refresh_contacts -> MeshPeer.pkc_capable, so a directed DM to/from a PKC-capable stock Meshtastic peer correctly shows the E2E pill on the Sent row, not just received messages. Confirmed live: .198 sees "Meshtastic 3ccc" with pkc_capable=true. Also fixes two real interop/correctness bugs found while live-testing the Reticulum <-> Sideband link: - Receive: the daemon only ever read LXMF's plain-text content, silently dropping native FIELD_IMAGE/FIELD_FILE_ATTACHMENTS fields — a stock Sideband/NomadNet photo vanished into a blank-space message. Now decoded into the same ContentInline typed envelope our own attachments use. - Send: images to a non-archy (stock) peer now use native LXMF FIELD_IMAGE instead of our own opaque CBOR wire format, which Sideband can't decode. - Root cause of a garbled MC-chunk-fragment bug: TypedEnvelope.v/.sig (the OUTER wrapper every message type uses) serialized raw bytes as a CBOR array-of-integers instead of a native byte string, bloating every message on the wire ~2-3.5x — enough to push even a tiny ReadReceipt over the 140-byte single-frame chunking threshold. Root-caused by reading ciborium's deserializer source directly (deserialize_bytes only works within its internal scratch buffer; deserialize_byte_buf streams unbounded). Frontend: consolidated the attach/record buttons into a single animated "+" menu (was overflowing the compose row). 857/857 tests pass. Verified live across all 5 deploy-roster nodes. Co-Authored-By: Claude Sonnet 5 <noreply@anthropic.com>	2026-06-30 22:07:45 -04:00
archipelago	f54c853128	feat(mesh): Reticulum LoRa hardware gates pass + RNS Resource transfer + image/voice attachments Phase 0 gates #2/#3 (two-node LXMF-over-LoRa, external Sideband interop) passed on real hardware (.116's flashed Heltec V3 RNode <-> a phone-flashed RNode running Sideband) — RNS announce, encrypted DM round-trip, and contact binding all verified live. Fixed two bugs found in the process: the Reticulum send path wasn't stamping outbound messages as E2E despite LXMF being unconditionally encrypted, and the per-message transport pill collapsed Meshcore/Meshtastic into one generic "lora" color instead of distinguishing the three radio transports. Built on top of that link: a Columba-style image/file send experience — compression-quality presets with a real transfer-time estimate (mesh.transport-advice, now device-throughput-aware), receive-side thumbnail previews + auto-render for already-local attachments, and async voice messages, all reusing the existing ContentRef/ContentInline attachment pipeline. The headline addition is genuine RNS Resource transfer support (daemon-side RNS.Link + RNS.Resource, Rust-side send_resource/resource_recv plumbing, a new "resource-mesh" transport-advice tier) so compressed photos up to 2MB now actually transfer over LoRa for Reticulum peers instead of always falling back to Tor past the small inline-chunk cap. Co-Authored-By: Claude Sonnet 5 <noreply@anthropic.com>	2026-06-30 19:57:01 -04:00
ssmithx	f3cbeb2834	first commit of openwrt-tollgate integration	2026-06-30 20:30:26 +00:00
archipelago	12e7990b10	fix(mesh): route Meshtastic public-channel text to the channel thread, not DMs Inbound Meshtastic text addressed to BROADCAST_NUM (the default public LongFast channel, or any channel slot) was filed into a per-sender 1:1 DM thread, so public-channel messages polluted individual people's DM chats and appeared as if sent directly to the user. packet_to_inbound_frame now detects `to == BROADCAST_NUM` and emits a new synthetic RESP_MESHTASTIC_CHANNEL_TEXT frame ([channel_idx][sender_prefix(6)][text]) that the listener files under the channel thread (contact_id = u32::MAX - idx) while still attributing the message to its real sender. Directed text (to == our node) still routes to the DM thread — a regression test locks that split in. send_channel_text now sets MeshPacket.channel (field 3) so archy actually transmits on channel 0 (public) instead of ignoring the slot. Mesh.vue keeps the synthetic "Meshtastic !xxxx" sender id when that is the best identity available for a stock public-channel device. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-30 14:33:30 -04:00
ssmithx	edbad30501	fix(openwrt): TollGate apk-native install for OpenWrt 25.x - WISP wizard: step-by-step flow for WiFi, DHCP, masquerade config - WAN status: expose lan_ip, dhcp_start/limit, masq, sta_state, wifi_log - wifi_scan: detect CCMP as WPA2 (psk2) so association succeeds - opkg: PkgManager enum — detect apk-native mode when opkg not in repos - tollgate: apk-native install path using manual ipk extraction - arch detection: read DISTRIB_ARCH from /etc/openwrt_release; normalise bare mipsel/mips from uname -m to mipsel_24kc/mips_24kc - install_ipk: install binutils via apk when ar not in BusyBox - install_ipk: wget --no-check-certificate for routers without CA bundle - install_ipk: ar fallback to tar -xzf for non-standard ipk formats - install_ipk: 5MB overlay space check with clear user-facing error - middleware: allow "Not enough flash/space" errors through sanitizer Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-30 17:12:57 +00:00
ssmithx	a862877189	feat(openwrt): add WAN diagnostics to get-status and UI get_wan_status now returns: radio0_disabled, sta_iface (from iw dev), sta_state (operstate), assoc_ssid (actually associated SSID vs configured), and recent wifi_log lines from logread. The WAN panel shows a diagnostic grid when configured but not connected so the user can see exactly what's wrong without digging into server logs. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-30 17:12:57 +00:00
ssmithx	33b96f4acf	fix(openwrt): enable radio0 when configuring WISP configure_wisp was setting up wireless.wwan but leaving radio0.disabled=1, so wifi reload did nothing and the sta interface never appeared. Explicitly set radio0.disabled=0 before committing the wireless UCI config. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-30 17:12:57 +00:00
ssmithx	5ab569f150	fix(openwrt): use iw phy interface add for scan when no UCI wifi-iface exists wifi up does nothing without a wifi-iface section in UCI (common on fresh flash). Instead, create a temporary managed interface directly on phy0 via nl80211 (iw phy phy0 interface add scan0 type managed), scan on it, then delete it. No netifd/UCI involvement needed for scanning. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-30 17:12:57 +00:00
ssmithx	9dc2343b60	fix(openwrt): enable radio0 and run wifi up before scanning On a freshly-flashed OpenWrt router, radio0 is disabled by default so iw dev returns empty. Detect the PHY via /sys/class/ieee80211/, enable radio0, run `wifi up`, then poll up to 8s for netifd to create the virtual interface before handing it to iwinfo scan. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-30 17:12:57 +00:00
ssmithx	ddc839400a	fix(openwrt): use iw dev for wireless interface detection wlan0 doesn't exist on OpenWrt 25.x with mt76 drivers (Cudy TR1200); interfaces are named phy0-ap0 etc. `iw dev` handles all mac80211 naming styles. The old while-read loop also exited with code 1 when no match was found, causing run_ok to fail. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-30 17:12:57 +00:00
ssmithx	9a782fb551	feat(openwrt): WAN/WISP setup from the UI with WiFi network scan New RPC methods: - openwrt.scan-wifi: triggers iwinfo scan on the router radio, returns networks sorted by signal strength - openwrt.configure-wan: creates UCI wireless.wwan (sta mode) + network.wwan (DHCP) + adds wwan to firewall WAN zone, then calls `wifi reload` get-status now includes a `wan` object with configured/ssid/ip/ internet fields so the UI can show current uplink state. Frontend WAN panel: scan → pick SSID (signal bars) → enter password → apply. Shows "Configure WAN first" hint above TollGate install button when internet is not available. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-30 17:12:57 +00:00
ssmithx	dd3a3dfbac	fix(openwrt): capture apk stderr and run apk update before apk add opkg apk errors were being silently dropped (stdout only). Run apk update first and fail with a clear "router may have no internet" message if it fails, rather than a cryptic exit-1 from apk add. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-30 17:12:57 +00:00
ssmithx	5d82e6ff8d	fix(openwrt): bootstrap opkg via apk on OpenWrt 25.x routers OpenWrt 25.x switched from opkg to apk as the default package manager, so devices like the Cudy TR1200 on 25.12.4 don't have /usr/bin/opkg. When opkg is missing but apk is present, install opkg through apk first so the rest of the provisioning flow can proceed unchanged. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-30 17:12:57 +00:00
ssmithx	58266dea66	fix(openwrt): allow opkg-not-found error through RPC sanitizer "opkg not found at /usr/bin/opkg" was being swallowed by the error sanitizer and shown as generic "Operation failed". Also fix bare `opkg list-installed` call in get-status handler to use full path. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-30 17:12:57 +00:00
ssmithx	bc1ec9aa3e	fix(openwrt): use full opkg path and pre-check availability `channel.exec()` doesn't source the shell profile, so PATH may not include /usr/bin on some routers. Using /usr/bin/opkg explicitly avoids exit-127 surprises. Added opkg_check() to give a clear error ("firmware may not support package management") before attempting opkg_update, rather than a confusing "command not found" exit code. Also split the BusyBox-hostile `grep -v 'all\\|noarch'` into two separate greps for the arch-detection fallback. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-30 17:12:57 +00:00
ssmithx	4c56e1bb96	fix(openwrt): detect opkg silent failure and show disk space error BusyBox opkg exits 0 even when 'Cannot install' due to insufficient space, causing the fallback to silently report success. Now captures stderr and checks for the failure string explicitly. Adds user-visible error for the common case where the router flash is too small for the TollGate package (~19 MB needed vs ~9 MB available on typical budget routers). Adds error prefixes to the RPC sanitizer allowlist so the message reaches the UI instead of showing 'Check server logs'. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-30 17:12:57 +00:00
ssmithx	f69fac627a	style(openwrt): adopt glass-card design system for contrast Replace bg-white/5 card containers with glass-card (rgba(0,0,0,0.65) + backdrop-blur), match input styling to Login.vue, and use glass-button variants for actions. Fixes low contrast against the background image. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-30 17:12:57 +00:00
ssmithx	f054766a58	feat(openwrt): add TollGate provision button and direct-download fallback - OpenWrtGateway.vue: add "Install TollGate" button when not installed; tracks connected credentials for reuse in the provision call - install.rs: fall back to wget download from GitHub releases when the package is not in any opkg feed (mips_24kc and other arches supported) - openwrt.rs: provision-tollgate now falls back to saved router_config for credentials, matching the behaviour of get-status Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-30 17:12:57 +00:00
ssmithx	6c534715ec	fix(openwrt): allow No router/OpenWrt errors through RPC sanitizer Without these prefixes in the allowlist, sanitize_error_message swallowed the "No router configured" error and returned a generic "Operation failed", so the frontend could never detect the unconfigured state and show the connect form. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-30 17:12:57 +00:00
ssmithx	d71f36370d	feat(openwrt): add OpenWrt gateway status view and get-status RPC Backend: new `openwrt.get-status` RPC endpoint SSHes into the saved (or provided) OpenWrt router and returns system info, TollGate config, and WiFi AP interfaces via UCI. Frontend: new OpenWrtGateway.vue view at /dashboard/server/openwrt shows system hostname, OpenWrt version, uptime, TollGate install/enable state with pricing and mint URL, and all AP-mode WiFi interfaces. Linked from the Local Network section of the Server view. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-30 17:12:57 +00:00
ssmithx	e0cc00be0f	feat(openwrt): add archipelago-openwrt crate with TollGate provisioning New `archipelago-openwrt` workspace crate provides SSH/UCI-based management of OpenWrt routers, including automated TollGate installation and configuration of a pay-as-you-go "archipelago" SSID backed by the local Cashu mint. Exposes two RPC endpoints: - `openwrt.scan` — discover OpenWrt routers on the LAN - `openwrt.provision-tollgate` — install tollgate-module-basic-go, write UCI config (TIP-01/TIP-02), and create isolated WiFi SSID + firewall zone Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-30 17:12:57 +00:00
archipelago	f392670e2a	feat(mesh): show sender identity on received channel messages Received messages snapshot peer_name at receive time, so a Meshtastic text that arrived before its sender's NodeInfo was stuck showing the synthetic "Meshtastic !xxxx" id forever, and channel/group bubbles showed no sender at all. Add a per-bubble sender label for received messages in multi-sender views (mesh + Archipelago channels), resolved LIVE from the peer table so it always shows the current archy identity (e.g. "Arch Optiplex") the moment NodeInfo is learned. Falls back to "Unknown sender" rather than echoing a Channel/synthetic placeholder. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-30 13:04:41 -04:00
archipelago	a57ae388ec	fix(mesh): restore Meshtastic inbound stream after radio reboot archy went deaf to inbound LoRa packets after every config write. A config write (region/channel/owner) reboots the radio, which resets the firmware PhoneAPI to STATE_SEND_NOTHING; it won't stream received packets again until the client re-sends want_config. archy ignored FromRadio.rebooted (field 8) so never resubscribed — which is why old messages only arrived after a full restart (restart = fresh want_config). - meshtastic.rs: handle FROM_RADIO_REBOOTED -> set pending_reinit; try_recv_frame re-sends want_config to resubscribe the packet stream. Add send_keepalive (bare heartbeat) and pin modem_preset=LONG_FAST in set_lora_region so all radios share frequency. - listener/session.rs: MeshRadioDevice::send_keepalive; 10s sync_timer sends a keepalive each tick (insurance vs 15-min idle serial close). - mod.rs send_message: device-aware send — Meshtastic archy peers get a plain TEXT_MESSAGE_APP DM (firmware PKC E2E); Meshcore archy peers keep the typed envelope (no meshcore regression). Verified: .198->.228 directed DM arrives as RECEIVED enc=True peer="Arch Optiplex"; all 3 nodes (.116/.198/.228) + 3ccc hear each other. Binary 737b16c3 deployed+active on all three. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-30 12:44:31 -04:00
archipelago	fbfeeeb0f5	fix(mesh): native E2E DM for archy↔archy text + software radio-reboot - send_message now sends archy↔archy plain text as a native TEXT_MESSAGE_APP DM (firmware PKC-encrypts E2E), not wrapped in the binary typed envelope that silently broke archy↔archy LoRa delivery. Archy peers' Sent rows are marked encrypted so the E2E pill shows; rich typed msgs still use the typed-wire path. - Add a software radio-reboot to recover a wedged/RX-deaf radio without physical access (and for the Device-tab settings panel): driver reboot() via AdminMessage reboot_seconds=97 (verified vs meshtastic/protobufs), MeshCommand::RebootRadio, MeshService::reboot_radio, RPC mesh.reboot-radio. - Handoff doc: docs/SESSION-1.8.0-OTA-PROGRESS.md "RESUME HERE" — RF link is the proven blocker (radios not hearing each other); modem_preset mismatch is the prime suspect; on-device Meshtastic-app check + fix plan documented. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-30 10:39:34 -04:00
archipelago	b4531bb4fc	fix(mesh): enforce LoRa-only off-grid labels	2026-06-30 06:22:45 -04:00
archipelago	2ac0711f8e	fix(ui): refresh mesh transport labels after send	2026-06-30 06:05:41 -04:00
archipelago	a91814641e	fix(mesh): set Meshtastic hop limit and show LoRa pill	2026-06-30 05:59:53 -04:00
archipelago	c2c4b5af7d	merge: demo build updates # Conflicts: # neode-ui/src/stores/appLauncher.ts # neode-ui/src/views/AppSession.vue	2026-06-30 05:22:42 -04:00
archipelago	daf750688d	merge: mesh multiversion and transport pills # Conflicts: # core/archipelago/src/mesh/listener/decode.rs # core/archipelago/src/mesh/meshtastic.rs	2026-06-30 05:19:58 -04:00
archipelago	4b7cbf2b5e	merge: bitcoin version bulletproof and OTA work	2026-06-30 05:08:27 -04:00
archipelago	df9d3a55be	integration: preserve deployed 1.8.0 OTA work	2026-06-30 05:08:17 -04:00
archipelago	7b0748c868	fix(mesh): respect the radio's flashed LoRa region (don't force ours) ensure_lora_region previously force-overrode the device's region with the mesh-config region (EU_868) whenever they differed — which would shove a US/ANZ user's radio onto EU_868: an illegal band that also cuts it off from its local mesh. Off-the-shelf interop must respect whatever region the user flashed. Now: a radio that already reports a REAL region (US, EU_868, ANZ, …) is left untouched. We only set a region when the device reports UNSET (a fresh radio is RF-silent and can't mesh at all), using the operator-configured region as the fallback. Unknown/None (never reported) is also left alone. Pairs with the default-channel change so a meshtastic archy node behaves like a stock device. cargo check green (built into the same binary as the channel fix). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-29 08:36:04 -04:00
archipelago	810127fd3e	feat(mesh): meshtastic off-the-shelf interop — default channel + private archipelago Make a meshtastic-equipped archy node work like a stock Meshtastic device AND keep the private archy group, instead of being isolated on a custom primary: - slot 0 (PRIMARY) = the DEFAULT public channel (empty name + default key) → interoperates with every off-the-shelf device on LongFast and picks up default-channel users; our NodeInfo broadcasts ride here like normal. - slot 1 (SECONDARY) = "archipelago" (deterministic psk) → private archy↔archy. Previously the driver set "archipelago" as the PRIMARY, isolating archy from the public mesh. Now ensure_channel writes at most one channel per call (default primary first, then archipelago secondary), reusing the existing reboot→ reconnect→re-check loop so it converges in ≤2 cycles without reboot-looping; primary_is_default() accepts the default key in 1-byte or expanded form so a stock radio is never needlessly rewritten. set_channel generalized to (index, name, psk, role); want_config parse tracks both slots. MeshCore needs no change — it never overrides channels (ensure_channel is a no-op) and already rides MeshCore's default Public channel off the shelf. cargo check green. NEEDS radio verify on .116/.198 (default-channel RX + archy group on the secondary). Channel provision cap (3) covers the 2-write migration. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-29 07:40:10 -04:00
archipelago	067002b04b	Merge branch 'bitcoin-version-bulletproof' into mesh-multiversion-integration	2026-06-29 06:45:50 -04:00
archipelago	20f762cb2c	feat(fips): auto-peer LAN-discovered federation nodes directly over FIPS Mesh/federation messages between co-located nodes were always falling back to Tor because the FIPS overlay had no direct peering — every node depended on the global anchor's spanning tree, and when that anchor link flaps a node is isolated and all FIPS dials time out. (Diagnosed live on .116/.198: pure-FIPS direct peering over UDP 8668 fixes it — 2.5ms vs timeout.) Generalize the manual fix: in the existing 5-min FIPS seed-anchor apply loop, also auto-connect every federation peer the PeerRegistry knows both a LAN address AND a FIPS npub for, dialing its FIPS UDP transport (port 8668) at its LAN IP via the same idempotent `fipsctl connect` path (new anchors::lan_fips_anchors). This is FIPS's own transport over the LAN — NOT Tailscale, NOT the HTTP/LAN messaging port. Transient (recomputed each tick from live mDNS discovery, never persisted) so changing IPs self-correct. Remote peers with no LAN address are untouched (still routed via the anchor). Registry Arc hoisted out of the transport-init block so the loop can read all_peers(). cargo check green. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-29 06:42:18 -04:00
archipelago	11155055aa	feat(mesh): meshtastic PKI E2E pill — surface pki_encrypted on received DMs The synthetic meshcore-style frame the meshtastic driver builds can't carry the radio's PKI-encryption status, so received meshtastic DMs never lit the E2E pill. Thread it out-of-band: the device records `last_rx_encrypted` (= packet pki_encrypted) when it yields a text frame; the session loop reads it via `take_rx_encrypted()` right after dispatch and stamps the just-stored received message E2E (dispatch::stamp_received_encrypted, monotonic-id keyed). Meshcore returns false here (its E2E is derived in the frames decrypt path). Pure out-of-band signal — no change to the shared meshcore wire format. Built + deployed live in binary d937814e on .116/.198. cargo check green. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-29 06:25:01 -04:00
archipelago	f4f45c1a09	docs: mark .228 reindex finish/verify as other-agent owned Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-29 06:04:01 -04:00
archipelago	ed1352d3a3	docs+catalog: bitcoin multi-version rollout handoff + reproducible generator - generate-app-catalog.sh: VERSIONS map now lists the full Knots set (29.3.knots20260508/20260507/20260210 + 29.2.knots20251110) and Core (adds 29.2 + a `latest` entry → newest); generator forces top-level `version` == the default entry's version (the 169ff2e2 invariant) so regeneration is reproducible. releases/app-catalog.json regenerated. - docs/bitcoin-version-bulletproof-rollout.md: full handoff — root causes, fixes, current .228 state, the coordinated fleet-rollout steps (incl. :latest repoint sequencing / fleet-safety), reindex finish procedure, and the switch-matrix test plan. - PRODUCTION-MASTER-PLAN.md: link the rollout doc (§6b-bis). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-29 06:02:24 -04:00
archipelago	095a76cd20	fix(bitcoin): bulletproof multi-version switching (Knots & Core) Three stacked bugs made "switch version" silently fail / crash-loop, and the data-access mismatch corrupted a node's index during recovery attempts. Backend renderer: - sync_quadlet_unit ignored the per-app pinned version and re-rendered the quadlet with the manifest's :latest every reconcile tick, reverting any switch. Factor the install-time catalog/pin resolution into a shared resolve_catalog_image() and call it in BOTH install_fresh and sync_quadlet_unit. - The renderer folded manifest `entrypoint: ["sh","-lc"]` into Exec=, which only worked when the image entrypoint was a passthrough shell wrapper. The versioned images use ENTRYPOINT ["bitcoind"], so Exec=sh -lc ... became `bitcoind sh -lc ...` and crash-looped. Emit a real Entrypoint= override; exec_changed now also compares Entrypoint=. Images: - Build all bitcoin images (Core + Knots, every version) as container-root (USER removed) like the legacy :latest image. Chain data is owned by the data_uid (container uid 102); root reads it via CAP_DAC_OVERRIDE (granted in the manifest). A non-root USER (the previous uid 1000) can't read existing chain data → "Error initializing block database". Still fully rootless: container-root maps to the unprivileged host service user. Catalog: - bitcoin-knots versions[]: 29.3.knots20260508/20260507/20260210 + 29.2.knots20251110, "latest" tracking newest. - bitcoin-core versions[]: add 29.2 + a "latest" entry. All images rebuilt root and published to the mirror. Frontend: - AppSidebar version dropdown: rename the latest option to "Always use the latest version" (no v prefix), fix right padding, and guarantee the current selection matches a real option (was rendering blank). - New InstallVersionModal: full-screen version chooser shown from the App Store / Discover install button for multi-version apps (Bitcoin Knots/Core), app icon + "Install <name>", latest pre-selected. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-29 05:46:04 -04:00
archipelago	3c7c04a662	fix(mesh): meshtastic receive — drain frame batch per poll + rx diagnostics Addresses the open Meshtastic parity bug (project_meshtastic_parity): the running driver received nothing (`mesh.messages` stayed []) though the radio got the packets and sends worked. Root-cause candidate: `try_recv_frame` decoded ONE serial frame per poll and returned Ok(None) for every non-text FromRadio frame, so the session loop slept 50ms between frames. Under Meshtastic's frequent NodeInfo/telemetry stream a received text packet queued behind them, and read_from_radio's 64KB buffer cap could drain (drop) it before it was ever decoded — reception silently dead while sends kept working. - try_recv_frame now drains a bounded batch (64) per poll, processing each frame's side effects and returning the first inbound text frame, so a text packet is decoded the same poll it arrives and the buffer never grows enough to hit the lossy cap. Bounded so a continuous flood still yields to select!. - packet_to_inbound_frame logs every decoded packet (from/portnum/payload_len) and a "did not parse (dropped)" case, so one live radio pass is conclusive. The rest of the decode path was verified correct by inspection (FROM_RADIO_PACKET =2, wire-type-5 handled, parse_mesh_packet sound, 60s heartbeat present) — not a parse bug. cargo check green. NEEDS a live radio pass on a rig that isn't .228 (off-limits: bitcoin testing) to confirm. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-29 05:04:09 -04:00
archipelago	11038cdcc9	feat(mesh,ui): per-message transport pill (Mesh/FIPS/Tor) + fix E2E pill Adds a per-message transport badge to archy↔archy mesh chats and fixes the long-broken E2E badge — both meshcore and meshtastic, styled like the existing E2E pill. Transport pill: - New `MeshMessage.transport` ("lora"/"fips"/"tor"), surfaced in the UI beside the E2E badge (Mesh.vue transportLabel() → Mesh/FIPS/Tor, mesh-styles.css). - Sent LoRa → "lora"; sent federation → finalized to the real leg ("fips"/"tor") once the background send resolves (req.send_json transport), via an id-keyed store update. - Received: a post-dispatch stamp on handle_typed_envelope_direct's output (monotonic ids) tags both transports without threading through all 20 typed- dispatch sites — radio wrapper stamps "lora", federation injector stamps the peer's last_transport ("fips"/"tor", default tor; the inbound HTTP carries no FIPS-vs-Tor signal). - Plain native/channel LoRa frames → "lora"; channel broadcasts stay non-E2E. E2E pill fix: - `encrypted` was hardcoded false at every MeshMessage construction site, so the UI badge (Mesh.vue `v-if="msg.encrypted"`) never showed. Now: federation envelopes are E2E (identity-signed over an encrypted transport); the meshcore native-DM receive path already had a real `encrypted` flag (now also tagged with transport). meshtastic-PKI radio E2E flag threading is a noted follow-up. Backend cargo check + frontend vue-tsc build both green. Needs a live radio + multi-transport pass on .116/.228 to confirm end-to-end (see project_transport_pill / project_meshtastic_parity). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-29 04:29:25 -04:00
archipelago	169ff2e2cd	fix(bitcoin): knots catalog default must equal top-level version The knots versions[] marked 29.3.knots20260508 as default while the top-level catalog version is the floating 'latest' tag — violating the generator's own invariant (default:true MUST equal the top-level version so selecting it un-pins / tracks latest). Live effect via package.versions: catalog_default_version='latest' so the UI-highlighted default actually PINS+recreates (opposite of un-pin) and 'latest' was unreachable from the Version & Updates card. Add a 'latest' default entry (== the manifest's floating tag) and keep 29.3.knots20260508 as a pinnable option. Verified on .228: package.versions now returns default=latest with 2 selectable versions. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-28 19:56:49 -04:00
archipelago	da20f67462	Merge bitcoin-multi-version: multi-version support for Core & Knots Integrate the bitcoin-multi-version feature (commit 6aa74c73): per-node choice/pin/switch of Bitcoin Core & Knots versions with auto-update toggle — catalog versions[] schema, install-time selection, package.versions + package.set-config RPCs, hourly per-app auto-update tick, build-bitcoin-image.sh (GPG+SHA verified rootless image builder), and UI (version select + Version & Updates card). Catalog regenerated; preserves the mempool 127.0.0.1 health fix. Not yet live-verified on .228 — gate any tagged release on that per CLAUDE.md.	2026-06-28 18:48:38 -04:00
archipelago	6aa74c7386	feat(bitcoin): multi-version support for Core & Knots (install/switch/pin/auto-update) Lets a node runner choose which Bitcoin Core / Knots version to install (latest pre-selected), then switch, pin, or opt into auto-update from the app's interface — all manifest/catalog-driven, rootless, signed-registry, zero-data-loss. Motivated by upcoming BIP-110 signalling: runners need a real choice of software version. Backend: - version_config.rs: per-app pin + auto-update persistence (atomic, merge- preserving), downgrade detection, auto-update enumeration (+ unit tests). - app_catalog.rs: CatalogVersion / versions[] schema, catalog_versions(), catalog_image_for_version() (same-repo guard); a pin suppresses the update badge. - prod_orchestrator.rs: pinned version wins over the catalog default on every install/recreate. - install.rs: install-time `version` param persisted (default = unpinned). - set_config.rs: package.versions (read) + package.set-config (write) RPCs; downgrade is gated behind explicit confirm (warn + confirm + allow). - update.rs/main.rs: hourly per-app auto-update tick via the orchestrator (opt-in, pin-respecting); fix handle_package_update to be non-fatal for orchestrator-managed apps lacking a catalog primary image (bitcoin-core). UI: - MarketplaceAppDetails.vue: install-time version selector (shown when an app offers >=2 versions). - appDetails/AppSidebar.vue: "Version & Updates" card (switch / pin / auto- update toggle / downgrade warning), per app. - rpc-client.ts + en.json: RPC methods, types, strings. Phase 0 image pipeline: - scripts/build-bitcoin-image.sh: download official tarball + SHA256SUMS(.asc), verify SHA-256 + pinned-maintainer OpenPGP signature (fail-closed), build a minimal rootless image, smoke-test, tag + push. - apps/bitcoin-core/Dockerfile rewritten (drops stale community base); apps/bitcoin-knots/Dockerfile added. - generate-app-catalog.sh: emit curated versions[]; published + catalog now offers Core 25.2/26.2/27.2/28.4/29.3/30.2/31.0 + Knots 29.3.knots20260508. docs/bitcoin-multi-version-design.md: live progress tracker. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-28 18:46:17 -04:00
archipelago	3cea7dd6c5	test(phase3): fix Phase-3 quadlet gates — define fail(), drop stale Notify=healthy assert Two Phase-3 bats suites used `fail` (a bats-assert helper) but bats-assert isn't installed on the alpha fleet (only bats-core), so every tripped assertion crashed with `fail: command not found` (status 127) instead of reporting a real pass/fail. Define the same minimal `fail() { echo ...; return 1; }` the other suites already use (see mempool.bats). Without this the gates were silently non-functional. Also rewrite the obsolete "HealthCmd= implies Notify=healthy" assertion in use-quadlet-backends-install.bats. Phase 3.4's Notify=healthy was deliberately reverted: gating `systemctl start` on health hung boot reconciliation for dependency-waiting apps (fedimint idles until Bitcoin IBD; lnd until macaroon unlock), leaving units stuck "deactivating". The renderer now emits HealthCmd= for Podman's health state but TimeoutStartSec=0 and NO Notify=healthy (quadlet.rs render() + contains_stale_health_gate()). The test now asserts the current invariant: no backend unit gates start on health. Verified on the .228 canary node (ARCHIPELAGO_USE_QUADLET_BACKENDS=1): use-quadlet-backends-install 6/6, backend-survives-archipelago-restart 3/3. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-28 16:09:05 -04:00
archipelago	d7c6f8c348	fix(mempool): health-check 127.0.0.1 not localhost (stops false-unhealthy loop) The archy-mempool-web health_check endpoint used http://localhost:8080. Inside the frontend image, wget resolves `localhost` to ::1 (IPv6) first, but nginx binds 0.0.0.0:8080 (IPv4) only -> the baked HealthCmd gets "connection refused" every probe -> container is perpetually unhealthy -> the reconciler recreates it forever (observed on .228: mempool container re-Started every ~3 min, Health=unhealthy). Proven live: in-container `wget http://localhost:8080/` = refused, `wget http://127.0.0.1:8080/` = OK. Pin the probe to 127.0.0.1 so it matches nginx's IPv4 bind. Updated both the source manifest and the embedded copy in releases/app-catalog.json (the catalog overlay wins over the disk manifest on fleet nodes, so the catalog copy is the one that actually reaches .228). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-28 15:09:34 -04:00
archipelago	83344b9f3a	fix(orchestrator): drop legacy mempool umbrella manifest on catalog-driven nodes The split-mempool-stack guard that skips the legacy monolithic `mempool` manifest (whose container collides with its split-stack frontend member `archy-mempool-web`) only ran over DISK manifests. On catalog-driven nodes (no disk manifests — e.g. the Phase-3/registry-manifest path), the legacy `mempool` manifest arrives via the registry-catalog overlay AFTER that guard, so both `mempool` and `archy-mempool-web` end up owning container `mempool` and rewrite+restart each other forever ("port binding drift" / "network alias drift" loop observed on .228, leaving mempool down). Enforce the guard once more over the merged (disk + catalog) manifest set: drop the `mempool` umbrella whenever all three split members are present. Installing `mempool` assembles the split stack, so `archy-mempool-web` owns the frontend container either way. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-28 14:04:41 -04:00
archipelago	05c22b6085	fix(mempool): correct frontend container port 4080->8080 (stops restart loop) The mempool manifest + embedded catalog declared the frontend container port as 4080, but mempool-frontend nginx listens on 8080 (the stack creates it as -p 4080:8080 with FRONTEND_HTTP_PORT=8080, see api/rpc/package/stacks.rs). So every reconcile rendered the quadlet as PublishPort=4080:4080, disagreed with the working 4080:8080 container, and restarted it ("port binding drift" -> "host port 4080 did not become reachable within 5s" -> "host listener disappeared; restarting") in a perpetual loop on .228. Correcting the manifest container port to 8080 makes the rendered quadlet match reality so the drift/restart loop stops. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-28 13:49:54 -04:00

1 2 3 4 5 ...

1569 Commits