[Bug] Reconciler trusts podman 'Up' for conmon-died containers (netbird up-but-not-serving) #53
Loading…
x
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Observed live on .198 (#15): both
netbird-serverandnetbird-dashboardwere inState.Error = "conmon died without writing exit file"—podman psreported them "Up 29 hours" while they were actually dead and refusing all connections. The proxy then 502'd and the dashboard showed "Unauthenticated", with no automatic recovery.The reconciler trusts
podman ps"Up" status and didn't detect a container that is up-but-not-serving. Ask: treat aconmon died/ connection-refused upstream as unhealthy and recreate the container (a plainrestartdid not recover it on .198; arm -f+ recreate did). Relates to the container-lifecycle failure modes already tracked.