Security & Trust Model

Security Overview

Bootstrap tokens are one-time-use with a short TTL. They are deleted from disk after successful registration.
Private keys are generated during registration and stored in /var/lib/plexd/. They never leave the node.
Control plane communication is TLS-encrypted (HTTPS). The agent validates the server certificate. Every SSE event is additionally signed with the control plane's Ed25519 key and verified by the agent before processing.
Mesh traffic is encrypted end-to-end via WireGuard.
Compromised nodes can be force-removed from the control plane, triggering key rotation across all affected peers.
Hook integrity is enforced via SHA-256 checksums computed at startup, monitored via inotify, and re-verified before every execution. Mismatches block execution and trigger alerts.
Binary verification - plexd reports its own SHA-256 checksum at registration and with every heartbeat. The control plane compares against known-good checksums per version.

Key Exchange and Trust Model

plexd uses WireGuard's Noise_IKpsk2 handshake with static Curve25519 key pairs. The control plane acts as a trusted key distribution center but never has access to private keys. All events from the control plane (peer changes, policy updates, action requests, key rotations) are signed with an Ed25519 signing key and verified by the agent before processing.

Trust Chain

Bootstrap Token (one-time, short TTL)
        │
        ▼
   Control Plane  ──── Trust anchor: distributes public keys, PSKs,
        │               and its own Ed25519 signing public key
        │
        ├──► Signing Key ──── Verifies all SSE events + session JWTs
        │
        ▼
   Node Identity  ──── Public key bound to node ID and mesh IP
        │
        ▼
   Peer Tunnels   ──── WireGuard E2E encryption (private key stays local)

Phase 1: Registration

During registration the node generates its Curve25519 key pair locally. Only the public key is sent to the control plane. The private key never leaves the node.

Phase 2: Tunnel Setup

The client configures the local WireGuard interface using the registration response:

Create WireGuard interface (plexd0)
Assign mesh IP and set private key
Add each peer with its public key, endpoint, allowed IPs, and PSK
Run STUN discovery and report the node's public endpoint to the control plane
Receive NAT-discovered endpoints of peers and update WireGuard accordingly

Phase 3: Steady State

The control plane pushes peer and key updates via SSE. Every SSE event is signed with the control plane's Ed25519 signing key. The client verifies the signature before applying any change.

Signed Event Envelope:

Every SSE event is wrapped in a signed envelope. The signature covers the canonical JSON serialization of all fields except signature itself (i.e. event_type, event_id, issued_at, nonce, and payload):

json

{
  "event_type": "peer_added",
  "event_id": "evt_d4e5f6",
  "issued_at": "2025-01-15T10:30:00Z",
  "nonce": "unique-random-nonce-value",
  "payload": {
    "peer_id": "n_peer456",
    "public_key": "...",
    "mesh_ip": "10.100.1.5",
    "endpoint": "203.0.113.10:51820",
    "allowed_ips": ["10.100.1.5/32"],
    "psk": "..."
  },
  "signature": "base64-encoded-ed25519-signature"
}

Verification on every event:

Verify Ed25519 signature over the canonical JSON of all fields except signature, using the control plane's signing public key (received during registration).
Check issued_at staleness (max 5 minutes).
Check nonce uniqueness (bounded in-memory set with automatic expiry).
If any check fails, reject the event and log a security warning.

This ensures that even if the TLS connection is compromised (e.g. through a rogue proxy or certificate authority), events cannot be forged or replayed.

SSE Events:

SSE Event	Client Action
`peer_added`	Add peer with public key, endpoint, and PSK
`peer_removed`	Remove peer from WireGuard interface
`peer_key_rotated`	Replace peer's public key and PSK
`peer_endpoint_changed`	Update peer's WireGuard endpoint
`policy_updated`	Update local firewall rules
`action_request`	Validate, ACK, and execute the requested action (see Actions & Hooks)
`session_revoked`	Add session to local revocation set, reject future actions with that session's token
`ssh_session_setup`	Set up SSH session: start listener, inject session token
`rotate_keys`	Generate new Curve25519 keypair and initiate key rotation (see Phase 4: Key Rotation)
`signing_key_rotated`	Update the control plane's signing public key (see Signing Key Rotation)
`node_state_updated`	Update local node state cache (metadata, data entries) and notify Node API consumers
`node_secrets_updated`	Fetch updated secret values from control plane via HTTPS and update local secret store (names and versions only in SSE, never plaintext)

Phase 4: Key Rotation

Key rotation is triggered by the control plane - either on a schedule, by admin action, or in response to a compromised node. The rotate_keys SSE event is signed like all other events and verified before processing.

When a node is force-removed from the control plane, all peers that had a tunnel to the compromised node receive a peer_removed event followed by fresh PSKs for their remaining peer pairs.

Signing Key Rotation

The control plane's Ed25519 signing key (used for SSE event signatures and session JWTs) can be rotated independently of WireGuard mesh keys. During rotation, both the old and the new key are valid for a transition period.

The signing_key_rotated event is signed with the current (old) key, which the node already trusts. This creates a chain of trust - each key vouches for its successor.

Pre-Shared Keys (PSK)

Each peer pair uses a unique PSK generated by the control plane and distributed to both peers. PSKs provide:

Post-quantum resistance: An additional symmetric key layer on top of the Curve25519 ECDH, protecting against future quantum attacks on elliptic-curve cryptography.
Defense in depth: Even if the Curve25519 key exchange is compromised, the PSK layer prevents decryption.

PSKs are rotated together with the main key pairs and whenever a peer is removed from the mesh.

Threat Model

Scenario	Impact	Mitigation
Control plane compromised	Attacker has signing key - can forge SSE events and inject malicious peers	PSK layer for mesh traffic; signing key rotation to limit exposure window; admin-side integrity monitoring; nodes log all applied events for forensic analysis
Node compromised	Attacker has private key and NSK of one node	Force-remove node, trigger key rotation + PSK refresh on all affected peers; rotate NSK to prevent decryption of future secrets; secrets are not cached so no plaintext on disk to exfiltrate
Bootstrap token stolen	Attacker could register a rogue node	One-time-use + short TTL limits the attack window
MITM during registration	Could intercept public key exchange and signing key	TLS + server certificate validation on all control plane communication
MITM on SSE stream	Could inject forged events (peer changes, action requests, key rotations)	Ed25519 signature verification on every event; TLS as first layer; forged events rejected without valid signature
Signing key compromised	Attacker can forge SSE events until key is rotated	Signing key rotation via `signing_key_rotated` event (signed with current key); transition period for graceful rollover
Session token stolen	Attacker could execute scoped actions on target node	Short TTL, node-bound, scoped action list, revocation on session end
Unauthorized local action execution	SSH user runs actions without permission	Requires valid session JWT; `--local` restricted to root and logged as emergency
Unauthorized secret access (local)	Attacker on node reads secrets via socket or K8s Secret	Socket requires `plexd-secrets` group; K8s Secrets contain only NSK-encrypted ciphertext; decryption requires plexd API with valid bearer token + live control plane
NSK compromised	Attacker could decrypt secret ciphertext from K8s Secrets or intercepted responses	NSK rotation invalidates old key; secrets are fetched in real-time so no historical ciphertext accumulates on-node; control plane re-encrypts with new NSK

Network Requirements

plexd requires the following network connectivity. All control plane communication is outbound-initiated from the node.

Node Mode

Direction	Protocol	Port	Destination	Purpose
Outbound	TCP/443	-	Control plane API	Registration, heartbeat, observability, log/audit forwarding, callbacks
Outbound	TCP/443	-	Control plane SSE	Real-time event stream (persistent connection)
Outbound	UDP/3478, UDP/19302	-	STUN servers	NAT type discovery, public endpoint detection
Inbound/Outbound	UDP/51820	51820	Mesh peers	WireGuard encrypted mesh traffic (P2P)

Bridge Mode (additional)

Direction	Protocol	Port	Destination	Purpose
Inbound	UDP/51820	51820	NAT relay clients	WireGuard relay for nodes behind symmetric NAT
Inbound	TCP/443	443	Public internet	Public ingress (if `ingress.enabled`)
Inbound	UDP/51821	51821	User access clients	WireGuard user access (if configured)
Outbound	UDP/varies	-	Site-to-site peers	VPN tunnels to external networks

Note: Nodes behind NAT do not need any inbound port forwarding. STUN discovery and relay fallback handle NAT traversal automatically.

Security & Trust Model ​

Security Overview ​

Key Exchange and Trust Model ​

Trust Chain ​

Phase 1: Registration ​

Phase 2: Tunnel Setup ​

Phase 3: Steady State ​

Phase 4: Key Rotation ​

Signing Key Rotation ​

Pre-Shared Keys (PSK) ​

Threat Model ​

Network Requirements ​

Node Mode ​

Bridge Mode (additional) ​