mipo

No Heartbeat

A scanner stopped heartbeating but has assigned or running jobs. The threshold is 2× the offline-after window (4 minutes by default).

How to

Investigate

  1. Check Health > Services > Scanners for the affected scanner
  2. SSH to the scanner host (if reachable) and check the binary process / logs
  3. Network: confirm outbound HTTPS to the platform is reachable from the scanner network

If the scanner is coming back

  1. Drain the scanner (24h grace) so the dispatcher does not pile on more jobs while it stabilizes

If the scanner is gone

  1. Reassign jobs to another scanner OR fail them outright

Gotchas

  • Recently provisioned scanners may not have heartbeated yet (no last_heartbeat) — those do NOT trigger no_heartbeat (use failed_scanner_auth checks).
  • A heartbeat resumes auto-resolution; the alarm and state row clear without operator action.

API calls (1)

MethodPathDescription
POST /api/health/job-errors/:stateId/repair action=drain | reassign | fail

Related