Vigil
Sub-threshold anomaly detection. Watches event streams, trace samples, and contract violations for pattern divergence that doesn't trip any single alarm.
Problem Statement
Real systemic failures often present as 'too many small things' rather than as a single threshold breach. '47 LLM timeouts in an hour for tenant X on the same prompt' is alarming as a cluster but invisible as individual events.
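To make the clustering idea concrete, here is a minimal sketch of counting similar events inside a sliding window. It is an illustration under stated assumptions, not Vigil's actual implementation: the `Event` shape, the `ClusterDetector` class, the grouping key (tenant, kind, prompt), and the `window_seconds` and `min_count` defaults are all hypothetical. It also assumes events arrive in timestamp order.

```python
from collections import defaultdict, deque
from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    # Hypothetical event shape; field names are assumptions for illustration.
    timestamp: float  # seconds since some epoch
    tenant: str
    kind: str         # e.g. "llm_timeout"
    prompt_id: str


class ClusterDetector:
    """Flags groups of similar events that pile up inside a sliding window."""

    def __init__(self, window_seconds: float = 3600.0, min_count: int = 40):
        # Both defaults are illustrative, not values taken from Vigil.
        self.window_seconds = window_seconds
        self.min_count = min_count
        self._seen = defaultdict(deque)  # grouping key -> timestamps in window

    def observe(self, event: Event) -> bool:
        """Record one event; return True once its cluster reaches min_count."""
        key = (event.tenant, event.kind, event.prompt_id)
        stamps = self._seen[key]
        stamps.append(event.timestamp)
        # Drop timestamps that have aged out of the window
        # (assumes events are observed in timestamp order).
        cutoff = event.timestamp - self.window_seconds
        while stamps and stamps[0] <= cutoff:
            stamps.popleft()
        return len(stamps) >= self.min_count


# Usage: 47 timeouts, one per minute, same tenant and prompt.
detector = ClusterDetector(window_seconds=3600.0, min_count=40)
flags = [
    detector.observe(Event(minute * 60.0, "tenant-x", "llm_timeout", "prompt-a"))
    for minute in range(47)
]
```

No single event in this run is unusual on its own; the detector only returns True once the count for that (tenant, kind, prompt) group reaches `min_count` within the window. A fixed count is the simplest possible rule; a comparison against a per-group baseline rate would be a natural refinement.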
MIT · Python