Monitor worker nodes to detect data source connection issues. Automatically reconfigure problematic nodes to reestablish a functional connection.
Avoid query bottlenecks by monitoring and automatically scaling worker and coordinator nodes. Dynamically adjust query.max-memory and query.max-memory-per-node.
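As a rough illustration of the dynamic adjustment described above, the sketch below recomputes the two memory properties whenever the worker count changes. The function name, the 30% per-node fraction, and the rule that the cluster-wide limit equals the per-node limit times the worker count are all assumptions made for this example, not the product's actual sizing policy.

```python
def memory_settings(worker_count, heap_gb_per_node, per_node_fraction=0.3):
    """Suggest values for the two query memory properties (hypothetical policy)."""
    if worker_count < 1:
        raise ValueError("worker_count must be at least 1")
    # Assumption: give each query a fixed fraction of a node's heap, at least 1 GB.
    per_node_gb = max(1, int(heap_gb_per_node * per_node_fraction))
    # Assumption: the cluster-wide limit scales linearly with the number of workers.
    total_gb = per_node_gb * worker_count
    return {
        "query.max-memory-per-node": f"{per_node_gb}GB",
        "query.max-memory": f"{total_gb}GB",
    }
```

Calling `memory_settings(10, 64)` after scaling to ten workers with 64 GB heaps would yield new values to write into the cluster configuration.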
Track read/write capacity unit usage and get notified when consumption approaches your reserved capacity.
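A minimal sketch of the near-capacity check, assuming consumed and reserved capacity units have already been collected per table. The `CapacityReading` type, the `near_capacity` function, and the 80% default threshold are hypothetical names and values chosen for illustration.

```python
from dataclasses import dataclass


@dataclass
class CapacityReading:
    table: str
    consumed_units: float     # capacity units consumed per second (averaged)
    provisioned_units: float  # capacity units reserved per second


def near_capacity(readings, threshold=0.8):
    """Return (table, utilization) pairs at or above the threshold."""
    alerts = []
    for r in readings:
        if r.provisioned_units <= 0:
            continue  # nothing reserved, so utilization is undefined
        utilization = r.consumed_units / r.provisioned_units
        if utilization >= threshold:
            alerts.append((r.table, round(utilization, 2)))
    return alerts
```

Each returned pair can then be handed to whatever notification channel is in use.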
Monitor and automatically compact collections, repair databases, and resize the oplog. Automatically increase EC2 node disk sizes for self-hosted clusters with minimal downtime.
Get notified and (optionally) restart daemons when process count, max_used_connections, or aborted connections exceed your thresholds. Also, watch and alert on deadlocks and row locks.
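The threshold logic can be sketched as a comparison of sampled metrics against user-defined limits, with the restart left as an opt-in flag. The metric keys, the `evaluate` function, and the returned action names are assumptions for this example; collecting the metrics and performing the restart are out of scope here.

```python
def evaluate(metrics, thresholds, restart_enabled=False):
    """Compare sampled metrics to limits and decide which actions to take."""
    # A metric breaches only when it strictly exceeds its limit;
    # metrics missing from the sample are treated as 0.
    breached = sorted(
        name for name, limit in thresholds.items()
        if metrics.get(name, 0) > limit
    )
    actions = []
    if breached:
        actions.append("notify")
        if restart_enabled:
            actions.append("restart")  # optional, off by default
    return breached, actions
```

For example, `evaluate({"max_used_connections": 480}, {"max_used_connections": 400})` reports the breach and returns only the notify action unless `restart_enabled=True` is passed.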
Monitor uptime, configurations, directory listings, server signatures, active worker requests, resource usage, and more in real time. Automatically notify operators and (optionally) restart the server.
Receive automatic notifications when Envoy and Istio diffs breach a user-defined threshold. Watch for Istio proxy container connectivity issues and get immediate alerts.