Environment Health Report
staging-us, live, last 30 days
Stability: 97.3% (computed from crash and uptime ratio)
Total Crashes: 3 (stored in SQLite event history)
OOM Events: 1 (0.6 avg restarts/day)
Per-Container Health
| Container | Status | Uptime | Restarts | Last Crash | Stability |
|---|---|---|---|---|---|
| nginx-proxy | Running 3d 14h | 3d 14h | 0 | Never | 100% |
| api-service | Restarting | 2m | 14 | 6/6/2026 (OOM killed) | 72% |
| postgres | Running 21d | 21d 7h | 0 | Never | 100% |
| redis-cache | Exited (137) | -- | 3 | 6/5/2026 (SIGKILL, often OOM) | 94% |
| worker-queue | Running 7d | 7d 2h | 1 | 5/30/2026 (Application error) | 98% |
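The Restarts column can be cross-checked against the summary figures with a few lines of arithmetic. This is a minimal sketch that assumes the "0.6 avg restarts/day" figure is the total restart count divided by the 30-day report window; the report does not state the formula, and the variable names here are illustrative.

```python
# Restart counts copied from the Per-Container Health table.
restarts = {
    "nginx-proxy": 0,
    "api-service": 14,
    "postgres": 0,
    "redis-cache": 3,
    "worker-queue": 1,
}
WINDOW_DAYS = 30  # "last 30 days" from the report header

total_restarts = sum(restarts.values())
# Assumption: the average is total restarts over the report window.
avg_per_day = total_restarts / WINDOW_DAYS

print(f"{total_restarts} total, {avg_per_day:.1f} avg restarts/day")
```

Under that assumption the result agrees with both the 18 total restarts in the metrics table and the 0.6 average in the summary.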
Crash Feed
Jun 6, 03:12 AM
api-service: Exit 137 (OOM killed)
Jun 5, 10:44 PM
redis-cache: Exit 137 (SIGKILL, often OOM)
May 30, 01:44 AM
worker-queue: Exit 1 (Application error)
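The crash reasons in this feed follow the usual exit-code convention: codes above 128 mean the process was terminated by signal number (code - 128), so 137 is signal 9 (SIGKILL). The sketch below shows one way such a lookup could work. The function name `classify_exit` and the `oom_killed` parameter are hypothetical, and the labels are copied from the feed; the tool's actual exit code table is not shown in this report.

```python
# Signal names for the most common container terminations.
SIGNAL_NAMES = {9: "SIGKILL", 15: "SIGTERM"}


def classify_exit(exit_code: int, oom_killed: bool = False) -> str:
    """Map an exit code (plus an optional OOM flag) to a crash reason label."""
    if oom_killed:
        # An explicit OOM flag is stronger evidence than the exit code alone.
        return "OOM killed"
    if exit_code == 0:
        return "Clean exit"
    if exit_code > 128:
        sig = exit_code - 128
        if sig == 9:
            # SIGKILL is what the kernel OOM killer sends, but it can
            # also come from `docker kill` or an operator.
            return "SIGKILL, often OOM"
        name = SIGNAL_NAMES.get(sig, f"signal {sig}")
        return f"Killed by {name}"
    return "Application error"


print(classify_exit(137, oom_killed=True))
print(classify_exit(137))
print(classify_exit(1))
```

Separating the OOM flag from the exit code explains why two containers with the same exit code 137 carry different labels: only api-service has a confirmed OOM kill.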
Recommendations
api-service: OOM events detected
redis-cache: SIGKILL exit code 137
DevOps Health Metrics
| Metric | Source | Frequency | Current Value |
|---|---|---|---|
| Container uptime | docker inspect State.StartedAt | 60s | 4d 12h |
| Creation time | docker inspect Created | snapshot | 2026-05-31 |
| Restart count | docker inspect RestartCount | 60s | 18 total |
| Exit code | docker events die | real-time | 137, 1 |
| OOM kill flag | docker events oom | real-time | 3 events |
| Last crash timestamp | docker events | real-time | Today 03:12 |
| Crash reason | exit code table | on event | OOM killed |
| Crash duration | computed runtime | on event | 2d 4h |
| Health check status | docker inspect State.Health | 30s | 1 unhealthy |
| CPU usage | docker stats | 5s | 34% |
| Memory usage | docker stats | 5s | 6.2 / 16 GB |
| Network I/O | docker stats | 5s | 2.3 MB/s up |
| Block I/O | docker stats | 5s | 41 MB read |
| Stability score | computed | 60s | 97.3% |
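Several rows in this table read fields from `docker inspect`, which prints a JSON array with one object per container. The following is a rough sketch of how those fields could be turned into the values shown in the report. The helper names (`parse_docker_time`, `format_uptime`, `summarize`) are hypothetical, and the sample JSON is made-up illustration data, not output from this environment. It also assumes timestamps are UTC with a trailing `Z`.

```python
import json
from datetime import datetime, timedelta, timezone


def parse_docker_time(ts: str) -> datetime:
    """Parse a Docker timestamp, dropping the nanosecond fraction."""
    base = ts.rstrip("Z").split(".")[0]
    return datetime.strptime(base, "%Y-%m-%dT%H:%M:%S").replace(tzinfo=timezone.utc)


def format_uptime(delta: timedelta) -> str:
    """Render a duration in the report's 'Nd Nh' style."""
    total_hours = int(delta.total_seconds() // 3600)
    days, hours = divmod(total_hours, 24)
    return f"{days}d {hours}h"


def summarize(inspect_json: str, now: datetime) -> dict:
    """Extract the health fields listed in the metrics table."""
    info = json.loads(inspect_json)[0]
    state = info["State"]
    # State.Health is absent when the image defines no health check.
    health = (state.get("Health") or {}).get("Status", "none")
    running = state.get("Running", False)
    started = parse_docker_time(state["StartedAt"])
    return {
        "uptime": format_uptime(now - started) if running else "--",
        "restarts": info.get("RestartCount", 0),
        "exit_code": state.get("ExitCode"),
        "oom_killed": state.get("OOMKilled", False),
        "health": health,
    }


# Illustration data only; real input would come from `docker inspect <name>`.
sample = json.dumps([{
    "Created": "2026-05-31T08:00:00.000000000Z",
    "RestartCount": 0,
    "State": {
        "Running": True,
        "OOMKilled": False,
        "ExitCode": 0,
        "StartedAt": "2026-06-02T13:12:00.123456789Z",
    },
}])
now = datetime(2026, 6, 6, 3, 12, tzinfo=timezone.utc)
summary = summarize(sample, now)
print(summary)
```

In practice the JSON string could be captured with `subprocess.run(["docker", "inspect", name], capture_output=True, text=True)` and passed to `summarize` on the 60-second cadence listed in the table.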