If you’re new to OpenSRE or want a conceptual overview, see How OpenSRE fits in your stack.
What cloud-native monitoring tools do well
Cloud-native monitoring platforms are designed to:- Report metrics emitted by cloud services and infrastructure
- Track instance, container, and service health
- Collect logs and events from managed resources
- Trigger alerts based on thresholds and service state
Where cloud-reported metrics stop
Because cloud-native monitoring relies on service-reported metrics and resource state, it often lacks visibility into:- What individual processes and containers are doing at runtime
- Whether allocated resources are actively used or idle
- Execution stalls caused by I/O or network contention
- Short-lived jobs that complete between reporting intervals
- Infrastructure that remains allocated after workloads finish
Execution behavior versus infrastructure state
Cloud-native monitoring answers questions such as:- Which instances are running?
- How much CPU or memory is allocated?
- Are services healthy?
- Which task or process consumed those resources
- Whether work was compute-bound, I/O-bound, or idle
- Why infrastructure remains allocated without active execution
What OpenSRE adds
OpenSRE observes execution directly from the operating system and container runtime. When used alongside cloud-native monitoring, it adds:- Execution-level visibility for pipelines, runs, tasks, and tools
- Observed CPU, memory, disk, and network behavior
- Insight into stalls, idle execution, and contention
- Attribution of resource usage and cost to actual work
Infrastructure state analysis with OpenSRE/sweep
OpenSRE/sweep extends OpenSRE’s visibility to infrastructure state. It observes cloud resources directly using read-only access and identifies:- Idle compute resources with no active execution
- Orphaned nodes no longer associated with schedulers or clusters
- Unattached or residual storage left behind by completed workloads
OpenSRE/sweep does not modify or delete resources automatically.
Supported cloud-native monitoring integrations
The integration below describes how OpenSRE works alongside common cloud-native monitoring platforms.OpenSRE and AWS CloudWatch
Execution insight beyond service-level metricsOpenSRE observes execution behavior and infrastructure state that CloudWatch does not expose.
When OpenSRE is useful with cloud-native monitoring
OpenSRE is most useful alongside cloud-native monitoring when teams need to:- Explain slow or inconsistent pipeline runtimes
- Distinguish execution bottlenecks from infrastructure waste
- Identify idle or orphaned compute and storage resources
- Attribute cost to pipelines, tasks, or tools rather than instances
Where to go next
- How OpenSRE fits in your stack – conceptual overview
- AWS CloudWatch integration – execution and infrastructure visibility on AWS