If you’re new to OpenSRE or want a conceptual overview, see How OpenSRE fits in your stack.
What observability tools do well
Observability and telemetry platforms are designed to:- Collect metrics emitted by applications and system exporters
- Store and query time-series data
- Visualize metrics, logs, and traces in dashboards
- Trigger alerts based on thresholds or rules
Where reported telemetry stops
Because observability and telemetry tools depend on emitted signals and periodic collection, they often lack visibility into:- Short-lived processes and subprocesses that start and finish between scrapes
- Execution behavior between metric intervals, including stalls and idle time
- Resource contention inside containers or tasks, such as I/O blocking or memory pressure
- How reported metrics map to specific pipeline runs, steps, or tools
- How infrastructure cost relates to actual execution, rather than to hosts, services, or time windows
- When processes are scheduled
- When they block on I/O
- How memory is allocated and reclaimed
- How long resources are consumed by each execution unit
What OpenSRE adds
OpenSRE observes execution directly from the operating system and runtime. When used alongside observability and telemetry tools, it adds:- Execution-level visibility for pipelines, runs, tasks, and tools
- Observed CPU, memory, disk, and network behavior
- Insight into stalls, idle execution, and contention
- Resource usage and cost attribution aligned with actual work
How OpenSRE works with current observability and telemetry tools
The pages below describe how OpenSRE works alongside common observability platforms used in scientific, data, HPC, and cloud environments. Select a tool to see how OpenSRE adds additional observability.OpenSRE and Datadog
Pipeline-level insight within broad observabilityOpenSRE organizes execution behavior around pipelines instead of services or hosts.
OpenSRE and Grafana
Execution-aware dashboards without manual wiringOpenSRE provides pipeline-aware views that reduce the need to infer execution behavior from generic dashboards.
OpenSRE and Prometheus
Observed execution versus scraped metricsOpenSRE captures runtime behavior that may not appear in scraped or aggregated metrics.
When OpenSRE is useful with observability tools
OpenSRE is most useful alongside observability and telemetry platforms when teams need to:- Understand pipeline behavior beyond reported metrics
- Diagnose performance issues involving short-lived tasks
- Attribute resource usage and cost to specific workflows or tools
- Reduce manual dashboard configuration and metric correlation
Where to go next
- How OpenSRE fits in your stack – conceptual overview
- Individual integration pages – tool-specific execution gaps and observability comparisons