Observability

Metrics, logs and traces — the full stack from cluster to portal.

gh0stcloud runs a complete observability stack built on open-source CNCF and Grafana projects. All telemetry is collected with tenant context, stored long-term, and made accessible through the portal, so tenants do not need their own monitoring infrastructure.

Telemetry pipeline

Figure: Observability pipeline from cluster to portal. Alloy and Prometheus collect in the runtime; Mimir, Loki and Tempo store long-term in the Management Plane.

Stack components

Component | Purpose | Layer
Prometheus | Scraping workload and system metrics | Runtime
Alloy | Log collection and trace forwarding (OpenTelemetry) | Runtime
Beyla | eBPF-based auto-instrumentation without code changes | Runtime
Mimir | Long-term metrics storage (TSDB, Grafana-compatible) | Management Plane
Loki | Log aggregation and querying | Management Plane
Tempo | Distributed tracing (OTLP-compatible) | Management Plane
Grafana | Dashboards, alerting, SLO monitoring | Management Plane
Grafana Operator | GitOps-managed Grafana instances | Management Plane
OpenCost | Kubernetes cost attribution per namespace | Management Plane

Tenant telemetry context

All metrics, logs and traces are enriched with tenant context before they enter the backend. Tenant scope is enforced at the backend level — not just as a dashboard filter. A query from Tenant A structurally cannot return data from Tenant B.
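
As a rough illustration of backend-level tenant scoping: Mimir, Loki and Tempo identify the tenant of each request through the `X-Scope-OrgID` HTTP header. The sketch below assumes gh0stcloud relies on that mechanism, which this page does not confirm. The endpoint URL and the `build_tenant_query` helper are hypothetical.

```python
from urllib.parse import urlencode

# Hypothetical endpoint; the real gh0stcloud URL is not given in this document.
MIMIR_QUERY_URL = "https://mimir.example.internal/prometheus/api/v1/query"


def build_tenant_query(tenant_id: str, promql: str) -> tuple[str, dict[str, str]]:
    """Return the URL and headers for an instant query scoped to one tenant."""
    if not tenant_id or "|" in tenant_id:
        # Mimir reads "a|b" as a query across several tenants, so reject it here.
        raise ValueError("exactly one tenant ID is required")
    url = f"{MIMIR_QUERY_URL}?{urlencode({'query': promql})}"
    # The backend resolves the tenant from this header, not from the query text.
    headers = {"X-Scope-OrgID": tenant_id}
    return url, headers
```

Because the tenant is taken from the header rather than from a label in the query, a PromQL expression alone cannot select another tenant's series. In a setup like this, the header would normally be set by a trusted gateway after authentication, not by the tenant.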

Access model by tier

Tier | Observability access
Starter | Aggregated usage trends and runtime summaries in the portal dashboard
Growth | Namespace-granular metrics, log streaming, access to Grafana dashboards
Enterprise | Full access to metrics + logs + traces, custom Grafana dashboards, alert configuration

Retention

Retention windows are configured per signal type and contract tier:

Signal | Default retention
Metrics (Mimir) | 90 days
Logs (Loki) | 30 days
Traces (Tempo) | 14 days

Extended retention is available for Enterprise contracts.
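
To make the default windows concrete, the following sketch computes the oldest timestamp that is still queryable for each signal. It uses only the default values from this section. The function names are illustrative, and extended Enterprise retention is not modeled.

```python
from datetime import datetime, timedelta, timezone

# Default retention windows from the Retention section, in days.
DEFAULT_RETENTION_DAYS = {"metrics": 90, "logs": 30, "traces": 14}


def oldest_queryable(signal: str, now: datetime) -> datetime:
    """Return the earliest timestamp still inside the default retention window."""
    return now - timedelta(days=DEFAULT_RETENTION_DAYS[signal])


def is_retained(signal: str, timestamp: datetime, now: datetime) -> bool:
    """Check whether data recorded at `timestamp` should still be available."""
    return timestamp >= oldest_queryable(signal, now)


if __name__ == "__main__":
    now = datetime.now(timezone.utc)
    for signal in DEFAULT_RETENTION_DAYS:
        print(signal, "queryable back to", oldest_queryable(signal, now).date())
```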

Alerting

Grafana Alertmanager is pre-configured for platform SLOs. From the Growth tier upward, tenants can define their own alert rules, with notifications delivered via webhook, email or PagerDuty.
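
For tenants who choose webhook delivery, a receiver could process notifications roughly as sketched below. This assumes the payload follows the Alertmanager webhook format (a top-level `alerts` list whose entries carry `status` and `labels`). The exact payload gh0stcloud sends is not specified here, and `summarize_alerts` is an illustrative name.

```python
import json


def summarize_alerts(payload: str) -> list[str]:
    """Turn an Alertmanager-style webhook body into one summary line per alert."""
    body = json.loads(payload)
    lines = []
    for alert in body.get("alerts", []):
        labels = alert.get("labels", {})
        name = labels.get("alertname", "unknown")
        severity = labels.get("severity", "none")
        status = alert.get("status", "unknown")
        lines.append(f"[{status}] {name} (severity={severity})")
    return lines


if __name__ == "__main__":
    # Made-up sample payload for illustration only.
    sample = json.dumps({
        "status": "firing",
        "alerts": [
            {"status": "firing",
             "labels": {"alertname": "HighErrorRate", "severity": "critical"}},
        ],
    })
    for line in summarize_alerts(sample):
        print(line)
```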

No custom monitoring stack needed

Beyla instruments applications automatically via eBPF — without code changes, without a sidecar. Tenants immediately get latency, error rate and throughput metrics for their services, even if they have no OpenTelemetry SDK integrated.
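
The latency, error rate and throughput signals can be queried with PromQL once the metrics reach Mimir. The sketch below builds three such queries. It assumes Beyla's HTTP server histogram is exposed as `http_server_request_duration_seconds` with `service_name` and `http_response_status_code` labels. The actual metric and label names depend on how the platform configures export, so treat them as placeholders.

```python
# Assumed metric name; verify against the metrics actually exposed in the portal.
METRIC = "http_server_request_duration_seconds"


def red_queries(service: str, window: str = "5m") -> dict[str, str]:
    """Build PromQL for request rate, error ratio and p95 latency of one service."""
    sel = f'service_name="{service}"'  # assumed label name
    return {
        # Requests per second over the window.
        "throughput": f"sum(rate({METRIC}_count{{{sel}}}[{window}]))",
        # Share of requests answered with a 5xx status code.
        "error_rate": (
            f'sum(rate({METRIC}_count{{{sel},http_response_status_code=~"5.."}}[{window}]))'
            f" / sum(rate({METRIC}_count{{{sel}}}[{window}]))"
        ),
        # 95th percentile latency estimated from the histogram buckets.
        "p95_latency": (
            "histogram_quantile(0.95, sum by (le) "
            f"(rate({METRIC}_bucket{{{sel}}}[{window}])))"
        ),
    }


if __name__ == "__main__":
    for name, query in red_queries("checkout").items():
        print(f"{name}: {query}")
```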
