Enable Observability

Use this when you need to trace LLM inference calls and tool executions with OpenTelemetry-compatible telemetry.

Prerequisites

A working awaken agent runtime (see First Agent)
Feature observability enabled on the awaken crate (enabled by default)
For OTel export: feature otel enabled on awaken-ext-observability, plus a configured OTel collector

[dependencies]
awaken = { package = "awaken-agent", version = "0.1", features = ["observability"] }
tokio = { version = "1", features = ["full"] }

Steps

use std::sync::Arc;
use awaken::engine::GenaiExecutor;
use awaken::ext_observability::{ObservabilityPlugin, InMemorySink};
use awaken::registry_spec::{AgentSpec, ModelSpec};
use awaken::{AgentRuntimeBuilder, Plugin};

let sink = InMemorySink::new();
let obs_plugin = ObservabilityPlugin::new(sink.clone());
let agent_spec = AgentSpec::new("observed-agent")
    .with_model("gpt-4o-mini")
    .with_system_prompt("You are a helpful assistant.")
    .with_hook_filter("observability");

let runtime = AgentRuntimeBuilder::new()
    .with_provider("openai", Arc::new(GenaiExecutor::new()))
    .with_model(
        "gpt-4o-mini",
        ModelSpec {
            id: "gpt-4o-mini".into(),
            provider: "openai".into(),
            model: "gpt-4o-mini".into(),
        },
    )
    .with_agent_spec(agent_spec)
    .with_plugin("observability", Arc::new(obs_plugin) as Arc<dyn Plugin>)
    .build()
    .expect("failed to build runtime");

After a run completes, inspect collected metrics:

let metrics = sink.metrics();
println!("inferences: {}", metrics.inferences.len());
println!("tool calls: {}", metrics.tools.len());
println!("total input tokens: {}", metrics.total_input_tokens());
println!("total output tokens: {}", metrics.total_output_tokens());
println!("tool failures: {}", metrics.tool_failures());
println!("session duration: {}ms", metrics.session_duration_ms);

for stat in metrics.stats_by_model() {
    println!("{}: {} calls, {} in / {} out tokens",
        stat.model, stat.inference_count, stat.input_tokens, stat.output_tokens);
}

for stat in metrics.stats_by_tool() {
    println!("{}: {} calls, {} failures",
        stat.name, stat.call_count, stat.failure_count);
}

use std::sync::Arc;
use awaken::engine::GenaiExecutor;
use awaken::ext_observability::{ObservabilityPlugin, OtelMetricsSink};
use awaken::registry_spec::{AgentSpec, ModelSpec};
use awaken::{AgentRuntimeBuilder, Plugin};
use opentelemetry_sdk::trace::SdkTracerProvider;

let provider = SdkTracerProvider::builder()
    // configure your exporter (OTLP, Jaeger, etc.)
    .build();
let tracer = provider.tracer("awaken");

let obs_plugin = ObservabilityPlugin::new(OtelMetricsSink::new(tracer));
let agent_spec = AgentSpec::new("observed-agent")
    .with_model("gpt-4o-mini")
    .with_system_prompt("You are a helpful assistant.")
    .with_hook_filter("observability");

let runtime = AgentRuntimeBuilder::new()
    .with_provider("openai", Arc::new(GenaiExecutor::new()))
    .with_model(
        "gpt-4o-mini",
        ModelSpec {
            id: "gpt-4o-mini".into(),
            provider: "openai".into(),
            model: "gpt-4o-mini".into(),
        },
    )
    .with_agent_spec(agent_spec)
    .with_plugin("observability", Arc::new(obs_plugin) as Arc<dyn Plugin>)
    .build()
    .expect("failed to build runtime");

Implement a custom sink (optional).

use awaken::ext_observability::{MetricsSink, GenAISpan, ToolSpan, AgentMetrics};

struct MySink;

impl MetricsSink for MySink {
    fn on_inference(&self, span: &GenAISpan) {
        // forward to your metrics system
    }

    fn on_tool(&self, span: &ToolSpan) {
        // forward to your metrics system
    }

    fn on_run_end(&self, metrics: &AgentMetrics) {
        // emit summary metrics
    }
}

Captured telemetry.

The plugin hooks into the following phases:

Phase	Data Captured
`RunStart`	Session start timestamp
`BeforeInference`	Inference start timestamp, model, provider
`AfterInference`	Token usage, finish reasons, duration, cache tokens
`BeforeToolExecute`	Tool call start timestamp
`AfterToolExecute`	Tool duration, error status
`RunEnd`	Session duration

OTel spans follow GenAI semantic conventions with attributes such as gen_ai.system, gen_ai.request.model, gen_ai.usage.input_tokens, and gen_ai.usage.output_tokens.

Verify

Run an agent with the InMemorySink.
After run() completes, call sink.metrics().
Confirm inferences is non-empty and token counts are populated.
For OTel, check your collector or Jaeger UI for spans named with the awaken tracer.

Common Errors

Symptom	Cause	Fix
Metrics are all zero	Plugin not registered	Register `ObservabilityPlugin` with the runtime builder
`OtelMetricsSink` not found	Missing `otel` feature	Enable the `otel` feature on `awaken-ext-observability`
No spans in collector	Exporter not configured	Verify `SdkTracerProvider` has an exporter and is not dropped
Token counts missing	LLM provider does not report usage	Check that your `LlmExecutor` returns `TokenUsage` in `LLMResponse`

crates/awaken-ext-observability/tests/

Key Files

Path	Purpose
`crates/awaken-ext-observability/src/lib.rs`	Module root and public re-exports
`crates/awaken-ext-observability/src/plugin/plugin.rs`	`ObservabilityPlugin` registration
`crates/awaken-ext-observability/src/plugin/hooks.rs`	Phase hooks for each telemetry point
`crates/awaken-ext-observability/src/metrics.rs`	`AgentMetrics`, `GenAISpan`, `ToolSpan` types
`crates/awaken-ext-observability/src/sink.rs`	`MetricsSink` trait and `InMemorySink`
`crates/awaken-ext-observability/src/otel.rs`	`OtelMetricsSink` with GenAI semantic conventions

Keyboard shortcuts

Awaken