
As teams begin integrating AWS Bedrock into their applications, a new challenge appears almost immediately: how do we monitor model performance, latency, errors, and overall usage in a way that is actually actionable?

It’s not enough to simply know that your model was invoked. You need to understand how long it took, how often it’s being used, and whether the model is contributing to latency spikes or cost increases. These signals are essential when you are wrapping Bedrock calls inside Lambda functions, API endpoints, or backend services.

This guide walks through a practical setup for collecting both logs and metrics from Bedrock and streaming them into OpenObserve, where you can build dashboards, perform correlation, and set up real-time monitoring. The approach is simple, reliable, and works at scale, especially when your model invocation traffic begins to grow.

Why Monitor Bedrock?

Modern LLM-based applications behave differently from traditional microservices. Latency depends not only on your system but also on the model provider. Throughput fluctuates with token counts, and failures can occur due to quota limits, safety filters, or malformed prompts.

When you monitor Bedrock, you gain visibility into:

  • Invocation latency (p50, p90, p99)
  • Total invocation volume
  • Model-wise usage patterns
  • Error responses from the model
  • Payload sizes (tokens in / tokens out)
  • Cost-driving metrics
  • How Bedrock latency impacts upstream services (like your Lambda function)

Monitoring becomes essential when Bedrock sits on the critical path of a user-facing workflow.

Architecture Overview

In most setups, the model invocation happens inside a Lambda function using the Bedrock client. Both Lambda and Bedrock produce logs, and Bedrock also emits metrics such as invocation latency. All of this lands in CloudWatch. From there, we export it with Kinesis Data Firehose and deliver everything into OpenObserve.

Architecture Flow for Monitoring Amazon Bedrock

This single pipeline gives you unified visibility in OpenObserve.

Step-by-Step Setup: Monitoring AWS Bedrock

Prerequisites

Before sending data to OpenObserve, ensure a few foundational pieces are ready inside AWS.

  • Execution environment: Your function must have permission to invoke Bedrock models (bedrock:InvokeModel, bedrock:InvokeModelWithResponseStream).

Sample Execution Environment: Lambda Function
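
If you do not already have such a function, the sketch below shows a minimal Lambda handler that calls Bedrock with boto3. The model ID, prompt format, and region are illustrative assumptions; adapt them to the model family you actually use.

    import json
    import boto3

    # Bedrock runtime client; the region is an assumption and must be one where the model is enabled.
    bedrock = boto3.client("bedrock-runtime", region_name="us-east-1")

    # Illustrative model ID; replace with a model you have access to.
    MODEL_ID = "anthropic.claude-3-haiku-20240307-v1:0"

    def lambda_handler(event, context):
        prompt = event.get("prompt", "Summarize what observability means in one sentence.")

        # Anthropic models on Bedrock use the Messages API request body shown here.
        body = json.dumps({
            "anthropic_version": "bedrock-2023-05-31",
            "max_tokens": 256,
            "messages": [{"role": "user", "content": prompt}],
        })

        # Requires bedrock:InvokeModel on the function's execution role.
        response = bedrock.invoke_model(
            modelId=MODEL_ID,
            body=body,
            contentType="application/json",
            accept="application/json",
        )

        result = json.loads(response["body"].read())
        return {"statusCode": 200, "body": json.dumps(result)}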

With these basics in place, you’re ready to build the pipeline.

Step 1: Create an Amazon Kinesis Data Firehose

1. In the AWS services menu, search for Kinesis and select it.

2. Create a Delivery Stream

  • Click on Create Firehose Stream.
  • Choose Direct PUT as the source and click Next.

3. Configure Destination

  • Select HTTP Endpoint as the destination.

Data Firehose Destination Configuration

  • Endpoint URL: Enter your OpenObserve Kinesis Firehose endpoint and Access Key.

OpenObserve Credentials

Example format:

    https://<your-openobserve-domain>/aws/default/<stream-name>/_kinesis_firehose

4. Configure Backup Settings

  • Choose whether to back up all data or only failed data. For this example, select Failed data only.
  • Choose an existing S3 bucket or create a new one for backups (e.g., lambda-firehose-backup) and click Next.

5. Finalize Firehose Stream Setup

  • Click Next, configure buffer size and interval as needed (default values will suffice for this example), then click Next again.
  • Review all settings and click on Create Firehose stream.
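
If you prefer to script the stream instead of clicking through the console, a rough boto3 equivalent looks like the sketch below. The stream name, ARNs, bucket, and endpoint URL are placeholders you must replace, and the IAM role needs permission to write to the backup bucket.

    import boto3

    firehose = boto3.client("firehose")

    # All names, ARNs, and the endpoint URL below are placeholders (assumptions).
    firehose.create_delivery_stream(
        DeliveryStreamName="openobserve-bedrock-stream",
        DeliveryStreamType="DirectPut",
        HttpEndpointDestinationConfiguration={
            "EndpointConfiguration": {
                "Name": "OpenObserve",
                "Url": "https://<your-openobserve-domain>/aws/default/<stream-name>/_kinesis_firehose",
                "AccessKey": "<your-openobserve-access-key>",
            },
            "BufferingHints": {"SizeInMBs": 5, "IntervalInSeconds": 60},
            "RoleARN": "arn:aws:iam::<account-id>:role/<firehose-delivery-role>",
            "S3BackupMode": "FailedDataOnly",
            "S3Configuration": {
                "RoleARN": "arn:aws:iam::<account-id>:role/<firehose-delivery-role>",
                "BucketARN": "arn:aws:s3:::lambda-firehose-backup",
            },
        },
    )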

Step 2: Enable CloudWatch Logging for AWS Bedrock

  1. Enable AWS Bedrock logging so the service can publish invocation activity directly into CloudWatch Logs. Bedrock automatically sends details such as invocation metadata, latency, errors, and model information into its own CloudWatch Log Group once logging is enabled (a scripted alternative is sketched at the end of this step).

Enable AWS Bedrock Logging

You can select which data types to include in the logs, as well as the destination. Sending a copy to S3 is recommended for backups.

Select data types to include

2. Identify the Bedrock-specific Log Group created in CloudWatch. This is the log source you will stream into OpenObserve.

Bedrock-specific Log Group
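
Enabling logging through the console is usually simplest, but the same configuration can be applied with the Bedrock control-plane API. A hedged boto3 sketch, assuming the log group, IAM role, and S3 bucket below already exist (all names are placeholders):

    import boto3

    bedrock = boto3.client("bedrock")  # control-plane client, not bedrock-runtime

    # Log group, role, and bucket names are placeholders (assumptions).
    bedrock.put_model_invocation_logging_configuration(
        loggingConfig={
            "cloudWatchConfig": {
                "logGroupName": "/aws/bedrock/model-invocations",
                "roleArn": "arn:aws:iam::<account-id>:role/<bedrock-logging-role>",
            },
            "s3Config": {
                "bucketName": "<bedrock-logs-backup-bucket>",
                "keyPrefix": "bedrock/",
            },
            "textDataDeliveryEnabled": True,
            "imageDataDeliveryEnabled": False,
            "embeddingDataDeliveryEnabled": False,
        }
    )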

Step 3: Stream CloudWatch Logs to the Firehose

To capture Bedrock invocation logs:

  1. Go to the Bedrock log group in CloudWatch Logs.
  2. Add a Subscription Filter.
  3. Choose Kinesis Data Firehose as the destination.

Create a Subscription Filter

  4. Select the same delivery stream you created earlier.

Kinesis Data Firehose

Every log line, including structured model invocation logs, will now flow into OpenObserve.
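
The subscription filter can also be added programmatically. The log group name and ARNs below are placeholders, and the IAM role must allow CloudWatch Logs to put records into the Firehose stream (firehose:PutRecord and firehose:PutRecordBatch).

    import boto3

    logs = boto3.client("logs")

    # Placeholder names and ARNs (assumptions).
    logs.put_subscription_filter(
        logGroupName="/aws/bedrock/model-invocations",
        filterName="bedrock-to-openobserve",
        filterPattern="",  # an empty pattern forwards every log event
        destinationArn="arn:aws:firehose:us-east-1:<account-id>:deliverystream/openobserve-bedrock-stream",
        roleArn="arn:aws:iam::<account-id>:role/<cwlogs-to-firehose-role>",
    )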

Step 4: Create a CloudWatch Metric Stream

AWS automatically emits Bedrock metrics such as:

  • Invocations
  • InvocationLatency
  • InvocationClientErrors and InvocationServerErrors
  • InvocationThrottles

These metrics do not need to be "turned on." They simply exist. What you need to configure is how to export them.

Metric Streams are the cleanest way to deliver metrics out of CloudWatch in near-real-time.

  1. Inside CloudWatch, create a new Metric Stream. When choosing a destination, select Kinesis Data Firehose. This ensures metrics are continuously pushed rather than pulled.

  2. Select the AWS/Bedrock namespace. Include all the metrics you want to export.

Once selected, you will attach this stream to the Firehose delivery stream you created.
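
Equivalently, the Metric Stream can be created with the CloudWatch API. The ARNs below are placeholders, and the role must allow CloudWatch to write to the Firehose stream.

    import boto3

    cloudwatch = boto3.client("cloudwatch")

    # Placeholder ARNs (assumptions); IncludeFilters limits the stream to Bedrock metrics.
    cloudwatch.put_metric_stream(
        Name="bedrock-metrics-to-openobserve",
        IncludeFilters=[{"Namespace": "AWS/Bedrock"}],
        FirehoseArn="arn:aws:firehose:us-east-1:<account-id>:deliverystream/openobserve-bedrock-stream",
        RoleArn="arn:aws:iam::<account-id>:role/<metric-stream-role>",
        OutputFormat="json",
    )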

Step 5: Verify Data in OpenObserve

To test the flow, invoke the Lambda function that calls the Bedrock model (a quick way to do this from a terminal is sketched after the steps below).

  1. In the OpenObserve UI, go to Logs, select the stream you created, and you will find the log records:

Verify Bedrock logs in OpenObserve

2. You can check for metrics records in the corresponding stream.

Verify Bedrock metrics in OpenObserve
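
A quick way to generate test traffic is to invoke the Lambda function directly, for example with boto3. The function name and payload below are placeholders.

    import json
    import boto3

    lambda_client = boto3.client("lambda")

    # Placeholder function name and payload (assumptions).
    response = lambda_client.invoke(
        FunctionName="<your-bedrock-lambda>",
        InvocationType="RequestResponse",
        Payload=json.dumps({"prompt": "Say hello in five words."}).encode("utf-8"),
    )

    print(response["StatusCode"])
    print(response["Payload"].read().decode("utf-8"))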

Bedrock Metrics You Will Receive

When you enable a CloudWatch Metric Stream for the AWS/Bedrock namespace, the following metrics become available for export into OpenObserve. These metrics give you direct visibility into invocation performance, error trends, and cost-related token usage.

  • Invocations (SampleCount): Number of successful requests to Converse, ConverseStream, InvokeModel, and InvokeModelWithResponseStream.
  • InvocationLatency (Milliseconds): Total latency of each invocation. Useful for building p50, p90, and p99 latency dashboards.
  • InvocationClientErrors (SampleCount): Count of invocation attempts that resulted in client-side errors (4xx).
  • InvocationServerErrors (SampleCount): Count of invocation attempts that resulted in AWS server-side errors (5xx).
  • InvocationThrottles (SampleCount): Number of requests that were throttled by the service. These do not count toward successful invocations or errors.
  • InputTokenCount (SampleCount): Number of tokens present in the input prompt. Useful for cost and performance analysis.
  • OutputTokenCount (SampleCount): Number of tokens returned in the model response. Pair with InputTokenCount for cost tracking.
  • LegacyModelInvocations (SampleCount): Number of invocations made using legacy Bedrock models.

Limitations & Best Practices

Monitoring AWS Bedrock through CloudWatch, Metric Streams, and Firehose works reliably, but there are a few practical considerations worth keeping in mind. These will help you design dashboards and alerts that are accurate, cost-efficient, and meaningful.

1. Bedrock logs do not include token-level breakdowns by default

While CloudWatch logs contain invocation metadata, they don’t always expose detailed token statistics (prompt tokens, output tokens, billed tokens) for every model. If token-level cost attribution is important, you may need to compute or extract these values separately inside your application.
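
One common workaround is to read the token-count headers that the Bedrock runtime returns on InvokeModel responses and log them yourself. A minimal sketch, assuming the x-amzn-bedrock-input-token-count and x-amzn-bedrock-output-token-count headers are present for your model (they are for many current model families, but verify for yours):

    import json
    import boto3

    bedrock = boto3.client("bedrock-runtime")

    def invoke_and_log_tokens(model_id, body):
        response = bedrock.invoke_model(modelId=model_id, body=body)

        # Token counts are exposed as HTTP response headers on many models (assumption: present for yours).
        headers = response["ResponseMetadata"]["HTTPHeaders"]
        input_tokens = int(headers.get("x-amzn-bedrock-input-token-count", 0))
        output_tokens = int(headers.get("x-amzn-bedrock-output-token-count", 0))

        # Emit a structured log line so token usage lands in CloudWatch and then OpenObserve.
        print(json.dumps({
            "model_id": model_id,
            "input_tokens": input_tokens,
            "output_tokens": output_tokens,
        }))

        return json.loads(response["body"].read())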

2. Metric Streams provide aggregate metrics, not per-request detail

Metrics such as InvocationLatency or InvocationServerErrors arrive as aggregated CloudWatch datapoints. They don’t map directly to individual invocation logs. For full correlation, you will rely more on logs than metric streams.

3. Bedrock logging must be explicitly enabled

If logging isn’t turned on, CloudWatch will remain empty and you’ll see no data flowing into OpenObserve. This is a common initial setup issue.

4. CloudWatch Log Groups can fragment across regions

If your organization invokes Bedrock models across multiple AWS regions, every region will generate its own log group and its own metrics. You must create log subscriptions and metric streams individually per region.

5. Some Bedrock models emit different telemetry structures

Anthropic, Meta, Amazon Titan, and custom model providers may expose logs that differ slightly in schema. You may need light VRL mapping in OpenObserve to normalize fields for dashboards.

Conclusion

Monitoring AWS Bedrock isn’t just about watching invocation counts or latency spikes; it’s about understanding how your application interacts with foundation models in real time. By streaming Bedrock’s metrics and logs into OpenObserve, you gain a single place to analyze performance, identify error patterns, catch throttling issues, and track token consumption with full transparency.

Pairing this with structured logging in your Lambda, dashboards that highlight the right KPIs, and alerts tuned to real performance thresholds gives you a complete operational view of your AI workloads. As your Bedrock usage grows, OpenObserve scales with you, helping you stay ahead of latency issues, unexpected cost jumps, and reliability regressions.

Next Steps

Now that Bedrock logs and metrics are flowing into OpenObserve, the final piece is enabling teams to act on the data. A few practical next steps that help you move from raw ingestion to real operational observability:

  • Build dashboards for invocation latency (p50/p90/p99), invocation volume, error rates, and token usage.
  • Set up alerts for throttling, error spikes, and latency thresholds that matter to your users.
  • Add structured logging inside your Lambda functions so application context can be correlated with Bedrock invocation logs.

FAQs

Q: Does logging add latency to my Bedrock invocations?

No. CloudWatch logging happens asynchronously after your Bedrock API call completes. The logging process does not block or slow down your model invocation. Your Lambda function returns immediately after receiving the Bedrock response. However, ensure your Lambda execution role has proper permissions to avoid authorization delays.

Q: Can I monitor Bedrock across multiple AWS regions?

Yes, but each AWS region requires separate configuration. Every region generates its own CloudWatch Log Groups and emits its own metrics. You'll need to create individual Kinesis Firehose delivery streams and CloudWatch Metric Streams per region. In OpenObserve, you can create region-specific streams (e.g., bedrock-logs-us-east-1, bedrock-logs-eu-west-1) or use a single stream with region tags for unified dashboards.

Q: What's the difference between Bedrock logs and Bedrock metrics?

Logs contain detailed, event-level data for each model invocation including request IDs, model names, input/output token counts, and error messages. Metrics provide aggregated statistics like total invocation count, average latency, and error rates over time intervals. Use logs for debugging specific failures and metrics for monitoring overall system health and trends.

Q: Do I need to enable Bedrock logging manually?

Yes. AWS Bedrock logging is disabled by default. You must explicitly enable it through the AWS Bedrock console under "Settings → Model invocation logging." Choose which data types to capture (text, image, embeddings) and select CloudWatch Logs as the destination. Without this step, no invocation logs will be generated.

Q: Which Bedrock models are supported for monitoring?

All foundation models available in AWS Bedrock emit metrics and logs when invocation logging is enabled. This includes Anthropic Claude models, Amazon Titan, Meta Llama, Cohere, AI21 Labs, and Stability AI models. However, log schemas may vary slightly between providers; for example, token count fields might have different names depending on the model family.

Q: Can I monitor token usage to track Bedrock costs?

Yes. Bedrock emits InputTokenCount and OutputTokenCount metrics through CloudWatch. These appear in your metric stream and can be visualized in OpenObserve dashboards. To calculate costs, multiply token counts by your model's per-token pricing (available in AWS Bedrock pricing documentation). You can create alerts when token usage exceeds budget thresholds.
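
As a simple illustration of the arithmetic (the per-token prices below are purely hypothetical placeholders, not real Bedrock rates):

    # Hypothetical per-1K-token prices; look up the real rates for your model and region.
    INPUT_PRICE_PER_1K = 0.003
    OUTPUT_PRICE_PER_1K = 0.015

    def estimate_cost(input_tokens: int, output_tokens: int) -> float:
        """Estimate the cost of one invocation in USD from its token counts."""
        return (input_tokens / 1000) * INPUT_PRICE_PER_1K + (output_tokens / 1000) * OUTPUT_PRICE_PER_1K

    # Example: 1,200 input tokens and 350 output tokens -> 0.0036 + 0.00525 = 0.00885 USD.
    print(round(estimate_cost(1200, 350), 5))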

Q: What happens if my Kinesis Firehose delivery fails?

Failed records are automatically backed up to the S3 bucket you configured during Firehose setup. Firehose retries delivery for up to 24 hours before moving data to S3. You can monitor delivery failures through CloudWatch metrics like DeliveryToHttpEndpoint.Success and DeliveryToS3.Success. Set up alerts in OpenObserve or CloudWatch to notify you of persistent delivery issues.

Q: Can I use this setup with Bedrock Agents or Knowledge Bases?

Yes. Bedrock Agents and Knowledge Bases generate their own CloudWatch logs in separate log groups (e.g., /aws/bedrock/agents/). You can create additional subscription filters for these log groups pointing to the same Firehose stream. Agent invocations also emit metrics under the AWS/Bedrock namespace, which your existing Metric Stream will capture.

Q: Can I filter which Bedrock invocations get logged?

Not directly through Bedrock's native logging. All invocations are logged once you enable the feature. However, you can use CloudWatch Logs subscription filter patterns to selectively forward only certain log events to Firehose. For example, filter only errors or specific model names. Alternatively, use OpenObserve's VRL (Vector Remap Language) to drop unwanted logs during ingestion.

About the Author

Simran Kumari

Passionate about observability, AI systems, and cloud-native tools. All in on DevOps and improving the developer experience.
