Prometheus Metrics Count Basics

Prometheus is a powerful monitoring system that excels at collecting and aggregating metrics. To make sense of what it measures, Prometheus uses four main metric types: counters, gauges, histograms, and summaries.
Among these, the Counter is the backbone of "counting" in Prometheus. It's what helps you answer essential operational questions like how many requests your service has handled, how many of them failed, and how many distinct error codes you are seeing.
In this guide, we’ll demystify counting in Prometheus, starting from the basics and working up to practical strategies you can apply in your monitoring setup.
Before writing Prometheus queries, it’s important to know what exactly we’re counting. Prometheus data is built on two concepts: metrics and labels.
Metrics are the actual measurements you collect. Examples include:
- http_requests_total → total number of HTTP requests
- node_cpu_seconds_total → CPU usage time
- db_connections → number of active database connections
Labels are key–value pairs that add context to a metric. They let you slice and filter the data. For example:
- method="GET" or method="POST" for request types
- status="200" or status="500" for response codes
- region="us-east" or region="europe" for deployment location
So if you're tracking http_requests_total, labels could tell you how many requests came from us-east, how many were 500 errors, or how many used the POST method.
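To see how metrics and labels fit together, here is a minimal sketch of what an application might expose on its /metrics endpoint; the values and label sets are purely illustrative:
# Hypothetical exposition output
http_requests_total{method="GET", status="200", region="us-east"} 1027
http_requests_total{method="POST", status="500", region="europe"} 3
Each line is one measurement: the metric name, the labels that describe it, and the current value.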
Understanding Time Series
A time series is a sequence of data points collected over time. In Prometheus, each unique combination of a metric name and its labels creates a separate time series. For example:
- website_visits{country="usa", device="mobile"} is one time series
- website_visits{country="usa", device="desktop"} is another time series
- website_visits{country="canada", device="mobile"} is yet another time series
Counting unique values in Prometheus isn't as straightforward as you might expect. Unlike traditional databases where you can simply use "COUNT DISTINCT", Prometheus is designed for time-series data and optimized for performance over complex queries.
The main challenges are that every unique label combination becomes its own time series, there is no built-in "COUNT DISTINCT" over raw values, and high-cardinality labels can multiply the number of series you have to aggregate.
This is why learning the proper techniques for counting is essential for effective Prometheus usage.
To understand counting better, it's helpful to know how Prometheus works behind the scenes. Prometheus scrapes (collects) metrics from your applications and infrastructure at regular intervals, typically every 15-30 seconds.
Each time it scrapes, it creates data points with timestamps. These data points are stored as time series, and each unique combination of metric name and labels becomes a separate time series. When you run a count query, Prometheus looks at all the relevant time series and performs calculations on them.
This architecture is why Prometheus is so fast at handling large amounts of time-series data, but it also explains why counting unique values requires special techniques.
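As a rough illustration, a scrape configuration like the following tells Prometheus to collect samples every 15 seconds; the job name and target address are placeholders, not part of any setup described here:
global:
  scrape_interval: 15s        # how often Prometheus scrapes each target
scrape_configs:
  - job_name: "web-server"    # hypothetical job
    static_configs:
      - targets: ["web-server:8080"]   # endpoint exposing /metrics
Every scrape adds one timestamped sample to each series the target exposes.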
The core pattern for counting unique label values is the double count:
count(count by (status_code) (http_requests_total))
This query works in two steps. First, count by (status_code) (http_requests_total) groups all HTTP requests by their status code (like 200, 404, 500). Then, the outer count() counts how many unique status codes exist. If your application returns status codes 200, 404, and 500, this query returns 3.
The reason we use this double count approach is that the inner count groups your data, and the outer count gives you the final number. It's like organizing books by genre first, then counting how many genres you have.
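To make the two stages concrete, here is a sketch of what each step might return; the series and numbers are hypothetical:
count by (status_code) (http_requests_total)
# => {status_code="200"} 12
#    {status_code="404"} 3
#    {status_code="500"} 1
count(count by (status_code) (http_requests_total))
# => 3
The inner query yields one row per status code; the outer count simply counts those rows.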
Let's start with the absolute basics of counting in Prometheus. We'll build your understanding step by step.
Syntax:
count(metric_name)
Example:
count(up)
What it does: Counts how many time series exist for the up metric, which tracks whether each scrape target is reachable.
Use case: You want to know "How many services is Prometheus monitoring?" If you have 5 services being monitored, this returns 5. This is the foundation of all counting in Prometheus.
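A common variation, if you care only about targets that are currently reachable rather than every tracked series, is to filter on the value first. This is a small sketch using the standard up metric:
count(up == 1)   # only targets whose last scrape succeeded
count(up == 0)   # targets that are currently down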
Syntax:
count by (label_name) (metric_name)
Example:
count by (job) (up)
What it does: Groups the up metric by the job label, then counts how many instances exist for each job.
Use case: You want to see "How many instances of each service am I monitoring?" This might return:
prometheus-job: 1
web-server: 3
database: 2
This shows you have 1 Prometheus instance, 3 web servers, and 2 database instances.
Syntax:
count(count by (label_name) (metric_name))
Example:
count(count by (status_code) (http_requests_total))
What it does: First groups by status_code, then counts how many unique status codes exist.
Use case: You want to know "How many different HTTP status codes am I seeing?" The inner count by
creates groups for each status code, the outer count
tells you how many groups exist. If you see 200, 404, and 500 status codes, this returns 3
.
Syntax:
count without (label_name) (metric_name)
Example:
count without (instance) (up)
What it does: Counts while ignoring the instance label, effectively grouping by all other labels.
Use case: You want service-level counts instead of instance-level counts. Instead of counting each server separately, you count services as units.
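For example, with the instance label dropped, the result might look like this; the job names and counts are hypothetical:
count without (instance) (up)
# => {job="web-server"} 3
#    {job="database"} 2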
Syntax:
count(metric_name{label="value"})
Example:
count(up{job="web-server"})
What it does: Counts only the time series where job equals "web-server".
Use case: You want to know "How many web server instances am I monitoring?" This filters out all other services and counts only web servers.
Syntax:
count by (label1, label2) (metric_name)
Example:
count by (job, instance) (up)
What it does: Groups by both job AND instance, showing the count for each job-instance combination.
Use case: You want a detailed breakdown showing each specific service instance. This gives you a complete inventory of every monitored service instance.
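Since each target exposes exactly one up series, the result reads like an inventory with one entry per job/instance pair; the addresses below are hypothetical:
count by (job, instance) (up)
# => {job="web-server", instance="10.0.1.5:8080"} 1
#    {job="web-server", instance="10.0.1.6:8080"} 1
#    {job="database", instance="10.0.2.3:5432"} 1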
Syntax:
count(count by (label1, label2) (metric_name))
Example:
count(count by (method, status_code) (http_requests_total))
What it does: Counts how many unique combinations of method AND status code exist.
Use case: Now that you understand the basics, you can count complex combinations. If you have GET/POST/PUT methods across 200/404/500 status codes, this returns the number of unique combinations you're actually seeing (might be less than the theoretical maximum if some combinations don't occur).
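As a sketch with made-up traffic, three methods across three status codes could theoretically produce nine combinations, but the query reports only the ones that actually occur:
count(count by (method, status_code) (http_requests_total))
# => 7   (e.g., PUT requests never returned 404 or 500)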
Sometimes you only want to count data from a specific period, like the last hour or day. That's where range functions like last_over_time() come in.
Syntax:
count(last_over_time(<metric>[<range>]))
Example:
count(last_over_time(user_activity[1d]))
What this actually does:
Prometheus looks at each unique time series of user_activity. A time series is a metric + label combination, like:
- user_type="premium", location="us-east"
- user_type="trial", location="europe"
For each time series, it finds the last recorded value within the past day. It doesn't care about all the earlier samples, only the most recent one.
The count() then counts how many series had at least one data point in that period.
Analogy: Imagine you have several employees submitting daily reports. You only check the last report each employee submitted today. Then you count how many employees submitted reports. That’s exactly what this query does for metrics.
Tip: Swap [1d] for [1h], [7d], or [30m] depending on the period you want to analyze.
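On Prometheus 2.29 or newer, present_over_time() expresses the same intent a little more directly, since it only checks whether any sample existed in the window rather than fetching its value; this is a sketch under that version assumption:
count(present_over_time(user_activity[1d]))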
Syntax:
count_values("label_name", metric_name)
Example:
count_values("response_code", http_status)
What it does: Creates a new time series showing each unique value and its count.
Use case: When you want to see both what the unique values are AND how many times each appears. Instead of just knowing "you have 3 unique status codes," you see "200 appears 1000 times, 404 appears 50 times, 500 appears 10 times."
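Using those same numbers as a hypothetical result, the output is one series per observed value:
count_values("response_code", http_status)
# => {response_code="200"} 1000
#    {response_code="404"} 50
#    {response_code="500"} 10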
Syntax for Active Time Series:
prometheus_tsdb_head_series
Example result: Returns a number like 15420, meaning Prometheus is tracking 15,420 different data streams.
Use case: Monitor Prometheus performance and storage usage. High numbers might indicate cardinality issues.
Syntax for All Metrics:
count({__name__=~'.+'})
Example result: Returns the total number of time series across all metric names at the moment the query runs.
Use case: Get a bird's-eye view of your monitoring scope. Useful for capacity planning and understanding system complexity.
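If that total looks high, a per-metric-name breakdown (a common cardinality-hunting pattern, sketched here) shows where the series are coming from:
count by (__name__) ({__name__=~".+"})                 # one row per metric name with its series count
topk(10, count by (__name__) ({__name__=~".+"}))       # the ten metrics with the most series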
While Prometheus excels at real-time monitoring and counting, it is primarily designed for short-term data storage. Many default setups retain data for around two weeks, though this can be configured. For long-term storage, historical analysis, and advanced visualization, integrating Prometheus with a tool like OpenObserve is highly valuable. This setup allows you to maintain real-time monitoring while also keeping months or even years of historical metrics for deeper insights.
Learn more about ingesting Prometheus metrics into OpenObserve.
To send your Prometheus data to OpenObserve, add this configuration to your Prometheus config file:
remote_write:
  - url: https://<openobserve_host>/api/<org_name>/prometheus/api/v1/write
    queue_config:
      max_samples_per_send: 10000
    basic_auth:
      username: <openobserve_user>
      password: <openobserve_password>
This setup allows Prometheus to focus on collecting data while OpenObserve handles storage and visualization, creating a powerful monitoring combination.
The patterns to remember are the double count, count(count by (label) (metric)), for unique values, and range selectors like [1h] or [1d] for time-bounded counts. Prometheus counting might seem complex at first, but with these basics, you can start monitoring effectively. Focus on simple queries, build expertise over time, and prioritize metrics that matter most. For advanced visualization and long-term storage, integrate with OpenObserve to unlock the full potential of your monitoring data.
For deeper guidance, check out the OpenObserve documentation on advanced setups and custom dashboards.
Ready to put this into practice? Sign up for an OpenObserve cloud account (14 day free trial) or visit our downloads page to self-host OpenObserve.