Sensitive Data Redaction

This document explains how to configure and manage regex patterns to redact, hash, and drop sensitive data in OpenObserve.

Availability

This feature is available in Enterprise Edition and Cloud. It is not available in Open Source.

Overview

The Sensitive Data Redaction feature helps prevent accidental exposure of sensitive data by applying regex-based detection to values ingested into streams and to values already stored in streams. Based on this detection, sensitive values can be redacted, hashed, or dropped. This ensures data is protected before it is stored and hidden when displayed in query results. You can configure these actions to run at ingestion time, at query time, or both.

Ingestion time

Note: Use ingestion time redaction, hash, or drop when you want to ensure sensitive data is never stored on disk. This is the most secure option for compliance requirements, as the original sensitive data cannot be recovered once it is redacted, hashed, or dropped during ingestion.

Redact: Sensitive data is masked before being stored on disk.
Hash: Sensitive data is replaced with a searchable hash before being stored on disk.
Drop: Sensitive data is removed before being stored on disk.

Query time

Note: If you have already ingested sensitive data and it is stored on disk, you can use query time redaction, hash, or drop to protect it. This allows you to apply sensitive data redaction to existing data.

Redact: Sensitive data is read from disk but masked before results are displayed.
Hash: Sensitive data is read from disk but masked with a searchable hash before results are displayed.
Drop: Sensitive data is read from disk but excluded from the query results.

Where to find

To access the Sensitive Data Redaction interface:

Select the appropriate organization from the dropdown in the top-right corner.
Select Management > Sensitive Data Redaction.


This opens the Sensitive Data Redaction interface, where you can view, create, and manage regex patterns available to the selected organization.

Who can access

Root users have full access to both pattern creation and pattern association by default. For other users, permissions are controlled through the Regexp Patterns and Streams modules in the IAM settings, using role-based access control (RBAC).

Pattern Creation:

Users need permissions on the Regexp Patterns module to create, view, edit, or delete regex patterns.
You can control access at both the module level (all regex patterns) and the individual pattern level for precise control.

Pattern Association:

To associate patterns with stream fields, users need both the List permission on the Regexp Patterns module and the Edit permission on the Streams module.

Important note

Regex patterns can only be applied to fields with UTF8 data type.
The stream must have ingested data before you can apply regex patterns. Empty streams will not show field options for pattern association.

Create regex patterns

To create a regex pattern:

Step 1: Discover sensitive data

Identify which fields may contain sensitive data.

From the left-hand menu, select Logs.
In the stream selection dropdown, select the stream.
Select an appropriate time range and click Run Query. This shows the records for the selected time range.

Look for common sensitive patterns.

Sensitive Data Category	Examples	Common Fields
Personal Information	Names, emails, phone numbers	`message`, `user_info`, `contact`
Financial Data	Credit cards, SSNs, bank accounts	`payment_info`, `transaction_data`
Authentication	API keys, tokens, passwords	`headers`, `auth_data`, `debug_info`
Network Data	IP addresses, MAC addresses	`client_ip`, `network_info`

Example Sensitive Data in Logs:

{
"message": "User John Doe with email john.doe@company.com logged in from IP 192.168.1.100. SSN: 123-45-6789. Credit Card: 4111-1111-1111-1111",
"timestamp": "2025-07-30T10:30:00Z"
}

Step 2: Create and test regex patterns

To create regex patterns, navigate to Management > Sensitive Data Redaction > Create Pattern.

Create regex

In the pattern creation form, enter the following details:

Name: Enter a clear, descriptive name. For example, Email Detection.
Description: (Optional) Explain what the pattern is intended to detect.
Regex Pattern: Paste or write the regular expression you want to use. Refer to the following common patterns.
Test Pattern: Provide a sample input to validate that the regex works as expected.
Click the Create and Close button to save the pattern.

Common Patterns

Type	Pattern	Example
Email	`\b[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}\b`	`user@company.com`
Full Name	`\b[A-Z][a-z]+ [A-Z][a-z]+\b`	`John Doe`
Phone (US)	`\+?1?[-.\s]?\(?[0-9]{3}\)?[-.\s]?[0-9]{3}[-.\s]?[0-9]{4}`	`+1-555-123-4567`
Credit Card	`\b(?:\d{4}[-\s]?){3}\d{4}\b`	`4111-1111-1111-1111`
SSN (US)	`\b\d{3}-?\d{2}-?\d{4}\b`	`123-45-6789`
API Key	`\b[A-Za-z0-9]{32,}\b`	`0123456789abcdef0123456789abcdef`
IP Address	`\b(?:[0-9]{1,3}\.){3}[0-9]{1,3}\b`	`192.168.1.100`
Password Field	`(?i)password[\"':\s=]+[\"']?([^\"'\s,}]+)`	`password: "secret123"`
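Before pasting a pattern into the form, it can help to try it locally against a sample record. The following is a minimal sketch that uses Python's `re` module as a stand-in; this is an assumption for illustration only, since OpenObserve evaluates patterns with a different engine and edge-case behavior may differ. The names `PATTERNS`, `SAMPLE`, and `find_sensitive` are hypothetical, and the email pattern is written with `[A-Za-z]` for the top-level domain.

```python
import re

# A few patterns from the Common Patterns table (hypothetical dictionary name).
PATTERNS = {
    "email": r"\b[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}\b",
    "credit_card": r"\b(?:\d{4}[-\s]?){3}\d{4}\b",
    "ssn": r"\b\d{3}-?\d{2}-?\d{4}\b",
    "ip_address": r"\b(?:[0-9]{1,3}\.){3}[0-9]{1,3}\b",
}

# The sample log message from Step 1.
SAMPLE = (
    "User John Doe with email john.doe@company.com logged in from IP "
    "192.168.1.100. SSN: 123-45-6789. Credit Card: 4111-1111-1111-1111"
)


def find_sensitive(text: str) -> dict[str, list[str]]:
    """Return every match of each pattern in `text`, keyed by pattern name."""
    return {name: re.findall(rx, text) for name, rx in PATTERNS.items()}


if __name__ == "__main__":
    for name, hits in find_sensitive(SAMPLE).items():
        print(f"{name}: {hits}")
```

A check like this is only a first pass. The Test Pattern field in the creation form remains the reference for how a pattern behaves inside OpenObserve.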

Example
The following screenshots illustrate the pattern creation process:

Review the logs that include PII.
The message field in the pii_test_stream contains names, email addresses, IP addresses, SSNs, and credit card numbers.
Create and test the regex patterns.
Full Name:
Email Addresses:

Apply regex patterns

Once your patterns are created and tested, you can apply them to specific fields in a stream to redact, hash, or drop sensitive data during ingestion or at query time.
To apply a pattern to a field:

Step 1: Go to the stream field

From the left-hand menu, go to Streams.
Locate the stream where you want to apply regex patterns and select Stream Details from the Actions column.
In the Stream Settings tab, locate the field that contains sensitive data.

Field with sensitive data

Step 2: Add pattern

Select Add Pattern for the target field. This opens the pattern panel, where you can view already applied patterns and add new ones.
From the All Patterns section, select a pattern you want to apply.
After selecting a pattern, a detail view appears.

Step 3: Choose whether to Redact, Hash, or Drop

Regex pattern execution action- redact or drop

When applying a regex pattern, you must choose one of the following actions in the pattern details screen:

Redact:

Replaces only the matching portion of the field value with [REDACTED], while preserving the rest of the field.
Use this when the field contains both sensitive and non-sensitive information and you want to retain the overall context.

Hash:

Replaces the matched sensitive value with a searchable hash while keeping its position within the field.

Drop:

Removes the entire field from the log record if the regex pattern matches.
Use this when the entire field should be excluded from storage or analysis.

Select the appropriate action.
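The difference between the three actions can be sketched in a few lines of Python. This is an illustrative model only, not OpenObserve's implementation: `apply_action` and `stand_in_hash` are hypothetical helpers, Python's `re` stands in for the real matching engine, and the truncated SHA-256 digest is an assumption chosen only to produce a token of the `[REDACTED:<hex>]` shape. The actual hashing scheme is not described in this document.

```python
import hashlib
import re

CARD = r"\b(?:\d{4}[-\s]?){3}\d{4}\b"  # Credit Card pattern from Common Patterns


def stand_in_hash(value: str) -> str:
    """Stand-in digest for illustration; NOT OpenObserve's actual hash."""
    return hashlib.sha256(value.encode("utf-8")).hexdigest()[:32]


def apply_action(record: dict, field: str, pattern: str, action: str) -> dict:
    """Return a copy of `record` with `action` applied to `field`."""
    out = dict(record)
    value = out.get(field)
    # Only string fields that actually contain a match are affected.
    if not isinstance(value, str) or re.search(pattern, value) is None:
        return out
    if action == "redact":
        # Replace only the matching portion; keep the surrounding text.
        out[field] = re.sub(pattern, "[REDACTED]", value)
    elif action == "hash":
        # Replace the match in place with a token derived from the match.
        out[field] = re.sub(
            pattern, lambda m: f"[REDACTED:{stand_in_hash(m.group(0))}]", value
        )
    elif action == "drop":
        # Remove the whole field, not just the matching portion.
        del out[field]
    else:
        raise ValueError(f"unknown action: {action}")
    return out


record = {"level": "info", "message": "Payment processed with card 4111-1111-1111-1111"}
for action in ("redact", "hash", "drop"):
    print(action, apply_action(record, "message", CARD, action))
```

Redact and Hash keep the field and its non-sensitive context. Drop removes the field entirely, so it suits fields whose whole value is sensitive.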

Step 4: Choose when the action needs to be executed

In the pattern details screen, select when the chosen action (redact, hash, or drop) should be executed: at ingestion time, at query time, or both.

Regex pattern execution time

Ingestion:

The data is redacted, hashed, or dropped before it is written to disk.
This ensures that sensitive information is never stored in OpenObserve.
Example: If an email address is redacted at ingestion, only the masked value [REDACTED] will be stored in the logs.

Query:

The data is stored in its original form but is redacted, hashed, or dropped only when queried.
This allows administrators to preserve the original data while preventing exposure of sensitive values during searches.
Example: An email address stored in raw form will be hidden as [REDACTED] in query results.

You can select one or both options depending on your security and compliance requirements. If neither ingestion time nor query time is selected, no redaction, hashing, or drop is applied.
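The effect of the two execution times can be illustrated with a toy in-memory model. This is a sketch under stated assumptions, not a description of OpenObserve internals: `TinyStore` is a hypothetical class, its `disk` list stands in for stored data, and only the redact action is modeled.

```python
import re

EMAIL = r"\b[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}\b"


class TinyStore:
    """Toy in-memory stand-in for a stream; hypothetical, for illustration only."""

    def __init__(self, pattern: str, at_ingestion: bool, at_query: bool):
        self.pattern = re.compile(pattern)
        self.at_ingestion = at_ingestion
        self.at_query = at_query
        self.disk: list[str] = []  # what would be written to storage

    def _redact(self, text: str) -> str:
        return self.pattern.sub("[REDACTED]", text)

    def ingest(self, message: str) -> None:
        # Ingestion time: the transformed value is what gets stored.
        self.disk.append(self._redact(message) if self.at_ingestion else message)

    def query(self) -> list[str]:
        # Query time: storage is untouched; only the returned view is masked.
        return [self._redact(m) if self.at_query else m for m in self.disk]


query_only = TinyStore(EMAIL, at_ingestion=False, at_query=True)
query_only.ingest("Password reset requested for john.doe@company.com")
print("stored:  ", query_only.disk[0])
print("returned:", query_only.query()[0])
```

With query time only, the stored value still contains the email address and only the returned result is masked. With ingestion time enabled, the masked value is the only one that exists.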

Step 5: Add pattern and update changes

To add the regex pattern to Applied Patterns, click Add Pattern.
Select Update Changes.

Step 6: (Optional) Apply multiple patterns

You can apply multiple patterns to the same field. All applied patterns appear in the left-hand panel with check marks.

Step 7: Save configuration

When finished, click Update Changes to save the configuration. This activates the regex rules for the selected field.

Test Redact, Hash and Drop operations

Test 1: Redact at ingestion time

Redact at ingestion time

Pattern Configuration: redact-at-ingestion-time-test-config

Test Steps:

From the left-hand menu, select Logs.
Select the pii_test stream from the dropdown.

Ingest a log entry containing a full name in the message field.

$ curl -u example@example.com:FNIB8MWshsuhyehH -k https://example.zinclabs/api/default/pii_test/_json -d '[{"level":"info","job":"test","message":"User John Doe logged in successfully"}]'
{"code":200,"status":[{"name":"pii_test","successful":1,"failed":0}]}

Set the time range to include the test data.
Click Run Query.
Verify results:

Key points:

The name "John Doe" is replaced with [REDACTED].
The rest of the message field remains intact.
This is the actual stored value on disk.
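The ingestion request used in this test can also be built from a script. The following is a minimal sketch using only the Python standard library. It assumes the same endpoint shape as the curl command (`/api/<organization>/<stream>/_json`) with HTTP Basic authentication; `build_ingest_request` is a hypothetical helper, and `<password>` is a placeholder to replace with your own credentials. The request is constructed but not sent.

```python
import base64
import json
import urllib.request


def build_ingest_request(base_url, org, stream, user, password, records):
    """Build (but do not send) a JSON ingestion request for the given stream."""
    url = f"{base_url}/api/{org}/{stream}/_json"
    credentials = base64.b64encode(f"{user}:{password}".encode("utf-8")).decode("ascii")
    return urllib.request.Request(
        url,
        data=json.dumps(records).encode("utf-8"),
        headers={
            "Authorization": f"Basic {credentials}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_ingest_request(
    "https://example.zinclabs", "default", "pii_test",
    "example@example.com", "<password>",
    [{"level": "info", "job": "test", "message": "User John Doe logged in successfully"}],
)
# To send it against a reachable instance: urllib.request.urlopen(request)
print(request.get_method(), request.full_url)
```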

Test 2: Drop at ingestion time

Drop at ingestion time

Pattern Configuration: drop-at-query-time-test-config

Test Steps:

From the left-hand menu, select Logs.
Select the pii_test stream from the dropdown.

Ingest a log entry containing an IP address in the message field.

$ curl -u example@example.com:FNIB8MWshsuhyehH -k https://example.zinclabs/api/default/pii_test/_json -d '[{"level":"info","job":"test","message":"Connection from IP 192.168.1.100 established"}]'
{"code":200,"status":[{"name":"pii_test","successful":1,"failed":0}]}

Set the time range to include the test data.
Click Run Query.
Verify results:

Key points:

The entire message field is missing from the stored record.
Other fields remain intact.
This demonstrates field-level drop at ingestion.

Test 3: Hash at ingestion time

Hash at ingestion time

Pattern Configuration: config-hash-pattern-ingestion-time

Test Steps:

From the left-hand menu, select Logs.
Select the pii_test stream from the dropdown.

Ingest a log entry containing card details in the log field.

$ curl -u example@example.com:FNIB8MWshsuhyehH -k https://example.zinclabs/api/default/pii_test/_json -d '[{"job":"test","level":"info","log":"Payment processed with card 4111-1111-1111-1111"}]'

Set the time range to include the test data.
Click Run Query.
Verify results:

Test 4: Redact at query time

Redact at query time

Pattern Configuration: redact-at-query-test-config

Test Steps:

From the left-hand menu, select Logs.
Select the pii_test stream from the dropdown.

Ingest a log entry containing an email address in the message field.

$ curl -u example@example.com:FNIB8MWshsuhyehH -k https://example.zinclabs/api/default/pii_test/_json -d '[{"level":"info","job":"test","message":"Password reset requested for john.doe@company.com"}]'
{"code":200,"status":[{"name":"pii_test","successful":1,"failed":0}]}

Set the time range to include the test data.
Click Run Query.
Verify results:

Key points:

Original data is preserved on disk.
Email address appears as [REDACTED] in query results.
Useful for compliance while maintaining data for authorized access.

Test 5: Drop at query time

Drop at query time

Pattern Configuration: Drop at Query Time- Test Config

Test Steps:

From the left-hand menu, select Logs.
Select the pii_test stream from the dropdown.

Ingest a log entry containing credit card details in the message field.

$ curl -u example@example.com:FNIB8MWshsuhyehH -k https://example.zinclabs/api/default/pii_test/_json -d '[{"level":"info","job":"test","message":"Payment processed with card 4111-1111-1111-1111"}]'
{"code":200,"status":[{"name":"pii_test","successful":1,"failed":0}]}

Set the time range to include the test data.
Click Run Query.
Verify results:

Key points:

Original data is preserved on disk.
The message field with the credit card details gets dropped in query results.
This demonstrates field-level drop at query time.

Test 6: Hash at query time

Hash at query time

Pattern Configuration: config-hash-pattern-query-time

Test Steps:

From the left-hand menu, select Logs.
Select the pii_test stream from the dropdown.

Ingest a log entry containing card details in the log field.

$ curl -u example@example.com:FNIB8MWshsuhyehH -k https://example.zinclabs/api/default/pii_test/_json -d '[{"job":"test","level":"info","log":"Payment processed with card 4111-1111-1111-1111"}]'

Set the time range to include the test data.
Click Run Query.
Verify results:

Search hashed values using `match_all_hash`

The match_all_hash user-defined function (UDF) complements the SDR Hash feature. It allows you to search for logs that contain the hashed equivalent of a specific sensitive value. When data is hashed using Sensitive Data Redaction, the original value is replaced with a searchable hash. You can use match_all_hash() to find all records that contain the hashed token, even though the original value no longer exists in storage.
Example:

match_all_hash('4111-1111-1111-1111')

This query returns all records where the SDR Hash of the provided value exists in any field. In the example below, it retrieves the log entry containing [REDACTED:907fe4882defa795fa74d530361d8bfb], the hashed version of the given card number.

match-all-hash
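The idea behind searching hashed values can be sketched as follows: the search term is hashed the same way the stored value was, and records are matched on the resulting token rather than on the original text. This is a conceptual model only. `stand_in_hash`, `token_for`, and `match_all_hash_sketch` are hypothetical names, and the truncated SHA-256 digest is an assumption used for illustration, not the hash OpenObserve computes.

```python
import hashlib


def stand_in_hash(value: str) -> str:
    """Stand-in digest for illustration; NOT OpenObserve's actual hash."""
    return hashlib.sha256(value.encode("utf-8")).hexdigest()[:32]


def token_for(value: str) -> str:
    """Token that replaces a sensitive value once it has been hashed."""
    return f"[REDACTED:{stand_in_hash(value)}]"


def match_all_hash_sketch(records: list[dict], value: str) -> list[dict]:
    """Return records in which any string field contains the token for `value`."""
    token = token_for(value)
    return [
        r for r in records
        if any(isinstance(v, str) and token in v for v in r.values())
    ]


card = "4111-1111-1111-1111"
stored = [
    {"job": "test", "log": f"Payment processed with card {token_for(card)}"},
    {"job": "test", "log": "Health check passed"},
]
print(match_all_hash_sketch(stored, card))
```

Because the same input always produces the same token, a record can be located by a known value even though that value is no longer present in the stored text.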

Import patterns from built-in library

OpenObserve provides a built-in library of 147+ pre-configured regex patterns that can be imported directly into your organization. These patterns cover common sensitive data types and security-related formats, allowing you to quickly implement data protection without writing regex patterns from scratch.

To import patterns from the built-in library:

Step 1: Navigate to the Import section

Go to Management > Sensitive Data Redaction.
Click the Import button in the top-right corner.
The Import Pattern screen opens with three tabs:
- Built-in Patterns: Pre-configured patterns from OpenObserve's pattern library
- File Upload/JSON: Import patterns from a JSON file
- URL Import: Import patterns from a URL
Select the Built-in Patterns tab.

Built-in Patterns Import Interface

Step 2: Browse and search patterns

The built-in patterns library displays 147 patterns. You can:

Search patterns: Use the search bar to find patterns by name
Filter by tags: Use the "Filter by Tag" dropdown to narrow patterns by category
Refresh: Click the Refresh button to pull the latest patterns from the GitHub repository

Step 3: View pattern details

To view details about a pattern before importing:

Click the three dots (⋮) icon next to any pattern in the list.
A detail panel displays:
- Description: What the pattern detects
- Pattern: The actual regex expression
- Tags: Categories the pattern belongs to
- Rarity: How commonly this pattern is used
- Valid Examples: Sample data that matches this pattern

Pattern details view

This helps you verify the pattern will match your expected data format before importing.

Step 4: Select and import patterns

Select patterns: Check the box next to each pattern you want to import. You can select multiple patterns at once.
Import: Click the Import button in the top-right.

After importing patterns, you can edit, export, and delete them.

Duplicate handling

To avoid duplicates, the system does not allow you to import the same pattern more than once.

Limitations

Pattern Matching Engine: OpenObserve uses the Intel Hyperscan library for regex evaluation. All Hyperscan limitations apply to pattern syntax and matching behavior.
Field Type Restrictions: Regex patterns can only be applied to fields with a UTF8 data type. Other field types are not supported.
Data Requirements: Patterns can only be applied after the stream has ingested data. Empty streams will not show any fields in the Stream Settings tab for pattern association.
Performance: Complex patterns may impact ingestion speed, but overall performance remains faster than VRL-based redaction.
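Hyperscan's documentation lists backreferences and arbitrary zero-width assertions (lookahead and lookbehind) among the constructs it does not support. A quick local check can flag these before a pattern is created. The sketch below is a heuristic and deliberately non-exhaustive: `find_unsupported` is a hypothetical helper, it only looks for the two construct families named above, and a clean result does not guarantee that a pattern will be accepted.

```python
def find_unsupported(pattern: str) -> list[str]:
    """Heuristically list lookaround and backreference constructs in `pattern`."""
    issues = []
    i = 0
    in_class = False  # True while scanning inside a [...] character class
    while i < len(pattern):
        ch = pattern[i]
        if ch == "\\":
            nxt = pattern[i + 1:i + 2]
            # \1..\9 and \k<name> are backreferences outside a character class.
            if not in_class and ((nxt.isdigit() and nxt != "0") or nxt == "k"):
                issues.append("backreference")
            i += 2  # skip the escaped character
            continue
        if in_class:
            if ch == "]":
                in_class = False
        elif ch == "[":
            in_class = True
        elif pattern.startswith(("(?=", "(?!"), i):
            issues.append("lookahead")
        elif pattern.startswith(("(?<=", "(?<!"), i):
            issues.append("lookbehind")
        elif pattern.startswith("(?P=", i):
            issues.append("backreference")
        i += 1
    return issues


for candidate in (r"\b\d{3}-?\d{2}-?\d{4}\b", r"(?<=\$)\d+", r"(\w+) \1"):
    print(candidate, "->", find_unsupported(candidate) or "no issues found")
```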

Troubleshooting

Issue	Cause	Solution
The Add Pattern option is not visible in Stream Details.	The field is not of UTF8 type.	Check the field type in the Stream Details view. Only UTF8 fields support regex patterns.
Pattern does not apply.	Configuration changes were not saved.	Ensure that you selected Update Changes after applying the pattern.
