2025-05-09 13:07:38 +05:30

2.7 KiB

title slug collate
Data Retention Application /applications/data-retention true

Data Retention Application

The Data Retention application in OpenMetadata automates the cleanup of the internal database to maintain long-term performance and prevent data bloat. It provides administrators with an efficient mechanism to define retention policies for event-related records, ensuring scalability and compliance with data management practices.

Overview

As OpenMetadata processes events related to metadata changes and pipeline executions, it stores these events in internal tables. Over time, these tables can grow significantly and affect system performance. The Data Retention App enables automated and scheduled cleanup of outdated records.

Key Features

  • Automated cleanup of outdated internal records.
  • Configurable retention periods for different data categories.
  • Helps maintain system performance and database efficiency.
  • Supports compliance with organizational data retention policies.

Configuration Options

  • Configuration Key: changeEventRetentionPeriod
    Description: Number of days to retain change event records
    Default Value: 7

{% note %}

The retention period is applied to each cleanup target based on its timestamp.

{% /note %}

Cleanup Targets

The app deletes old records from the following internal tables:

  • change_events: Stores all metadata change event logs.
  • successful_sent_change_events: Tracks successfully processed events.
  • consumers_dlq: Dead Letter Queue for failed events.

Implementation Details

  • Schedule: Runs weekly by default.
  • Cron: 0 0 * * 0 (every Sunday at midnight).
  • Batch Processing: Cleans up records in batches of 10,000.
  • Triggering: Can be executed manually or allowed to run on schedule.
  • Application Class: org.openmetadata.service.apps.bundles.dataRetention.DataRetention
  • Permission Requirement: Requires admin privileges to configure or trigger.

How to Access

  1. Go to Settings > Applications and then click on Add apps.

{% image src="/images/v1.8/applications/ret.png" alt="Install Data Retention Application" caption="Install Data Retention Application" /%}

  1. Select or install the Data Retention application.

{% image src="/images/v1.8/applications/ret1.png" alt="Install Data Retention Application" caption="Install Data Retention Application" /%}

  1. Configure the desired retention period.

{% image src="/images/v1.8/applications/ret2.png" alt="configuration" caption="Configuration" /%}

  1. Run manually or let it operate on its default schedule.

{% image src="/images/v1.8/applications/ret3.png" alt="scheduling" caption="scheduling" /%}