Skip to main content
Version Next

OpenMetadata Configuration

What is OpenMetadata?

OpenMetadata is a data catalog and governance platform.

In practical terms, it is where you register all of your organization's data assets — ClickHouse tables, Kafka topics, Superset dashboards, Airflow pipelines — and get a unified view of who created them, what they mean, who uses them, and how they relate to each other.

In TDP Kubernetes, OpenMetadata addresses the following need: with so many components (Trino, ClickHouse, Kafka, Airflow, Superset), it becomes difficult to know where a specific piece of data lives, what it means, and whether it is still trustworthy.

OpenMetadata is the central point for discovery and documentation.

Learn more

See OpenMetadata — Concepts for a complete overview of the tool, its architecture and how it works.

Deployed components

ComponentDescription
OpenMetadata ServerCatalog API and web interface
MySQL (internal default)OpenMetadata metadata database
OpenSearch/ElasticsearchSearch backend for finding data assets
Datasources job (optional)Automatically registers TDP services after installation

The datasource registration job

A specific feature of the tdp-openmetadata chart is the datasources integration job (datasourcesIntegration.enabled=true).

It runs automatically after installation and registers the TDP services you enabled (ClickHouse, Trino, Superset, Airflow, Kafka) in OpenMetadata.

This way, you do not need to register each service manually through the UI.

Details at Integrations — OpenMetadata.

This page summarizes the configuration of the tdp-openmetadata chart: installation via OCI registry, main parameters, and the optional datasource registration job that runs after installation or upgrade.

Overview

PropertyValue
Charttdp-openmetadata
OpenMetadata Version1.9.11
Chart Version3.0.0
Registry (OCI)oci://registry.tecnisys.com.br/tdp/charts/tdp-openmetadata

Installation (OCI)

Terminal input
helm install <release> oci://registry.tecnisys.com.br/tdp/charts/tdp-openmetadata -n <namespace> --create-namespace

Main parameters

ParameterDescriptionDefault value
openmetadata.enabledEnable the deploytrue
openmetadata.image.tagServer image tag1.9.11
openmetadata.ingress.enabledEnable Ingressfalse
openmetadata.config.database.dbSchemeMetadata database schememysql
openmetadata.config.database.hostMetadata database hostmysql
openmetadata.config.elasticsearch.searchTypeSearch backendopensearch
openmetadata.config.elasticsearch.hostSearch backend hostopensearch
externalServices.postgresql.*Reference to external PostgreSQL (hooks / admin password)""
datasourcesIntegration.enabledPost-install/upgrade job that registers external servicesfalse

Other keys: see the output of helm show values for the package version you are using.

Datasource integration

After install or upgrade, a Job can register services in OpenMetadata when datasourcesIntegration.enabled=true:

  • ClickHouse (Database Service)
  • Trino (Database Service)
  • Superset (Dashboard Service)
  • Airflow (Pipeline Service)
  • Kafka (Messaging Service)

Prerequisites: OpenMetadata and the target services deployed and reachable; credentials in the values file and/or in Kubernetes Secrets.

YAML details and examples are available at Integrations — OpenMetadata.

Access

Terminal input
kubectl -n <namespace> port-forward service/<release> 8585:8585

Troubleshooting

Terminal input
kubectl -n <namespace> get jobs
kubectl -n <namespace> logs job/<release>-register-datasources

Common failures: OpenMetadata not yet ready, authentication error, missing Secret, or unreachable target service.

Uninstallation

Terminal input
helm uninstall <release> -n <namespace>

Authentication and security

Advanced authentication providers (e.g. LDAP) are outside the scope of this package documentation; align with the official OpenMetadata documentation and your values file. See Security — OpenMetadata for what the TDP documentation covers.