Configuring TDP Kubernetes components
Each component has a configuration page with the Helm chart parameters. Depending on the component, there are also Security, Ingress, and Integrations pages.
Read General configuration to understand concepts and patterns shared by all components: how to pass values, resources, PVCs, Secrets, external PostgreSQL, and S3.
Configuring TDP Kubernetes components
General configuration
General concepts for configuring TDP Kubernetes components: namespaces, resources, images, and Secrets.
Airflow
Umbrella chart: Airflow with values under `tdp-airflow`; Kubernetes executor by default.
Configuration
Executor, internal or external PostgreSQL, DAG/log PVCs, `TDPConfigurations.s3Connection`.
Security
Optional LDAP via `tdp-airflow.ldap` and `tdp-airflow.extraEnv` (Flask-AppBuilder).
Ingress
External exposure via Ingress with optional hostname and TLS.
Integrations
TDP helpers: external PostgreSQL (`externalDatabase`) and S3 connection Secret.
ArgoCD
GitOps on the Kubernetes cluster with automatic Application synchronization.
ClickHouse
Columnar analytical database on Kubernetes with HA, S3, and consumption by other TDP tools.
Configuration
Installation, main parameters, persistence, resources, and an HA starter example.
Security
Passwords, default user access control, additional profiles, and LDAP.
Ingress
Play UI and Dashboard exposure via Ingress with a dedicated hostname.
Integrations
S3/MinIO, storage policy, and use from CloudBeaver, Superset, and Trino.
CloudBeaver
Web-based database administration on Kubernetes.
Delta Lake
Maintenance routines and chart-supported parameters.
Hive Metastore
Chart parameters and supported internal or external PostgreSQL options.
Iceberg
Iceberg maintenance jobs with Spark runtime and dependency on Hive Metastore and S3.
JupyterLab
Multi-user notebooks on Kubernetes with Spark and RWX StorageClass.
Kafka
Kafka on Kubernetes using the Strimzi operator with CRDs.
NiFi
NiFi on Kubernetes with TLS managed by cert-manager and Ranger integration.
OpenMetadata
Data governance on Kubernetes with PostgreSQL and service integrations.
Apache Ozone
S3-compatible object storage for TDP on Kubernetes.
PostgreSQL
PostgreSQL chart parameters, persistence, resources, and metrics.
Ranger
Centralized access control for Kafka, NiFi, and Trino.
Spark
Spark on Kubernetes with Spark Operator, Hive Metastore, Delta Lake, and Apache Ozone.
Superset
Analytical dashboards on Kubernetes with Redis and PostgreSQL.
Trino
Distributed queries on Kubernetes with Hive, Iceberg, and Delta catalogs.