Skip to main content

Version 2.2.0

Highlights

Platform	Version	Highlights
Tecnisys Data Platform	TDP 2.2.0	- Support for Enterprise Linux 8 and 9 operating systems (Rocky Linux, AlmaLinux, Red Hat, among others). - Apache Ambari with extensive support for Python 3. - Inclusion of HBase REST API. - Improvements in Hive authentication mechanism with LDAP. - Use of OpenJDK8 as the default JDK for Ambari. - Fixed issues with Pydruid library for Kerberos authentication. - Kafka-UI as a graphical interface for managing Kafka clusters. - Fixed jaas_conf of Spark for Kerberized environments. - Added hive.server2.leader.zookeeper.namespace property to Hive. - Added Rolling Upgrade option for new TDP stack versions. - Improvements in the integration of various components with SSL/TLS enabled. - Improvements in the integration of Flink, Iceberg, and Zeppelin components. - Addition of ResourceManager UI in Knox topology. - Flink integration with Avro, JDBC, ORC, Parquet, CSV, Hive, Kafka, and Iceberg. - Spark 3.3.4 integration with Delta 2, Elastic, Iceberg, Kafka, PostgreSQL, Phoenix (HBase), and Solr. - Improvements in Ambari Metrics charts. - Updated components: Delta, Iceberg, Grafana (integrated with Ambari Metrics), Phoenix, and Zeppelin.

Table A. Platform Highlights

Component Highlights

See also the highlights of the updated components:

Service	Version	Category	Highlights
Apache Ambari	Ambari 3.0.0.0	Administration	- Modernization of the interface, making it more intuitive and modern. - Improvements and bug fixes in Ambari Metrics. - Operation with Python 3 on EL 8 and 9 operating systems.
Apache Iceberg	Iceberg 1.5.0	Analytics	- Fixed issue in FileIO where an extra header request was made when reading manifests. - Marked HTTP 502 and 504 status codes as retryable in the REST Client. - Bug fixes and improvements in JDBC Catalog.
Apache Phoenix	Phoenix 5.1.3	Analytics	- Improvements in stability and bug fixes with the JDBC client. - Fixes related to transaction management to ensure greater data consistency and integrity. - Query execution improvements, especially for high data volume scenarios, reducing response time and optimizing resource usage. - Improvements in handling and using secondary indexes to speed up read operations. - Updates to better support integration with Apache Spark, allowing the use of Phoenix as an SQL query layer for HBase data within Spark pipelines. - Greater compatibility with recent versions of HBase.
Apache Spark	Spark 3.3.4	Analytics, Data Science, Graph, Streaming	- Adjustments in the behavior of several SQL functions, such as percentile_disc and percentile_approx(), to correctly handle NULL values. - Improvement in the behavior of SQL functions, including to_number, try_to_number, and handling of columns with GROUPING SETS. - Fixes in the behavior of expression pushdown in DataSource V2 and binary comparison functions. - Improved thread management and structural integrity of CoarseGrainedExecutorBackend. - Fixes in performance issues related to TransportClientFactory and the use of await(). - Adjustments to improve compatibility with Kubernetes and YARN environments, ensuring that resource allocation and token renewal are handled correctly. - Dependency updates, including the ORC version to 1.7.10.
Apache Zeppelin	Zeppelin 0.11.0	Notebook	- Default support for JDK 11, which improves compatibility and performance over previous versions. - Support for the latest versions of Apache Spark (3.5.0) and Apache Flink (1.17). - Python 3.9 is now the default version for the Python interpreter, ensuring better support and performance for Python scripts. - Dynamic creation of forms within notebooks to facilitate user interaction with data. - Improvements in data visualization functionalities, including pivot charts and other interactive tools for data analysis. - Bug fixes and security improvements.

Table B. Component Highlights

Component Highlights