Skip to main content

DataHub Releases

Summary

VersionRelease DateLinks
v1.6.02026-05-21Release Notes, View on GitHub
v1.5.0.72026-05-19Release Notes, View on GitHub
v1.5.0.62026-05-11Release Notes, View on GitHub
v1.5.0.52026-05-07Release Notes, View on GitHub
v1.5.0.42026-05-06Release Notes, View on GitHub
v1.5.0.32026-04-25Release Notes, View on GitHub
v1.5.0.22026-04-13Release Notes, View on GitHub
v1.5.0.12026-03-25Release Notes, View on GitHub
v1.5.02026-03-24Release Notes, View on GitHub
v1.4.0.32026-02-19View on GitHub

v1.6.0

Released on 2026-05-21 by @david-leifker.

DataHub v1.6.0

Full upgrade guidance, including every breaking change and migration step: Updating DataHub — v1.6.0.


Feature highlights

UI and experience
  • V2 UI only — Legacy V1 UI code is removed; V2 is the sole interface with an updated collapsible navigation and home hero toggler (#17468).
  • Play 3 frontenddatahub-frontend runs on Play 3 and Apache Pekko for improved security and maintainability (#17214).
  • Theming — Broader migration to semantic color tokens across entity, lineage, home, and settings surfaces.
  • Security hardening — Content-Security-Policy on the frontend (#17277), sanitized API error responses (#17452), URL validation before rendering links (#17489), and home page template/module scope checks (#17487).
Ingestion — new sources
Ingestion — major connector improvements
  • Sigma — Data Models on by default, formula-resolved chart lineage, customSQL warehouse lineage, cross-DM fine-grained lineage, per-connection connection_to_platform_map, and workbook element-to-element edges (#17276, #17196, #17296, #17347, #17369, #17370, #17086, and related).
  • Databricks Unity Catalog — Opt-in Metric Views (include_metric_views), UPSERT ownership/properties by default (#17380, #16873).
  • BigQuery — Faster policy-tag extraction via INFORMATION_SCHEMA; richer external table metadata (#17407, #16348).
  • Glue — JDBC upstream lineage, Iceberg lineage, job subtype, structured properties on schema fields (#16505, #16562, #16636, #17325).
  • Kafka Connect — Column-level lineage for sink connectors (#16515).
  • Power BIbrowsePathsV2 hierarchy, Sql.Databases M-Query support, workspace external URLs (#16621, #16616, #16934).
  • Fivetran — Per-destination platform discovery in hybrid API + log mode (#17217).
  • Athena — Correct upstream URNs for Glue- and Iceberg-backed tables (#16842).
  • Fabric OneLake — View ingestion with column-level lineage; query usage from queryinsights (#17215, #17284).
  • dbt — Configurable URN lowercasing, stats from catalog.json, assertion severity and improved ERROR vs FAILURE mapping (#16358, #16044, assertion PRs).
  • SQL profiling — SQLAlchemy profiler is the default for SQL connectors (faster, no Great Expectations dependency by default) (#17465).
  • Postgres — Stored-procedure SQL bodies and lineage improvements (#16871).
  • Confluence — Page HTML converted to Markdown (#17475).
Search, assertions, and metadata
  • Semantic search — Elasticsearch 8.18+ semantic search; Vertex AI and local Ollama embedding providers (#17230, #17255, #17201).
  • Assertions — Failure severity in APIs and UI; failure configuration SDK (#17335, #17355, #17457).
  • Structured properties — Stricter GMS validation; CSV enricher support (#16779).
  • Search filtersCriterion / FacetFilterInput use values arrays only (see breaking changes).
  • GraphQL — Request bodies minified on the wire (#17392).
Operations and platform
  • Java 25 LTS in official Docker images; Java 21 build toolchain (#17340, #16912).
  • Spring Boot 4 on GMS and Java services (#16816, #17351) — see breaking changes if you ship custom extensions.
  • Micrometer / Prometheus — Actuator on port 4319 by default; JMX agent 1.0.1 with /metrics scrape path (see breaking changes).
  • Elasticsearch ZDU — Optional zero-downtime side upgrade path via Helm (#16887).
  • Helm — Cluster-wide metricsMode, Cleanup pre-delete hook, consolidated system-update path (deprecations in upgrade doc).
  • REST emitter — Configurable connection pool (#16486).
  • Multi-entity domain and ownership transformers (#16798).

Breaking changes

Review the Breaking Changes section in Updating DataHub before upgrading. Summary of items that may require action:

AreaWhat changedWho is affected
Spring Boot 4GMS and Java services move to Spring Boot 4.0.5/4.0.6 (Spring Framework 7, Spring Kafka 4).Custom GMS plugins or Spring extensions — recompile and retest. Standard Docker/Helm installs: no change if you do not extend the server classpath.
V1 UI removedV1 UI code deleted; V2 required (THEME_V2_* / Helm theme_v2).Anyone still on V1 env flags.
Play 3 + DATAHUB_SECRETFrontend on Play 3; secret must be ≥32 bytes or startup fails.Compose / hand-crafted short secrets (Helm usually OK).
SQL profiling defaultDefault profiler is sqlalchemy, not Great Expectations.Recipes with method: ge need acryl-datahub[profiling-ge].
Search valuevaluesSingular value on filters removed; use values array only.Custom REST/GraphQL/SDK clients.
SigmaData models on by default; lineage URN/field behavior changes; connection_to_platform_map for Redshift.Sigma ingestion operators.
Structured propertiesOrphan assignments dropped by default (configurable).Ingestion pipelines with stale property URNs.
Athena / Fivetran / two-tier SQLUpstream URN changes (Athena, Fivetran hybrid multi-destination); stored-procedure URN shape (MySQL, MariaDB, Hive, ClickHouse, Teradata).Lineage keyed on old URNs.
Unity CatalogOwnership/properties UPSERT by default.Manual owners merged via PATCH.
BigQuery policy tagsNew extraction path only; old path removed.extract_policy_tags_from_catalog: true.
Micrometer / JMXActuator on 4319; JMX scrape at /metrics.Prometheus/Grafana scrape configs.
Actions / KafkaDefault async offset commits (higher throughput; possible redelivery).Custom actions needing sync commits.
AuthcorpUserInfo.active ignored for sessions.Login gating on deprecated active.
Vertex AIModel version set URNs scoped per project.Orphaned version sets after upgrade.
dbt assertionsInfrastructure failures → ERROR not FAILURE; new severity.Dashboards filtering FAILURE only.
DataplexRenamed filter config fields.Recipes using old keys.
Docker buildBASE_IMAGE, apkRepositoryUrl build args.Custom image builds.

Potential downtime: Reindexing, optional Elasticsearch ZDU, first system-update after bootstrap moves, and aspect schema version sweep on large catalogs — documented under v1.6.0 in Updating DataHub.

Deprecations: Helm per-workload monitoring → global.datahub.monitoring, consolidated system-update, Great Expectations profiler legacy, Glossary Term AI automation — see the v1.6.0 Deprecations section in Updating DataHub.


Contributors

Thank you to everyone who contributed to v1.6.0. For the complete changelog, compare v1.5.0.7...v1.6.0.

v1.5.0.7

Released on 2026-05-19 by @david-leifker.

Security Update

Full Changelog: https://github.com/datahub-project/datahub/compare/v1.5.0.6...v1.5.0.7

v1.5.0.6

Released on 2026-05-11 by @david-leifker.

Remove kubectl binary

Full Changelog: https://github.com/datahub-project/datahub/compare/v1.5.0.5...v1.5.0.6

v1.5.0.5

Released on 2026-05-07 by @david-leifker.

Security Update

Full Changelog: https://github.com/datahub-project/datahub/compare/v1.5.0.4...v1.5.0.5

v1.5.0.4

Released on 2026-05-06 by @david-leifker.

Security Update

Full Changelog: https://github.com/datahub-project/datahub/compare/v1.5.0.3...v1.5.0.4

v1.5.0.3

Released on 2026-04-25 by @david-leifker.

Security Patch Release

Full Changelog: https://github.com/datahub-project/datahub/compare/v1.5.0.2...v1.5.0.3

v1.5.0.2

Released on 2026-04-13 by @david-leifker.

Security Patch Release

Full Changelog: https://github.com/datahub-project/datahub/compare/v1.5.0.1...v1.5.0.2

v1.5.0.1

Released on 2026-03-25 by @david-leifker.

Changes

Include bundled venvs for datahub-gc, datahub-documents

Full Changelog: https://github.com/datahub-project/datahub/compare/v1.5.0...v1.5.0.1

v1.5.0

Released on 2026-03-24 by @david-leifker.

DataHub v1.5.0 Release Notes

Helm Chart Requirement: 0.9.2 Full technical release notes: Updating DataHub

Product Features

  • V1 UI officially sunset. All development targets V2 UI going forward. Ensure THEME_V2_ENABLED=true and THEME_V2_DEFAULT=true.
  • Multiple data products per asset (backend and UI).
  • Policy targeting by Glossary Terms and Groups.
  • Domain-scoped policies now include child domain assets.
  • datahub search CLI with semantic search, query projection, and agent-context integration.

Platform

  • Java 17 runtime required. Spark upgraded to 3.3.4, Hadoop to 3.3.6. Spark lineage users must be on Spark 3.3.0+.
  • Default token signing key & salt removed. Operators must explicitly set DATAHUB_TOKEN_SERVICE_SIGNING_KEY and DATAHUB_TOKEN_SERVICE_SALT. Helm users are unaffected.
  • Retention service disabled \= no version history. Only the current version (v0) is retained when the retention service is not enabled.
  • TLS 1.0/1.1 disabled on frontend custom truststores.
  • Elasticsearch reindex/index-creation retries for improved upgrade resilience.
  • Kubernetes optional scale-down during system-update for blocking upgrades like reindexing. Disabled by default.
  • SDK: emit_mcps() now returns List[TraceData] instead of int. Trace IDs exposed for SYNC_PRIMARY and ASYNC modes.
  • Reproducible ingestion Docker builds via pinned transitive dependencies (uv.lock, constraints.txt).
  • Python deps migrating from setup.py to pyproject.toml (PEP 621); setup.py still the editing source for now.

Ingestion

New Connectors
  • RDF, Snowplow, Apache Doris
Breaking Changes (see migration guide for details)
  • PowerBI M-Query lineage rewritten using Microsoft's official parser. native_query_parsing: false behavior changed.
  • SQL view query IDs now use SHA-256 hashes — old query entities become orphaned. Use stateful ingestion to clean up.
  • Oracle multitenant URNs now use PDB name instead of CDB name when connecting via service_name.
  • Fabric OneLake workspace containers moved to fabric platform (from fabric-onelake).
  • Vertex AI pipeline URNs restructured for stable DataFlow entities; ML Metadata extraction enabled by default (requires additional GCP permissions).
  • DataHub source now uses URN pattern filtering to exclude secrets, ingestion sources, and execution requests by default.
  • Kafka Connect Debezium SQL Server platform changed from sqlserver to mssql.
Enhancements
  • dbt: Semantic model and exposures ingestion; convert_urns_to_lowercase option for case-insensitive platforms.
  • Snowflake: Metadata pattern pushdown, table type filtering, external DMF assertion ingestion.
  • Power BI: Column-level lineage enabled by default.
  • Kafka Connect: Debezium and Confluent JDBC sink connector support; bundled JVM removes system Java requirement.
  • SQL parsing: Major CTE/subquery join resolution performance improvements across all SQL-based connectors.
  • Mode: Concurrent API fetching, response caching, SQL parsing optimizations.
  • Trino: Column-level lineage on upstream datasets.
  • Iceberg: Ingestion-time domain assignment.
  • Azure Data Factory: Column lineage for Copy activity.
  • Airflow plugin: Multi-statement SQL parsing for lineage.
  • Sigma: Workbook filtering.
  • BigQuery: convert_column_urns_to_lowercase option.
  • Kafka source: Option to disable Avro schema name validation.
  • Great Expectations & SQLAlchemy profilers brought to feature parity.
  • Browse paths: DataFlow/DataJob entities get browsePathsV2 with platform instance when configured.
  • Vertex AI: Cross-platform lineage, hierarchical UI folders, stateful ingestion for large projects.
  • Oracle: Fixed container naming with service_name.
  • Configurable report sample sizes and richer failure logging.

Deprecations

  • Vertex AI: regionregions, project_idproject_ids. Old fields still work.
  • Vertex AI: normalize_external_dataset_paths will default to true in the next major version.

Full Changelog: https://github.com/datahub-project/datahub/compare/v1.4.0.3...v1.5.0

v1.4.0.3

Released on 2026-02-19 by @jjoyce0510.

View the release notes for v1.4.0.3 on GitHub.