datahub/docs/roadmap.md
2020-07-22 07:03:06 -07:00

2.5 KiB

DataHub Roadmap

Below is DataHub's roadmap for the short and medium term. We'll revise this on a regular basis and welcome suggestions from the communities.

ETAs are revisted on a regular basis and are subject to change. If you would like to see anything prioritized higher than currently is on this list, please reach out to us and we can discuss it!

Short term (3 months) [ETA October 2020]

Dashboards as entities

  • Models + UI

Jobs & Flows as entities

  • Link datasets to jobs & flows

Strongly consistent local index

  • Add query-after-write capability to local DAO

Gremlin-based Query DAO

  • Support majority of gremlin-compatible graph DBs

Integration tests

  • Add docker-based integration tests

Kubernetes migration

  • Migration from docker-compose to Kubernetes for Docker container orchestration

Medium term (3 - 6 months) [ETA January 2021]

Aspect-specific MCE & MAE

  • Split up unified events to improve scalability & modularity

Dataset field-level lineage

  • Models + impact analysis

Data Concepts

  • Models + UI

Metrics as entities

  • Models + UI

Schemas as an entity

  • Make schemas searchable
  • Support GraphQL schemas

Entity Insights

  • UI to highlight high value information about Entities within Search and Entity Pages

Data privacy management for datasets

  • Simple tag-based data privacy metadata

Operational metadata

  • Indexing in OLAP store (Pinot) with TTL

Social features

  • Users will be able to like and follow entities
  • Dataset & field-level commenting

Templatized UI

  • Config-driven UI
  • Generate TypeScript types from Pegasus

Add GraphQL endpoint to GMS

  • Use GraphQL exclusively for frontend queries

Adopt Redux

  • Use Redux exclusively for UI state management

Apache incubation

  • Donate code to Apache foundation

Azure deployment

  • Run DataHub in Azure and provide how-to guides

Long term (6 months - 1 year)

Microservices as an entity

  • Initially focus on rest.li services & GraphQL integration

JNoSQL-based Local DAO

  • Support a wide range of document stores

AWS & GCP deployment

  • Run DataHub in AWS & GCP and provide how-to guides

Visionary Goals (1 year+)

Rewrite midtier in Node

  • TypeScript-only frontend development

gRPC + protobuf

  • Modeling in protobuf + serving in gRPC

UI for metadata graph exploration