datahub/docs/roadmap.md

54 lines
1.9 KiB
Markdown
Raw Normal View History

2020-02-11 14:14:57 -08:00
# DataHub Roadmap
2020-02-11 12:25:33 -08:00
2020-02-11 14:14:57 -08:00
Below is DataHub's roadmap for the short and medium term. We'll revise this on a regular basis and welcome suggestions from the communities.
2020-02-11 12:25:33 -08:00
2020-02-11 13:36:38 -08:00
## Short term (3-6 months)
2020-02-15 06:26:37 -08:00
### Replace PDSC with PDL
- Simplified syntax + annotations
### Entity-specific MCE & MAE
2020-02-16 06:25:41 -08:00
- Spilt up unified events to improve scalability & modularity
### Jobs & Flows as an entity
- Link datasets to jobs & flows
2020-02-15 06:48:53 -08:00
### Schemas as an entity
2020-02-14 13:26:53 -08:00
- Make schemas searchable
- Support GraphQL shcmeas
2020-02-11 12:25:33 -08:00
### Compliance management for datasets
2020-02-15 06:26:37 -08:00
- Simple tag-based compliance metadata
2020-02-15 16:38:48 -08:00
### Strongly consistent local index
- Add query-after-write capability to local DAO
2020-02-15 06:48:53 -08:00
### Gremlin-based Query DAO
- Support majority of gremlin-compatible graph DBs
2020-02-15 14:54:25 -08:00
### Templatized UI
- Config-driven UI
- Generate TypeScript types from Pegasus
2020-02-11 12:25:33 -08:00
### Kubernetes migration
2020-02-15 06:48:53 -08:00
- Migration from docker-compose to [Kubernetes](https://kubernetes.io/) for Docker container orchestration
2020-02-11 12:25:33 -08:00
### Azure deployment
2020-02-15 14:57:51 -08:00
- Run DataHub in [Azure](https://azure.microsoft.com/en-us/) and provide how-to guides
2020-02-11 12:25:33 -08:00
2020-02-15 06:26:37 -08:00
## Medium term (6 months - 1 year)
2020-02-11 12:25:33 -08:00
### Fine grain lineage
2020-02-16 06:25:41 -08:00
- Models + impact analysis
2020-02-15 06:26:37 -08:00
### Operational metadata
- Indexing in OLAP store (Pinot) with TTL
2020-02-11 12:25:33 -08:00
### Social features
2020-02-15 06:48:53 -08:00
- Users will be able to like and follow entities
2020-02-16 06:25:41 -08:00
- Dataset & field-level commenting
2020-02-16 06:26:27 -08:00
### Microservices as an entity
- Initially focus on rest.li services & GraphQL integration
2020-02-15 06:48:53 -08:00
### Kundera-based Local DAO
- Support a wide range of document stores
2020-02-15 14:54:25 -08:00
### Adtopt Redux
- Use Redux exclusively for UI state management
2020-02-11 12:25:33 -08:00
### Integration tests
2020-02-15 14:54:25 -08:00
- Add docker-based integration tests
2020-02-15 14:57:51 -08:00
### AWS & GCP deployment
- Run DataHub in [AWS](https://aws.amazon.com/) & [GCP](https://cloud.google.com/gcp) and provide how-to guides
2020-02-15 06:49:47 -08:00
### Apache incubation
2020-02-15 06:26:37 -08:00
## Long term (1+ year)
### Rewrite midtier in Node
2020-02-16 06:25:41 -08:00
- TypeScript-only frontend development
2020-02-15 14:56:09 -08:00
### gRPC + protobuf
2020-02-16 06:25:41 -08:00
- Modeling in protobuf + serveing in gRPC