2020-03-10 23:32:50 -07:00
# Features of DataHub
2020-03-11 04:56:22 -07:00
DataHub is made up of a [generic backend ](what/gma.md ) and a [Ember-based UI ](../datahub-web ). Original DataHub
[blog post ](https://engineering.linkedin.com/blog/2019/data-hub ) talks about the design extensively and mentions some of
2020-03-10 23:32:50 -07:00
the features of DataHub. Our open sourcing [blog post ](https://engineering.linkedin.com/blog/2020/open-sourcing-datahub--linkedins-metadata-search-and-discovery-p )
2020-03-11 04:56:22 -07:00
also provides a comparison of some features between LinkedIn production DataHub vs open source DataHub. Below is a list of the latest features that are available in DataHub, as well as features that will soon become available.
2020-03-10 23:32:50 -07:00
## Data Constructs (Entities)
### Datasets
- **Search**: full-text & advanced search, search ranking
2020-03-11 04:56:22 -07:00
- **Browse**: browsing through a configurable hierarchy
2020-03-10 23:32:50 -07:00
- **Schema**: table & document schema in tabular and JSON format
- **Coarse grain lineage**: support for lineage at the dataset level, tabular & graphical visualization of downstreams/upstreams
- **Ownership**: surfacing owners of a dataset, viewing datasets you own
- **Dataset life-cycle management**: deprecate/undeprecate, surface removed datasets and tag it with "removed"
- **Institutional knowledge**: support for adding free form doc to any dataset
2020-03-11 04:56:22 -07:00
- **Fine grain lineage**: support for lineage at the field level [*available soon*]
- **Social actions**: likes, follows, bookmarks [*available soon*]
- **Compliance management**: field level tag based compliance editing [*available soon*]
- **Top users**: frequent users of a dataset [*available soon*]
2020-03-10 23:32:50 -07:00
### Users
- **Search**: full-text & advanced search, search ranking
2020-03-11 04:56:22 -07:00
- **Browse**: browsing through a configurable hierarchy [*available soon*]
2020-03-10 23:32:50 -07:00
- **Profile editing**: LinkedIn style professional profile editing such as summary, skills
2020-03-11 04:56:22 -07:00
### Metrics [*available soon*]
- **search**: full-text & advanced search, search ranking
- **Browse**: browsing through a configurable hierarchy
- **Basic information**: ownershp, dimensions, formula, input & output datasets, dashboards
- **Institutional knowledge**: support for adding free form doc to any metric
### Dashboards [*available soon*]
- **search**: full-text & advanced search, search ranking
- **Basic information**: ownership, location
- **Institutional knowledge**: support for adding free form doc to any dashboards
2020-03-10 23:32:50 -07:00
## Metadata Sources
2020-03-11 04:56:22 -07:00
You can integrate any data platform to DataHub easily. As long as you have a way of *Extracting* metadata from the platform and *Transform* that into our standard [MCE ](what/mxe.md ) format, you're free to *Load* /ingest metadata to DataHub from any available platform.
We have provided example [ETL ingestion ](architecture/metadata-ingestion.md ) scripts for:
2020-03-10 23:32:50 -07:00
- Hive
- Kafka
- RDBMS
- MySQL
2020-03-11 04:56:22 -07:00
- LDAP