<!-- HOSTED_DOCS_ONLY
import useBaseUrl from '@docusaurus/useBaseUrl';

export const Logo = (props) => {
  return (
    <div style={{ display: "flex", justifyContent: "center", padding: "20px" }}>
      <img
        height="150"
        alt="DataHub Logo"
        src={useBaseUrl("/img/datahub-logo-color-mark.svg")}
        {...props}
      />
    </div>
  );
};

<Logo />

<!--
HOSTED_DOCS_ONLY-->
<p align="center">
<img alt="DataHub" src="docs/imgs/datahub-logo-color-mark.svg" height="150" />
</p>
<!-- -->
# DataHub: The Metadata Platform for the Modern Data Stack
## Built with ❤️ by <img src="https://datahubproject.io/img/acryl-logo-light-mark.png" width="25"/> [Acryl Data](https://acryldata.io) and <img src="https://datahubproject.io/img/LI-In-Bug.png" width="25"/> [LinkedIn](https://engineering.linkedin.com)

[Latest release](https://github.com/datahub-project/datahub/releases/latest) |
[PyPI: acryl-datahub](https://badge.fury.io/py/acryl-datahub) |
[Build & test](https://github.com/datahub-project/datahub/actions?query=workflow%3A%22build+%26+test%22+branch%3Amaster+event%3Apush) |
[Docker Hub: linkedin/datahub-gms](https://hub.docker.com/r/linkedin/datahub-gms) |
[Slack](https://slack.datahubproject.io) |
[Contributing](https://github.com/datahub-project/datahub/blob/master/docs/CONTRIBUTING.md) |
[Pull requests](https://github.com/datahub-project/datahub/pulls?q=is%3Apr) |
[License](https://github.com/datahub-project/datahub/blob/master/LICENSE) |
[YouTube](https://www.youtube.com/channel/UC3qFQC5IiwR5fvWEqi_tJ5w) |
[Medium](https://medium.com/datahub-project) |
[Twitter](https://twitter.com/datahubproject)

### 🏠 Project Homepage: [datahubproject.io](https://datahubproject.io/)

---

[Quickstart](https://datahubproject.io/docs/quickstart) |
[Documentation](https://datahubproject.io/docs/) |
[Features](https://datahubproject.io/docs/features) |
[Roadmap](https://datahubproject.io/docs/roadmap) |
[Adoption](#adoption) |
[Demo](https://datahubproject.io/docs/demo) |
[Town Hall](https://datahubproject.io/docs/townhalls)

---

> 📣 Next DataHub town hall meeting on May 26th, 9am-10am PDT ([convert to your local time](https://greenwichmeantime.com/time/to/pacific-local/))
>
> - Topic Proposals: [submit here](https://docs.google.com/forms/d/1v2ynbAXjJlqY97xE_X1DAntNrXDznOFiNfryUkMPtkI/)
> - Sign up to get a calendar invite: [here](https://rsvp.datahubproject.io)
> - Town-hall Zoom link: [zoom.datahubproject.io](https://zoom.datahubproject.io)
> - [Meeting details](docs/townhalls.md) & [past recordings](docs/townhall-history.md)

> ✨ Latest Update:
>
> - Read our Monthly Project Updates [here](https://blog.datahubproject.io/tagged/project-updates).
> - Bringing The Power Of The DataHub Real-Time Metadata Graph To Everyone At Acryl Data: [Data Engineering Podcast](https://www.dataengineeringpodcast.com/acryl-data-datahub-metadata-graph-episode-230/)
> - Unleashing Excellent DataOps with LinkedIn DataHub: [DataOps Unleashed Talk](https://www.youtube.com/watch?v=ccsIKK9nVxk).
> - Latest blog post [DataHub: Popular Metadata Architectures Explained](https://engineering.linkedin.com/blog/2020/datahub-popular-metadata-architectures-explained) @ LinkedIn Engineering Blog.
> - We're on [Slack](docs/slack.md) now! Ask questions and keep up with the latest announcements.

## Introduction
DataHub is an open-source metadata platform for the modern data stack. Read about the architectures of different metadata systems and why DataHub excels [here](https://engineering.linkedin.com/blog/2020/datahub-popular-metadata-architectures-explained). Also read our [LinkedIn Engineering blog post](https://engineering.linkedin.com/blog/2019/data-hub), check out our [Strata presentation](https://speakerdeck.com/shirshanka/the-evolution-of-metadata-linkedins-journey-strata-nyc-2019), and watch our [Crunch Conference Talk](https://www.youtube.com/watch?v=OB-O0Y6OYDE). You should also visit [DataHub Architecture](docs/architecture/architecture.md) to get a better understanding of how DataHub is implemented.

## Quickstart
Please follow the [DataHub Quickstart Guide](https://datahubproject.io/docs/quickstart) to get a copy of DataHub up & running locally using [Docker](https://docker.com). As the guide assumes some basic knowledge of Docker, we recommend going through the "Hello World" example of [A Docker Tutorial for Beginners](https://docker-curriculum.com) if Docker is completely foreign to you.

## Development
If you're looking to build and modify DataHub, please take a look at our [Development Guide](https://datahubproject.io/docs/developers).

## Demo and Screenshots
There's a [hosted demo environment](https://datahubproject.io/docs/demo) where you can play around with DataHub before installing.

## Source Code and Repositories
- [datahub-project/datahub](https://github.com/datahub-project/datahub): This repository contains the complete source code for DataHub's metadata model, metadata services, integration connectors and the web application.

## Documentation
We have documentation available at [https://datahubproject.io/docs/](https://datahubproject.io/docs/).

## Releases
See the [Releases](https://github.com/datahub-project/datahub/releases) page for more details. We follow the [SemVer Specification](https://semver.org) when versioning releases and adopt the [Keep a Changelog convention](https://keepachangelog.com/) for the changelog format.
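
As a rough illustration of what SemVer implies when comparing two releases, the sketch below parses plain `MAJOR.MINOR.PATCH` versions and reports which component changed. The helper names (`parse`, `bump_kind`), the optional leading `v`, and the version numbers are assumptions made for this example; none of this is part of DataHub, and pre-release or build metadata suffixes are deliberately not handled.

```python
import re
from typing import NamedTuple

# Hypothetical helpers for illustration only; not part of DataHub.
# Accepts plain MAJOR.MINOR.PATCH, optionally prefixed with "v" (an assumption).
_CORE = re.compile(r"v?(\d+)\.(\d+)\.(\d+)")


class Version(NamedTuple):
    major: int
    minor: int
    patch: int


def parse(tag: str) -> Version:
    """Parse 'MAJOR.MINOR.PATCH' (optionally 'v'-prefixed) into a Version."""
    match = _CORE.fullmatch(tag)
    if match is None:
        raise ValueError(f"not a plain MAJOR.MINOR.PATCH version: {tag!r}")
    return Version(*(int(part) for part in match.groups()))


def bump_kind(old: str, new: str) -> str:
    """Return the most significant SemVer component that differs."""
    a, b = parse(old), parse(new)
    if b.major != a.major:
        return "major"  # may include incompatible changes
    if b.minor != a.minor:
        return "minor"  # backward-compatible functionality
    if b.patch != a.patch:
        return "patch"  # backward-compatible bug fixes
    return "none"


if __name__ == "__main__":
    # Made-up version numbers, not real DataHub releases.
    print(bump_kind("v1.2.3", "v1.3.0"))
    # Sorting by the parsed tuple compares components numerically, not as text.
    print(sorted(["1.10.0", "1.9.2", "1.2.10"], key=parse))
```

Comparing parsed tuples rather than raw strings matters because a plain string sort would place `1.10.0` before `1.9.2`.
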
## Features & Roadmap
Check out DataHub's [Features](docs/features.md) & [Roadmap](https://feature-requests.datahubproject.io/roadmap).

## Contributing
We welcome contributions from the community. Please refer to our [Contributing Guidelines](docs/CONTRIBUTING.md) for more details. We also have a [contrib](contrib) directory for incubating experimental features.

### Extending
If you need to understand how to extend our model with custom types, please see [Extending the Metadata Model](docs/modeling/extending-the-metadata-model.md).

## Community
Join our [Slack workspace](https://slack.datahubproject.io) for discussions and important announcements. You can also find out more about our upcoming [town hall meetings](docs/townhalls.md) and view past recordings.

## Adoption
Here are the companies that have officially adopted DataHub. Please feel free to add yours to the list if we missed it.

- [Adevinta](https://www.adevinta.com/)
- [Banksalad](https://www.banksalad.com)
- [Cabify](https://cabify.tech/)
- [DefinedCrowd](http://www.definedcrowd.com)
- [DFDS](https://www.dfds.com/)
- [Expedia Group](http://expedia.com)
- [Experius](https://www.experius.nl)
- [Geotab](https://www.geotab.com)
- [Grofers](https://grofers.com)
- [Haibo Technology](https://www.botech.com.cn)
- [hipages](https://hipages.com.au/)
- [IOMED](https://iomed.health)
- [Klarna](https://www.klarna.com)
- [LinkedIn](http://linkedin.com)
- [Moloco](https://www.moloco.com/en)
- [Peloton](https://www.onepeloton.com)
- [Saxo Bank](https://www.home.saxo)
- [Stash](https://www.stash.com)
- [Shanghai HuaRui Bank](https://www.shrbank.com)
- [ThoughtWorks](https://www.thoughtworks.com)
- [TypeForm](http://typeform.com)
- [Uphold](https://uphold.com)
- [Viasat](https://viasat.com)
- [Wolt](https://wolt.com)
- [Zynga](https://www.zynga.com)

## Select Articles & Talks
- [DataHub Blog](https://blog.datahubproject.io/)
- [DataHub YouTube Channel](https://www.youtube.com/channel/UC3qFQC5IiwR5fvWEqi_tJ5w)
- [Saxo Bank: Enabling Data Discovery in Data Mesh](https://medium.com/datahub-project/enabling-data-discovery-in-a-data-mesh-the-saxo-journey-451b06969c8f)
- Bringing The Power Of The DataHub Real-Time Metadata Graph To Everyone At Acryl Data: [Data Engineering Podcast](https://www.dataengineeringpodcast.com/acryl-data-datahub-metadata-graph-episode-230/)
- [DataHub: Popular Metadata Architectures Explained](https://engineering.linkedin.com/blog/2020/datahub-popular-metadata-architectures-explained)
- [Driving DataOps Culture with LinkedIn DataHub](https://www.youtube.com/watch?v=ccsIKK9nVxk) @ [DataOps Unleashed 2021](https://dataopsunleashed.com/#shirshanka-session)
- [The evolution of metadata: LinkedIn’s story](https://speakerdeck.com/shirshanka/the-evolution-of-metadata-linkedins-journey-strata-nyc-2019) @ [Strata Data Conference 2019](https://conferences.oreilly.com/strata/strata-ny-2019.html)
- [Journey of metadata at LinkedIn](https://www.youtube.com/watch?v=OB-O0Y6OYDE) @ [Crunch Data Conference 2019](https://crunchconf.com/2019)
- [DataHub Journey with Expedia Group](https://www.youtube.com/watch?v=ajcRdB22s5o)
- [Data Discoverability at SpotHero](https://www.slideshare.net/MaggieHays/data-discoverability-at-spothero)
- [Data Catalogue — Knowing your data](https://medium.com/albert-franzi/data-catalogue-knowing-your-data-15f7d0724900)
- [DataHub: A Generalized Metadata Search & Discovery Tool](https://engineering.linkedin.com/blog/2019/data-hub)
- [Open sourcing DataHub: LinkedIn’s metadata search and discovery platform](https://engineering.linkedin.com/blog/2020/open-sourcing-datahub--linkedins-metadata-search-and-discovery-p)
- [Emerging Architectures for Modern Data Infrastructure](https://a16z.com/2020/10/15/the-emerging-architectures-for-modern-data-infrastructure/)

See the full list [here](docs/links.md).

## License
[Apache License 2.0](./LICENSE).