660 Commits

Author SHA1 Message Date
John Joyce
1aede64465
bug(docker react): Fix react docker image build (#2118)
Co-authored-by: John Joyce <john@acryl.io>
2021-02-17 23:11:32 -08:00
RyanHolstien
ea86ade29b
feat: ML Model Backend Implementation (#1896)
Co-authored-by: RyanHolstien <rholstien@expediagroup.com>
2021-02-17 13:28:13 -08:00
John Joyce
715fb7d7f7
bug(docker): Removing datahub-gms-graphql-service from default docker-compose.yml file (#2111) 2021-02-16 21:47:38 -08:00
John Joyce
11f030b118 Adding Github Action to publish Docker image 2021-02-15 17:40:41 -08:00
John Joyce
ef04205a73 Serving React App via Docker 2021-02-15 17:40:41 -08:00
John Joyce
e3ac44cfd4 Fixes 2021-02-15 17:40:41 -08:00
John Joyce
8ef5e2a545 Deploy React via Docker 2021-02-15 17:40:41 -08:00
Arun Vasudevan
84e952e138
feat (graphql): Datahub GMS Graphql Api Application for Querying Dataset (#2071) 2021-02-01 11:51:15 -08:00
Satyaprakash Bommaraju
782e29ce53
Fix for Kafka-UI Connectivity Error with Kafka-Rest Proxy (#2053)
Fixes error when acessing http://localhost:18000 where the Kafka-UI was unable to connect to Rest Proxy
2021-01-12 10:30:47 -08:00
Mars Lan
36b79a3ef3
build(docker): add script to clean up docker environment (#2013)
Co-authored-by: Mars Lan <mars@trayminder.com>
2020-12-17 13:52:31 -08:00
John Plaisted
838f964114
feat: add elasticsearch sanity integration tests (#2028)
These tests verify that, given an index settings and mappings, data can be written to the index, and read from it with a query_all query. These are very simple sanity tests.

We can, and should, write more complex tests that specific to each index in the future.
2020-12-02 20:49:34 -08:00
John Plaisted
5f9d967451
fix: ingestion docker image (#2027)
The environment was not set correctly, so it could not fire kafka events. It (mce-cli) always worked when running outside of docker.

I also added a dev ingestion docker image / script which may be much faster if you've already built locally.

Tested:
1. Cleaned docker volumes and started datahub. Verified it is empty.
2. Built with gradle.
3. Ran ./docker/ingestion/ingestion-dev.sh. Verified data shows in DataHub.
4. Ran step 1 again.
5. Ran ./docker/ingestion/ingestion.sh. Verified data shows in DataHub.
2020-12-02 17:40:12 -08:00
Nagarjuna Kanamarlapudi
a1e7e26e08
Fix dataset index creation issue (#2022) 2020-11-30 18:33:06 -08:00
Kerem Sahin
4d8320e4a0
feat(dashboard): Dashboards backend implementation (#1884) 2020-11-23 09:25:58 -08:00
Nagarjuna Kanamarlapudi
5d083143db
feat(dataset): Enable search of datasets by field names (#2001)
* feat(dataset): Enable search of datasets by field names
2020-11-20 12:01:07 -08:00
Jyoti Wadhwani
70ddb09d29
feat: enable SCSI for datasets (#1986)
* enable SCSI for datasets

* Update scsi-onboarding-guide.md
2020-11-11 13:04:20 -08:00
Kerem Sahin
b989f9d16a
Upgrade neo4j to 4.0 (#1960) 2020-10-26 05:31:00 -07:00
Mars Lan
93805e7f1a
build(docker): use community version of ES & Kibana in quickstart (#1929)
Fixes #1928
2020-10-07 21:21:08 -07:00
Fredrik Sannholm
125ae288f1
docker: Run as non-root user in docker (#1914) 2020-10-06 04:35:38 -07:00
Mars Lan
a13ca65e02
Update README.md 2020-09-30 06:20:14 -07:00
Grant Nicholas
9bcf273661
fix(docker): update mae and mce consumer images to include glibc compat layer. allows the consumer jobs to deal with snappy compressed kafka topics when running on alpine linux (#1899) 2020-09-28 15:30:56 -07:00
John Plaisted
821bce7d69
feat: Port mce-cli to Java. (#1871)
Port mce-cli to Java.

Also moved off the avro format event file to json instead. Much nicer to use :)
2020-09-25 14:05:29 -07:00
Fredrik Sannholm
d50b9c01b4
fix (docker): Fix install of Chrome in frontend Dockerimage (#1889)
* fix (docker): Fix install of Chrome in frontend Dockerimage

Retry installing Chrome after dependencies have been installed

* fix (docker): Install Chrome with apt-get

Install Chrome and dependencies at the same time, using apt-get
2020-09-22 12:02:37 -07:00
Kerem Sahin
ece9b82f7a
Update README.md 2020-08-19 21:39:46 -07:00
Kerem Sahin
21a5c9e607
Update README.md 2020-08-19 21:38:03 -07:00
na zhang
97424509d1
add description field for dataset index mapping (#1791) 2020-08-09 17:35:17 -07:00
Mars Lan
aa0a62e991
Update README.md 2020-08-08 04:58:55 -07:00
John Plaisted
b8e18b0b5d
refactor(docker): make docker files easier to use during development. (#1777)
* Make docker files easier to use during development.

During development it quite nice to have docker work with locally built code. This allows you to launch all services very quickly, with your changes, and optionally with debugging support.

Changes made to docker files:
- Removed all redundant docker-compose files. We now have 1 giant file, and smaller files to use as overrides.
- Remove redundant README files that provided little information.
- Rename docker/<dir> to match the service name in the docker-compose file for clarity.
- Move environment variables to .env files. We only provide dev / the default environment for quickstart.
- Add debug options to docker files using multistage build to build minimal images with the idea that built files will be mounted instead.
- Add a docker/dev.sh script + compose file to easily use the dev override images (separate tag; images never published; uses debug docker files; mounts binaries to image).
- Added docs/docker documentation for this.
2020-08-06 16:38:53 -07:00
Chris Lee
4143fb901e
<refactor>[ingestions]: align the default kafka topics with PR #1756 (#1758) 2020-07-29 20:26:01 -07:00
Mars Lan
00d89115b2
feat(gms): add postgres & mariadb supports to GMS (#1742)
* feat(gms): add postgres & mariadb supports to GMS

Also add corresponding docker-compose files

* Update README.md
2020-07-22 19:39:58 -07:00
Liangjun Jiang
5d078aa617
Implemented data process search feature (#1706)
* implement search feature

* add test for dataprocessIndexBuilder; refactor code based on feedback

* update based on PR feedback

* Update DataProcessDocument.pdl

fixed typo wording.

* add not null check for data process info
2020-06-29 10:20:22 -07:00
Kerem Sahin
2dc11a51f4
fix(py3): Bump ingestion Docker py dependency to 3.6 (#1716) 2020-06-29 08:22:50 -07:00
Kerem Sahin
9501e9bd70 docs: Graph onboarding demo 2020-06-26 01:10:44 -07:00
Mars Lan
34d6f4ed09
Update README.md 2020-06-25 19:24:03 -07:00
Liangjun Jiang
92c4a3689e
Data process entity (#1680)
* add job info as aspect of a dataset

* add job urn def., aspect and entity

* job entity with upstream and downstream lineage

* use job urn in upstream & downstream

* add Job entity rest APIs

* rest.li api, impl and factory for job entity

* code cleanup

* use pdl; onboard data process entity

* add es index json

* fix gradlew build ignored tasks

* add a comment about data process info field

* fix style warning issues

* update content based on PR

* checked in generated snapshot json

* updated based on PR feedback

* update data process data format

* updated based on code review feedback

* revert back gms & mce-job docker image

* delete temp files

* update based pr feedback

* file name and a typo

* format with linkedin style

Co-authored-by: Liangjun <liajiang@expediagroup.com>
2020-06-09 15:42:08 -07:00
Mars Lan
4f221f9a12
build(docker): refactor docker build scripts (#1687)
* build(docker): refactor docker build scripts

- add "build" option to docker-compose files to simplify rebuilding of images
- create "start.sh" script so it's easier to override "command" in the quickstart's docker-compose file
- use dockerize to wait for requisite services to start up
- add a dedicated Dockerfile for kafka-setup

This fixes https://github.com/linkedin/datahub/issues/1549 & https://github.com/linkedin/datahub/issues/1550
2020-06-08 13:37:14 -07:00
Mars Lan
94ffb300a9
build(docker): refactor ingestion docker build script (#1690)
- add "build" option to docker-compose file to simplify rebuilding of images
- move command from docker-compose.yml to Dockerfile
- add ingestion.sh script to simplify quickstart instruction and to reduce confusion
2020-06-05 14:39:20 -07:00
Mars Lan
920a1774dc
docs: points to docker images hosted by linkedin org (#1683)
As we're now utilize GitHub Actions to build & publish docker images to docker hub under linkedin org
Also allow overriding image tags via DATAHUB_VERSION environment variable
2020-06-01 09:36:51 -07:00
Mars Lan
505c8c03ca
build: further clean up gms Dockerfile 2020-05-29 16:14:46 -07:00
Mars Lan
e812dd7a16
build: clean up gms Dockerfile 2020-05-29 16:06:15 -07:00
Mars Lan
509b2e1515
refactor: use named volume instead of bind mount in quickstart (#1669)
Volume is the preferred method over bind mount (https://docs.docker.com/storage/volumes/) for persistent container data.
This also eliminates the need for the ugly chmod hack for elasticsearch and hopefully fixes https://github.com/linkedin/datahub/issues/1650
2020-05-11 09:06:38 -07:00
e11it
31887dfbea
fix(quickstart): set utf8mb4 for mysql (#1657)
Co-authored-by: Ilya Makarov <makarov_ia@nlmk.com>
2020-05-04 20:04:25 -07:00
Arun Vasudevan
8c9787c19c
feat: convert MAE application to springboot (#1635)
* Convert MAE to Spring boot
* Fix after testing
* Changes after testing
* Add file appender for gms and update doc type for ESv5.6
* Review comments
* Fix Review Comments
2020-04-10 10:23:24 -07:00
Arun Vasudevan
e56e4eafd2
feat: Converting MCE to a Spring boot Application (#1629) 2020-04-06 10:16:41 -07:00
Bharat
e7feaf1fc0
fix: Reduce mae-consumer & mce-consumer docker image sizes (#1632) 2020-04-05 09:30:05 -07:00
Kerem Sahin
a28f4db5e0
fix(Docker): Fixing 'dockerize not found' issue while starting datahub-gms container using quickstart (#1631) 2020-04-04 13:34:28 -07:00
Bharat
51c0a5c6fe
fix: Reduce gms & frontend docker image sizes (#1630)
* fix: Reduce gms docker image size to 254MB from 1.15GB
* fix: Reduce frontend docker image size to 294MB from 2.68GB
2020-04-04 07:51:29 -07:00
Mars Lan
6cb3f94b82
fix: elasticsearch not starting on Mac (#1621)
The `chown 1000:1000` trick mentioned in https://discuss.elastic.co/t/elastic-elasticsearch-docker-not-assigning-permissions-to-data-directory-on-run/65812/4 only works on Linux. Make the directory world writeable instead so that it works on both Mac & Linux.
2020-04-02 07:19:52 -07:00
Kerem Sahin
34d1ba3f63 build(quickstart): Small fix in quickstart.sh & doc update
1. Use "source" while executing quickstart.sh to bring env var to the context of calling shell
2. Use sudo while running chown in quickstart.sh
3. Update main readme
4. Add missing container names
2020-03-23 16:11:35 -07:00
Albert Franzi
5ac098fbdd
feature: 🐳 - Allow to store Quickstart dockers data in a folder for persistance (#1568)
* feature: 🐳 - Allow to store Quickstart dockers data in a folder for persistance

* bump: Update Kafka dockers to 5.4.0

* feature: 🐳 - Add kafka-topics-ui docker to the kafka docker folder

* refactor: Provide a quickstart.sh script to start all dockers
2020-03-23 13:43:51 -07:00