292 Commits

Author SHA1 Message Date
John Joyce
2c5edd88ab
feat(graphql): migrating GraphQL API to metadata-service (nee GMS) (#3131) 2021-08-20 10:58:07 -07:00
John Joyce
81eb4f19bf
feat(sso): Just-In-Time User & Group Provisioning on SSO Login (oidc) (#3082) 2021-08-20 07:42:18 -07:00
Dexter Lee
2df9d4fed6
feat: add support for AWS glue schema registry (#3083) 2021-08-11 21:40:37 -07:00
Gabe Lyons
a06c4caf4b
chore: upgrading gma to 0.2.80 (#3070) 2021-08-10 13:29:11 -07:00
John Joyce
352a0abf8d
Introducing TimeSeries Aspects + Dataset Profile (Stats) Aspect (#2983)
Co-authored-by: Dexter Lee <dexter@acryl.io>
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
Co-authored-by: Ravindra Lanka <rlanka@acryl.io>
Co-authored-by: Shirshanka Das <shirshanka@apache.org>
2021-07-30 17:41:03 -07:00
John Joyce
050d940d58
feat(datahub-frontend): Adding basic file-based authentication to datahub-frontend (#2818) 2021-07-02 06:31:01 -07:00
Dexter Lee
ad081138f7
fix(datahub-upgrade): fix vulnerabilities (#2813) 2021-06-30 22:57:39 -07:00
Dexter Lee
8f0f322279
feat(backup): Add restore indices and restore backup tasks (#2779) 2021-06-30 16:49:02 -07:00
Dexter Lee
313da07447
fix(docker): Upgrade to 4.1.44 netty-all library (#2784) 2021-06-28 17:04:12 -07:00
Dexter Lee
3caa09d21a
fix(docker): Fix dependency vulnerability (#2763) 2021-06-24 23:00:55 -07:00
Dexter Lee
e640f5ed03
fix(docker): modernize docker images and fix vulnerabilities (#2746) 2021-06-23 13:55:14 -07:00
John Joyce
97e9660037
feat: No Code Metadata Modeling (#2629)
Co-authored-by: Dexter Lee <dexter@acryl.io>
Co-authored-by: Gabe Lyons <itsgabelyons@gmail.com>
Co-authored-by: Shirshanka Das <shirshanka@apache.org>
2021-06-03 13:24:33 -07:00
Dexter Lee
259e6af494
feat(search-by-field): Add the ability to search for field names (#2286) 2021-03-23 15:18:32 -07:00
John Plaisted
5e91014e00
feat(search) BREAKING Support ElasticSearch 7, drop ES5 (#2263)
Merges in changes from our ES7 branch, and drops support for ES5.

This is a breaking change due to the upgrade; we have an ES5 branch at the commit before this.
2021-03-18 19:16:44 -07:00
John Joyce
08616cc610
feat(react): SSO support simple OIDC authentication (#2190)
Co-authored-by: John Joyce <john@acryl.io>
2021-03-11 13:38:35 -08:00
Rickard Cardell
3252e92a16
feat: neo4j Bolt TLS support (#2100) (#2145)
* Bumping version of neo4j-java-driver to include encryption support that came in 4.0.1.
2021-03-01 12:55:46 -08:00
John Plaisted
d3cad7bba8
Switch GMA dep from bintray to artifactory. (#2149)
* Switch GMA dep from bintray to artifactory.

Bintray is set for deprecation and EoL in a few months.

We still need bintray for pegasus until restli team migrates off of it.

* tick version to version that should only be on artifactory
2021-02-26 14:21:01 -08:00
Harshal Sheth
ad745a2a74
fix: Bump gma version (#2094) 2021-02-09 12:07:02 -08:00
John Joyce
cb2f3a466d
fix(integ test): Bumping GMA to 0.2.40 (#2085)
Co-authored-by: John Joyce <john@acryl.io>
2021-02-05 11:51:56 -08:00
John Plaisted
d89fa4a27b
feat: update GMA to 0.2.35 (#2067)
Required editing the search tests for the new search config tests.
2021-01-28 15:55:51 -08:00
John Joyce
50cec65f57
feat(GQL Queries): Productionalizing GraphQL Part 1: Dataset Query support + adding shared GraphQL module (#2066)
* Productionalizing GraphQL Part 1: Dataset Query support + introducing common datahub-graphql-core module.

Co-authored-by: John Joyce <jjoyce0510@gmail.com>
2021-01-22 15:44:00 -08:00
John Plaisted
838f964114
feat: add elasticsearch sanity integration tests (#2028)
These tests verify that, given an index's settings and mappings, data can be written to the index and read from it with a query_all query. These are very simple sanity tests.

We can, and should, write more complex tests that are specific to each index in the future.
2020-12-02 20:49:34 -08:00
John Plaisted
6f59a91865
Bump GMA to latest (#2007)
Involves fixing some types. Bright side is that ModelValidation actually checks aspects now (it did not before because aspects are not records but unions!).
2020-11-18 14:07:48 -08:00
John Plaisted
60e43061d8
[Breaking] Update to GMA 0.2.0 and fix Urn definitions. (#1977)
Urn definitions needed to be updated since 0.2.0 changed the base Urn class. 

I also added some more urn coercers that were missing.
2020-11-11 16:06:29 -08:00
Jyoti Wadhwani
9fb3ee2247
add LocalDAOStorageConfigFactory for SCSI (#1985) 2020-11-05 11:37:45 -08:00
John Plaisted
b2e73fa003
test: improve test coverage for DatasetIndexBuilder. (#1971)
The other index builders need similar improvements, but after #1937 some coverage is better than no coverage.
2020-10-28 13:32:04 -07:00
Mars Lan
15c4d238f5
feat: enable default restli documentation (#1969)
Fixes https://github.com/linkedin/datahub/issues/1966
2020-10-28 08:05:00 -07:00
Kerem Sahin
1ec9f66b66
Bump to datahub-gma 0.1.0 (#1931) 2020-10-26 16:18:21 -07:00
John Plaisted
25b663cc18
refactor: move code to linkedin/datahub-gma. (#1955)
Move code to linkedin/datahub-gma.

"GMA" (Generalized Metadata Architecture) is the backend of DataHub, and has been moved to its own repository.

This deletes the code that was moved and uses jars that GMA publishes to bintray to load it.

Note that not all of GMA was moved, but most of it. We may still move more things to the other repository in the future.
2020-10-23 15:14:57 -07:00
Mars Lan
c413a2a69c
build: fix build by adding zookeeper dependency explicitly (#1900) 2020-09-29 06:00:47 -07:00
Lars Nielsen
b26d6fe880
Bump up kafkaAvroSerde to support SSL for Schema Registry (#1898)
* Bump up kafkaAvroSerde to support security config for Confluent Schema Registry

* Support certs in secrets

* Extra Spring config (e.g. security)

* Optional values

* Clarify log warnings

* Update faq.md

Co-authored-by: Lars Nielsen <Lars.Nielsen@kindredgroup.com>
Co-authored-by: Mars Lan <mars.th.lan@gmail.com>
2020-09-29 04:12:43 -07:00
John Plaisted
821bce7d69
feat: Port mce-cli to Java. (#1871)
Port mce-cli to Java.

Also moved off the avro format event file to json instead. Much nicer to use :)
2020-09-25 14:05:29 -07:00
John Plaisted
6ece2d6469
Start adding java ETL examples, starting with kafka etl. (#1805)
Start adding java ETL examples, starting with kafka etl.

We've had a few requests to start providing Java examples rather than Python due to type safety.

I've also started to add these to metadata-ingestion-examples to make it clearer these are *examples*. They can be used directly or as a basis for other things.

As we port to Java we'll move examples to contrib.
2020-09-11 13:04:21 -07:00
Grant Nicholas
f267c8c8e2
fix(gms): update kafka client libraries to a newer version to support schema registry basic auth + SSL (#1863) 2020-09-09 14:37:47 -07:00
Mars Lan
85a6cd1698
build(node): replace broken & unmaintained gradle node plugin (#1838) 2020-08-30 19:43:36 -07:00
John Plaisted
d9b86d1f05
Update metadata-models to head! (#1811)
metadata-models 80.0.0 -> 90.0.13:

   90.0.13: Roll forward: Fix the open source build by avoiding URN method that isn't part of the open source URN.
    90.0.2: Refactor listUrnsFromIndex method
    90.0.0: Start distinguishing between [] aspects vs null aspects input param
    89.0.4: Fix the open source build by avoiding URN method that isn't part of the open source URN.
    89.0.2: fix some test case name
    89.0.0: META-12686: Made the MXE_v5 topics become strictly ACL'ed to avoid the wildcard write ACL as "MetadataXEvent.+"
    88.0.6: change DAO to take Storage Config as input
    88.0.3: Add a comment on lack of avro generation for MXEv5 + add MXEv5 to the pegasus validation task.
   87.0.15: META-12651: Integrate the metadata-models-ext with metadata-models
   87.0.13: add StorageConfig to Local DAO
    87.0.3: Treat empty aspect vs optional aspect same until all clients are migrated
    87.0.2: Treat empty aspect vs optional aspect differently
    87.0.1: META-12533: Skip processing unregistered aspect specific MAE.
    83.0.6: action method to return list of urns from strong consistent index
    83.0.4: Change input param type for batch backfill
    83.0.3: Implement batch backfill
    83.0.1: Implement support for OR filter in browse query
   82.0.10: Throw UnsupportedOperationException for unsupported condition types in search filter
    82.0.6: Implement local secondary backfilling index as part of backfill method
    82.0.5: [strongly consistent index] implement getUrns method
    82.0.4: Add indexing urn fields to the local secondary index
    82.0.0: Render Delta fields in the MCE_v5.
    81.0.1: Add pegasus to avro conversion for FMCE
    80.0.4: add get all support for BaseSingleAspectEntitySimpleKeyResource
    80.0.2: Add a BaseSearchWriterDAO with an ESBulkWriterDAO implementation.
    80.0.1: META-12254: Produce aspect specific MAE with always emit option
    80.0.0: Convert getNodesInTraversedPath to getSubgraph to return complete view of the subgraph (nodes+edges)
2020-08-19 16:06:29 -07:00
Javier Sotelo
ad4c80a753
feat(kafka-config): Add ability to configure other Kafka props (#1745)
* Integrate spring-kafka & spring-boot for security props

- Upgrade spring-kafka to 2.1.14
- Use KafkaListener and KafkaTemplates to enable KafkaAutoConfiguration
- Integrates spring-boot's KafkaProperties into spring-kafka's config

* Cleanup imports

* Add DataHub kafka env vars

* Remove kafka-streams dependency

* Add KafkaProperties to gms; Add docs

* Add to Adoption

* Remove KAFKA_BOOTSTRAP_SERVER default

Co-authored-by: jsotelo <javier.sotelo@viasat.com>
Co-authored-by: Kerem Sahin <ksahin@linkedin.com>
2020-07-27 05:22:51 -07:00
Mars Lan
dec2ce9152
build(frontend): Drop unnecessary DB-related dependencies (#1741)
* refactor(frontend): Drop unnecessary DB-related dependencies

* Drop unused dependencies from top-level build script
2020-07-22 19:43:30 -07:00
Mars Lan
00d89115b2
feat(gms): add postgres & mariadb support to GMS (#1742)
* feat(gms): add postgres & mariadb support to GMS

Also add corresponding docker-compose files

* Update README.md
2020-07-22 19:39:58 -07:00
Jyoti Wadhwani
ad6f1653e1
metadata-models 62.0.3 -> 72.0.8 (#1693) 2020-06-11 10:21:51 -07:00
Jyoti Wadhwani
90e449d25e
upgrade the version of neo4j (#1691) 2020-06-04 22:14:57 -07:00
Jyoti Wadhwani
8a8dd3de77
move the gradle plugin version to top level build.gradle (#1685) 2020-06-01 19:00:19 -07:00
Jyoti Wadhwani
2d06e1de5e
Revert "upgrade version of certain dependencies (#1675)" (#1676)
This reverts commit 81508bcb835df4c46b399fd3029b13259930bcd6.
2020-05-19 18:31:41 -07:00
Jyoti Wadhwani
81508bcb83
upgrade version of certain dependencies (#1675) 2020-05-19 09:12:21 -07:00
Mars Lan
f932437742
build: start enforcing checkstyle and fix all violations (#1670) 2020-05-11 08:41:02 -07:00
Mars Lan
246b736ef5
build: update pegasus to v28 to add PDL support (#1664)
See https://github.com/linkedin/rest.li/issues/283 for new repo location
2020-05-08 12:00:28 -07:00
Mars Lan
d6fcefb75b
style: add checkstyle and IDEA code style config (#1662)
Currently allow checkstyle warnings until all violations have been fixed
2020-05-07 15:59:16 -07:00
Arun Vasudevan
e56e4eafd2
feat: Converting MCE to a Spring boot Application (#1629) 2020-04-06 10:16:41 -07:00
Kerem Sahin
1168501083
Enable tests for all modules by using global gradle config 2020-02-21 11:53:45 -08:00
Kerem Sahin
b17b91f24a
Bump gradle to 5.6.4 and pegasus to 27.7.18 2020-02-12 17:10:49 -08:00