78 Commits

Author SHA1 Message Date
RyanHolstien
34c27f076b
feat(removeGMA): remove all dependencies on gma libraries (#3835) 2022-01-05 17:32:31 -08:00
xiphl
8cd1e91072
Upgrade to 3rd Apache patch for log4j (#3772) 2021-12-20 06:55:22 -08:00
John Joyce
5b5135be0b
fix(vuln): log4j vulnerability - bumping to 2.16.0 (#3755) 2021-12-15 11:07:45 -08:00
Fredrik Sannholm
d651040c85
Fix vulnderability (#3716) 2021-12-10 10:07:55 -08:00
Claudio Benfatto
f9bc3b32c4
fix(metadata-service): fix debug logging in MAE producer (#3626)
closes: https://github.com/linkedin/datahub/issues/3625
2021-11-28 21:07:42 -08:00
John Joyce
a92ab66a3a
refactor(nocode): Final part of No-Code cleanup (#3477) 2021-10-31 22:06:36 -07:00
Dexter Lee
8747fbe43c
feat(perf): Add perf testing and monitoring framework (#3195) 2021-09-07 23:06:15 -07:00
John Joyce
f3fc0970f3
refactor(build): Remove unnecessary ext modules. (#3074) 2021-08-10 22:48:06 -07:00
John Joyce
20b1685de2
fix(gms): better logging on failed MCL / MAE (#3007) 2021-08-02 17:53:56 -07:00
John Joyce
352a0abf8d
Introducing TimeSeries Aspects + Dataset Profile (Stats) Aspect (#2983)
Co-authored-by: Dexter Lee <dexter@acryl.io>
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
Co-authored-by: Ravindra Lanka <rlanka@acryl.io>
Co-authored-by: Shirshanka Das <shirshanka@apache.org>
2021-07-30 17:41:03 -07:00
Gabe Lyons
aa253f5b3b
feat(deletes): add run commands (list, show, rollback) to datahub ingest (#2960) 2021-07-29 20:04:40 -07:00
John Joyce
09cbc548a4
feat(logs): improve logging in GMS and datahub-frontend (#2761) 2021-06-25 10:56:45 -07:00
John Joyce
97e9660037
feat: No Code Metadata Modeling (#2629)
Co-authored-by: Dexter Lee <dexter@acryl.io>
Co-authored-by: Gabe Lyons <itsgabelyons@gmail.com>
Co-authored-by: Shirshanka Das <shirshanka@apache.org>
2021-06-03 13:24:33 -07:00
shakti-garg
8ed14a62e2
feat(business_glossary): add new entity business term and its relationship with dataset and its fields (#2228)
Co-authored-by: shubham.garg <shubham.garg@thoughtworks.com>
2021-05-10 13:20:23 -07:00
Dexter Lee
fa015c5aaa
fix(kafka-topic-convention): Fix DAOs that do not refer to TopicConvention (#2387) 2021-04-13 07:58:31 -07:00
Fredrik Sannholm
e2d6adc906
fix(datajob): Fix URI templates for datajob and dataflow (#2324) 2021-03-31 12:27:43 -07:00
Fredrik Sannholm
b02c7a345c
fix(tags): Support creating tags with MCE (#2320) 2021-03-31 11:16:12 -07:00
Gabe Lyons
039fe597f7
feat(tags): editing tags from react client on datasets, schemas, charts & dashboards (#2248) 2021-03-18 11:52:14 -07:00
Fredrik Sannholm
da6b3d111d
feat(datajob): Backend implementation (#2197) 2021-03-13 08:00:44 -08:00
RyanHolstien
ea86ade29b
feat: ML Model Backend Implementation (#1896)
Co-authored-by: RyanHolstien <rholstien@expediagroup.com>
2021-02-17 13:28:13 -08:00
Nagarjuna Kanamarlapudi
f9d33f5519
(refactor): Convert dataPlatforms to GMA aspect models and associated resource to GMA resource. (#2057)
* (refactor): Convert dataPlatforms to GMA aspect and associated resource to GMA resource.

BREAKING CHANGE: /datasets/dataPlatforms API is now changed to become GMA resource.

* Change documentation style
2021-01-20 15:50:48 -08:00
Kerem Sahin
4d8320e4a0
feat(dashboard): Dashboards backend implementation (#1884) 2020-11-23 09:25:58 -08:00
John Plaisted
25b663cc18
refactor: move code to linkedin/datahub-gma. (#1955)
Move code to linkedin/datahub-gma.

"GMA" (Generalized Metadata Architecture) is the backend of DataHub, and has been moved to its own repository.

This deletes the code that was moved and uses jars that GMA publishes to bintray to load it.

Note that not all of GMA was moved, but most of it. We may still move more things to the other repository in the future.
2020-10-23 15:14:57 -07:00
John Plaisted
2f86cd680e
[BREAKING] Break dependency of ebean-dao on metadata-models. (#1895)
The coupling was between the static path extractor API. This broken by making a new `UrnPathExtractor` interface, and adding an overload of `EbeanLocalDAO`'s constructor to accept one (no breaking constructor change). The old constructors default to an `EmptyPathExtractor`, which does nothing (which is a breaking behavioral change, see below).

BREAKING: `DatasetUrnPathExtractor` was deleted. No one should've been depending on this directly. However, downstreams that were relying on it being there at runtime (dataset GMS) need to copy `DatasetUrnPathExtractor` and create their `EbeanLocalDAO` with one. Note that this is a little dangerous becasue it is a runtime behavioral change only. Potential impact is that SCSI suddenly stops working as intended.

SYNC=metadata-models_101.0.0
2020-09-28 12:30:34 -07:00
Jyoti Wadhwani
dc9c877984 [scsi] preserve the order of urns 2020-09-24 16:02:12 -07:00
John Plaisted
542ae67cb1 Add support for customizing topic names via a convention.
Requested by a few people in OS. See https://github.com/linkedin/datahub/issues/1840.

Companies need full customization over the topic name. This new class should be easily customizable by using a spring factory.

TODO to finish the implmentation for v5. For right now v5 is LI only and unfinished. Getting this in for v4 so it is useful to other companies now.

TODO AFTER OPEN SOURCE PUSH - make configurable via spring
TODO AFTER SUBMIT - see where else we can use this (jobs, where else?)
2020-09-24 16:02:12 -07:00
Jyoti Wadhwani
b944910b0e extend filter finder method to get metadata from SCSI 2020-09-24 16:02:12 -07:00
Jyoti Wadhwani
c3c52cf8f6 add support for getWithExtraInfo in BaseLocalDAO 2020-09-24 16:02:12 -07:00
John Plaisted
8d536a54d3 Break dependency of metadata-dao on metadata-models.
This also breaks a few others transitively.
2020-09-24 16:02:12 -07:00
John Plaisted
e4ce0376d2 Fix open source build.
Do not use internal URN API.
2020-09-24 16:02:12 -07:00
John Plaisted
96da83033c Break dependency of metadata-test-utils on metadata-models. 2020-09-24 16:02:12 -07:00
Jyoti Wadhwani
caf22dc921 add support for conditions other than equals 2020-09-24 16:02:12 -07:00
John Plaisted
8223cdcbdb Fix build after merge:
- Add commonsLang to build file.
- Add emails field to CorpUserInfoDocument (either this should be synced or the index builder not synced in the future).
- Fix EbeanLocalDAOTest which used internal Urn API.
- Fix BaseSearchableEntityResource "backfill" override return types (and regenerate snapshots).
- EbeanlocalDAO's constructor changed; now requires URN class.
- Add restli resource module as dependency of :gms:api as it now contains a needed PDL model.
2020-09-11 09:15:56 -07:00
Kerem Sahin
411e0a8fee Implementing a truly free-form query interface on Neo4j query DAO 2020-09-11 09:15:56 -07:00
Kerem Sahin
46d8f4f05d Update backfill API to allow for backfill modes 2020-09-11 09:15:56 -07:00
John Plaisted
6b9a053f6e ROLL FORWARD: Add new style checks and fix issues.
- Upgrade to checkstyle 8
- Copy javadoc checks from Google
- Disable missing class and method checks for now, too many warnings. I'll have to figure out how to suppress them instead.
- Fix other issues, which are mostly missing periods at the end of sentences and lack of paragraph tags.

Revert "Reverting the commit range: 8dfdb73ac6c73581ef56c0d81c21a2a92e8a1a02..194bd6f57f4a4d075d2ea1f442397d1139080f7a."

This reverts commit ab178ec1469fa72c0c339f0b842e7ff0850e7c74.
2020-09-11 09:15:56 -07:00
Jyoti Wadhwani
f277a49dd0 [scsi] add resilience to EbeanLocalDAO 2020-09-11 09:15:56 -07:00
Jyoti Wadhwani
a3a892f5ec change listUrns to return typed urns 2020-09-11 09:15:56 -07:00
Chris Lee
679069e16f Made the log to WARN for the v5 producer early termination. 2020-09-11 09:15:56 -07:00
John Plaisted
6ac7622af6 Reverting the commit range: 8dfdb73ac6c73581ef56c0d81c21a2a92e8a1a02..194bd6f57f4a4d075d2ea1f442397d1139080f7a.
REVERTED RB=99999 PCVALIDATIONOVERRIDE I18NOVERRIDE CIOVERRIDE TRUNKBLOCKERFIX

See https://crt.prod.linkedin.com/#/testing/executions/77e10182-d60f-4c8d-9e55-599bdc4384e0/execution for more details.
2020-09-11 09:15:56 -07:00
John Plaisted
b9f11ae21b Add new style checks and fix issues.
- Upgrade to checkstyle 8
- Copy javadoc checks from Google
- Disable missing class and method checks for now, too many warnings. I'll have to figure out how to suppress them instead.
- Fix other issues, which are mostly missing periods at the end of sentences and lack of paragraph tags.
2020-09-11 09:15:56 -07:00
John Plaisted
08f2ce61d9 Free form API for getPaths
Context:
We will need to perform customized query to get info from Neo4j such as get paths with only unremoved dataset, get paths for field-path lineage.

What are changed:
- Adding a new API in Neo4jQueryDAO so that we could perform a customized query from API user side(GMS)
2020-09-11 09:15:56 -07:00
Jyoti Wadhwani
02ffe16624 [scsi] support for multiple filters 2020-09-11 09:15:56 -07:00
Jyoti Wadhwani
8813ed9c86 save aspect fields to index table 2020-09-11 09:15:56 -07:00
Kerem Sahin
64e5160365 Implement backfill API which uses SCSI 2020-09-11 09:15:56 -07:00
John Plaisted
55344330a1 [metadata-models] support range filter for ESSearchDAO 2020-09-11 09:15:56 -07:00
Mars Lan
7d6fde4f37
feat: add MCE ingestion support for CorpGroup (#1837)
* feat: add MCE ingestion support for CorpGroup

Also use consistent camel case for corp user URNs in bootstrap MCE data

Fixes https://github.com/linkedin/datahub/issues/1822
2020-08-31 10:08:58 -07:00
John Plaisted
d9b86d1f05
Update metadata-models to head! (#1811)
metadata-models 80.0.0 -> 90.0.13:

   90.0.13: Roll forward: Fix the open source build by avoiding URN method that isn't part of the open source URN.
    90.0.2: Refactor listUrnsFromIndex method
    90.0.0: Start distinguishing between [] aspects vs null aspects input param
    89.0.4: Fix the open source build by avoiding URN method that isn't part of the open source URN.
    89.0.2: fix some test case name
    89.0.0: META-12686: Made the MXE_v5 topics become strictly ACL'ed to avoid the wildcard write ACL as "MetadataXEvent.+"
    88.0.6: change DAO to take Storage Config as input
    88.0.3: Add a comment on lack of avro generation for MXEv5 + add MXEv5 to the pegasus validation task.
   87.0.15: META-12651: Integrate the metadata-models-ext with metadata-models
   87.0.13: add StorageConfig to Local DAO
    87.0.3: Treat empty aspect vs optional aspect same until all clients are migrated
    87.0.2: Treat empty aspect vs optional aspect differently
    87.0.1: META-12533: Skip processing unregistered aspect specific MAE.
    83.0.6: action method to return list of urns from strong consistent index
    83.0.4: Change input param type for batch backfill
    83.0.3: Implement batch backfill
    83.0.1: Implement support for OR filter in browse query
   82.0.10: Throw UnsupportedOperationException for unsupported condition types in search filter
    82.0.6: Implement local secondary backfilling index as part of backfill method
    82.0.5: [strongly consistent index] implement getUrns method
    82.0.4: Add indexing urn fields to the local secondary index
    82.0.0: Render Delta fiels in the MCE_v5.
    81.0.1: Add pegasus to avro conversion for FMCE
    80.0.4: add get all support for BaseSingleAspectEntitySimpleKeyResource
    80.0.2: Add a BaseSearchWriterDAO with an ESBulkWriterDAO implementation.
    80.0.1: META-12254: Produce aspect specific MAE with always emit option
    80.0.0: Convert getNodesInTraversedPath to getSubgraph to return complete view of the subgraph (nodes+edges)
2020-08-19 16:06:29 -07:00
Jyoti Wadhwani
779eaeed70
metadata-models 72.0.8 -> 80.0.0 (#1756) 2020-07-29 11:42:35 -07:00
Kerem Sahin
2e2fb2b810
Add missing updates from recent internal push (#1700) 2020-06-12 12:55:50 -07:00