266 Commits

Author SHA1 Message Date
Dexter Lee
669160a677
feat(run): Create a describe run endpoint for fetching aspects created by the ingestion run (#4964) 2022-05-24 14:46:36 -07:00
Justin Marozas
97bce80fc9
refactor(metadata-io): introduce a storage-independent in-memory entity aspect model (#4957) 2022-05-24 12:40:42 -07:00
RyanHolstien
c3cfaf8e3c
feat(telemetry): add server side telemetry (#4925)
Co-authored-by: Kevin Hu <kevinhuwest@gmail.com>
2022-05-17 11:37:45 -07:00
Pedro Silva
78c3ca039e
feat(gms): Add support for deleting reference pointers when deleting by urn (#4791) 2022-05-12 19:02:13 -07:00
RyanHolstien
65fb49f4b4
feat(versionedDataset): adds a versionStamp to timeline response & adds versionStamp param to dataset graphql (#4727) 2022-04-29 12:04:38 -07:00
Eddie (Xiao Di) Li
01a5b13a7a
feat(Cassandra): Add Cassandra implementation of EntityService (#3286) 2022-04-25 17:29:24 -07:00
Dexter Lee
46c15ef0a7
fix(search): Fix urn component settings (#4672)
* Fix urn component settings

* Add keyword subfield to all fields

* Fix tests
2022-04-14 15:40:03 -07:00
Dexter Lee
55f0412a33
feat(search): Add SearchScore annotation to use fields for search ranking (#4596)
* Add SearchScore annotation

* Add back test-model

* Remove search features

* Fix to John's comments

* simplify ranker

* Fix checkstyle
2022-04-07 11:07:27 -07:00
Dexter Lee
32324a6476
fix(search): Make sure home page and search pages are consistent (#4588)
* Centralize entity count cache

* Fix to John's comment

* Fix test

* Fix tests
2022-04-06 22:06:55 -07:00
John Joyce
c69310522b
feat(metadata service): Introducing Platform Events (#4477) 2022-03-29 18:32:04 -07:00
RyanHolstien
b843014c5e
Oss/urn validation (#4511) 2022-03-29 16:44:58 -07:00
Kevin Neville
d8e6f890a9
fix: Replace old repository link with new link (#4446) 2022-03-18 14:12:19 -07:00
Pedro Silva
aa593c32d8
Flexible search on soft delete (#4405)
* Adds filter logic to correct DB

* Fix build

* Adds documentation & fixes flag typo

* apply review comments

* Adds test for filtered search

* Adds warning log for redundant parameter combo
2022-03-16 16:35:04 -07:00
Pedro Silva
e8f6c4cabd
feat(cli) Changes rollback behaviour to apply soft deletes by default (#4358)
* Changes rollback behaviour to apply soft deletes by default

Summary:
Addresses feature request: Flag in delete command to only delete aspects touched by an ingestion run; add flag to nuke everything by modifying the default behaviour of a rollback operation which will not by default delete an entity if a keyAspect is being rolled-back.

Instead the key aspect is kept and a StatusAspect is upserted with removed=true, effectively making a soft delete.
Another PR will follow to perform garbage collection on these soft deleted entities.

To keep old behaviour, a new parameter to the cli ingest rollback endpoint: --hard-delete was added.

* Adds restli specs

* Fixes deleteAspect endpoint & adds support for nested transactions

* Enable regression test & fix docker-compose for local development

* Add generated quickstart

* Fix quickstart generation script

* Adds missing var env to docker-compose-without-neo4j

* Sets status removed=true when ingesting resources

* Adds soft deletes for ElasticSearch + soft delete flags across ingestion sub-commands

* Makes elastic search consistent

* Update tests with new behaviour

* apply review comments

* apply review comment

* Forces Elastic search to add documents with status removed false when ingesting

* Reset gradle properties to default

* Fix tests
2022-03-15 12:05:52 -07:00
Dexter Lee
ab0b5162af
fix(lineage): Move lineage registry to entity-registry module (#4412)
* Move lineage registry

* Fix test
2022-03-15 11:59:59 -07:00
Dexter Lee
a12c9d2a30
Add flag to enable cache (#4335) 2022-03-14 16:40:10 -07:00
Dexter Lee
424d357cfd
Fix lineage (#4326) 2022-03-07 09:36:49 -08:00
Dexter Lee
18dd5b6c13
feat(Impact Analysis): Support impact analysis to check all downstreams of given entity (#4322) 2022-03-04 16:10:25 -08:00
Dexter Lee
daab31d6ef
refactor(extractor): Move extractors to entity-registry (#4307)
* Move extractors to entity-registry

* Fix test

* Fix checkstyle
2022-03-03 16:57:32 -08:00
Dexter Lee
5d26c86a30
fix(search): Update urn tokenizer to tokenize on periods and slashes (#4085)
* Add period

* Update urn tokenizer

* Add tests
2022-03-03 10:37:10 -08:00
Ravindra Lanka
1ac20ee764
feat(platform): add timeline api for dataset entity (#4252) 2022-02-24 19:40:55 -08:00
HuYueeer
3a0fe44344
feat(search): supporting chinese glossaryterm full text retrieval(#3914) (#3956)
* feat(search): supporting chinese glossaryterm full text retrieval(#3914)

* refactor(search): modify mainTokenizer to appropriate position(#3914)

Co-authored-by: Shirshanka Das <shirshanka@apache.org>
2022-02-24 17:11:43 -08:00
stephenp-gr
fb80e6b7bc
Common ElasticsearchContainer creation (#4152) 2022-02-16 16:10:10 -08:00
ksrinath
4c24f386a6
feat(model): data quality model (#3787)
Co-authored-by: Ravindra Lanka <rlanka@acryl.io>
Co-authored-by: Mayuri N <mayuri.nehate@gslab.com>
2022-02-02 22:56:08 -08:00
senni0418
80c46f24ac
refactor(search): refactor NUM_RETRIES in esindexbuilder to be configurable (#3870) 2022-01-17 15:35:59 -08:00
Dexter Lee
b58609269d
fix(search): Fix issue with filters and autocomplete (#3868) 2022-01-12 13:01:36 -08:00
Aseem Bansal
a56f0661d5
perf(ingest): changes to improve ingest performance a bit (#3837) 2022-01-06 08:07:16 -08:00
RyanHolstien
9a237e9c92
fix(test): fix bug on verification for EbeanEntityService (#3829)
Co-authored-by: Ryan Holstien <ryan@acryl.io>
2022-01-05 14:05:54 -08:00
Dexter Lee
9d04aa1255
feat(mae/mcl): Make ingestAspect produce both MCLs and MAEs (#3737) 2022-01-04 18:24:30 -08:00
Dexter Lee
76f9423581
feat(nocode): Add get entities v2 endpoint that can get without snapshot (#3738) 2022-01-03 07:19:46 -08:00
Ravindra Lanka
4b81ad6262
feat(Stateful Ingestion): 1/3 Stateful ingestion server changes (#3749) 2021-12-15 10:32:58 -08:00
Gabe Lyons
3fd3313544
Revert "feat(graph): Make Dgraph a proper Neo4j alternative (#3578)" (#3740) 2021-12-14 10:49:03 -08:00
Dexter Lee
8e4769f40c
feat(retention): Add retention to Local DB (#3715) 2021-12-13 18:18:02 -08:00
Enrico Minack
a6deaabfcf
feat(graph): Make Dgraph a proper Neo4j alternative (#3578) 2021-12-13 12:37:59 -08:00
Dexter Lee
1f1467aac2
fix(home): Change docs count to not count removed datasets (#3711) 2021-12-09 22:51:06 -08:00
Dexter Lee
999e9e5b49
fix(search): Improve search and recs performance (#3660) 2021-12-03 18:13:11 -08:00
Pedro Silva
d5eddf52ab
fix(test): Adds docker engine configuration checks before running docker-based tests for metadata-io (#3654) 2021-12-02 12:42:43 -08:00
Shirshanka Das
842c05980b
feat(metadata-service): support for custom model extensions without forks (#3630) 2021-11-28 21:06:27 -08:00
Enrico Minack
031e0b9b91
feat(GraphService): Add Dgraph implementation of GraphService (#3261) 2021-11-15 10:33:30 -08:00
John Joyce
2f03ad854b
feat(recommendations): Recommendations infra P1 (#3455) 2021-10-26 21:23:08 -07:00
Dexter Lee
9b54242a6e
fix(datahub-upgrade): Update datahub upgrade to use MCL instead of MAE (#3411) 2021-10-18 22:52:48 -07:00
John Joyce
3d0c69b7d1
fix(metadata-service): add Rest.li Validation to ingestProposal (#3409) 2021-10-17 08:49:18 -07:00
John Joyce
61bd7fd080
feat(search): support Boolean OR Filters in Rest.li APIs (#3344) 2021-10-14 12:34:29 -07:00
Gabe Lyons
aeba0e9afe
Revert "revert "test(ElasticSearch): Retry ES requests" (#3385)" (#3392)
This reverts commit b16b090f2bf78e0a751490310b95db3dfc142bbf.
2021-10-13 18:53:47 -07:00
Gabe Lyons
b16b090f2b
revert "test(ElasticSearch): Retry ES requests" (#3385) 2021-10-13 14:37:51 -07:00
Enrico Minack
3753ff2298
test(elasticsearch): retry elasticsearch requests (#3377) 2021-10-12 15:41:34 -07:00
Dexter Lee
d0548408b4
feat(search): redesign search page - left filter pane (#3337) 2021-10-07 11:41:29 -07:00
Enrico Minack
c0aa3ecb4b
test(GraphService): Thorough graph service tests (#3011) 2021-09-17 13:54:24 -07:00
rslanka
c418bc845c
feat(Analytics): Support for Timeseries Aggregated Statistics (#3207)
Co-authored-by: Gabe Lyons <itsgabelyons@gmail.com>
Co-authored-by: Dexter Lee <dexter@acryl.io>
2021-09-14 18:35:10 -07:00
Enrico Minack
66c862fea9
test(metadata-io): Improve speed of ElasticSearch tests (#3160) 2021-09-13 14:37:29 -07:00