46 Commits

Author SHA1 Message Date
Harshal Sheth
5bc8a895f9
chore(ingest): remove calls to deprecated methods (#13009) 2025-03-28 13:42:54 -07:00
cccs-cat001
cb67726ab4
fix(ingest): handle groups in pattern_cleanup_ownership transformer (#12536) 2025-02-24 14:47:46 -08:00
Harshal Sheth
ce6474df5a
chore(ingest): remove deprecated calls to Urn.create_from_string (#11983) 2024-12-02 09:53:13 -08:00
Aseem Bansal
02f0a3dee7
feat(ingest/transform): extend ownership transformer to other entities (#11700) 2024-10-29 15:28:41 +05:30
Andrew Sikowitz
72d1236669
feat(ingest/transformer/domain): Add support for on conflict do nothing to dataset domain transformers (#11649) 2024-10-18 12:01:39 -07:00
Harshal Sheth
d34717fd82
fix(ingest): remove default value from DatahubClientConfig.server (#11570) 2024-10-16 13:50:33 -07:00
sagar-salvi-apptware
660fbf8e57
fix(ingestion/transformer): Add container support for ownership and domains (#11375) 2024-10-01 11:39:07 -07:00
Harshal Sheth
3755731f0e
chore(ingest): improve code formatting (#11326) 2024-09-11 10:48:57 -07:00
sagar-salvi-apptware
da72ba2113
fix(ingestion/transformer): replace the externalUrl container (#11013) 2024-07-30 15:17:04 +05:30
Shubham Jagtap
b6c7fe8267
refactor(ingestion): remove company domain for security reason (#10839) 2024-07-08 21:15:20 -07:00
sagar-salvi-apptware
640d42dc65
feat(ingest/transformer): tags to terms transformer (#10758)
Co-authored-by: Aseem Bansal <asmbansal2@gmail.com>
2024-07-02 15:30:05 +05:30
sagar-salvi-apptware
5fbf781558
fix(ingest/transformer): Add dataset domains based on tags using transformer (#10458) 2024-05-15 14:13:03 +05:30
dushayntAW
c00ddb2a0d
fix(ingest/transformer): new transformer to clean user URN for datasetUsageStatistics aspect (#10398) 2024-05-03 13:24:48 +05:30
dushayntAW
f860f7907d
fix(ingest/transformer): replace externalUrl in dataset properties (#10281) 2024-04-15 20:14:42 +05:30
Shubham Jagtap
fa139a582e
feat(ingestion/transformer): Handle overlapping while mapping in extract ownership from tags transformer (#10201) 2024-04-04 12:19:11 -07:00
Shubham Jagtap
9f2c5d36f3
feat(ingestion/bigquery): BigQuery Owner Label to Datahub Ownership (#10047) 2024-03-28 15:50:25 -07:00
dushayntAW
dd502ae662
fix(ingest): added new transformer to cleanup suffix/prefix in owner URN (#10067) 2024-03-22 15:23:03 +05:30
sid-acryl
874c683f2a
fix(ingest/transformer): generate ownership aspect from handle_end_of_stream (#9720) 2024-01-31 14:25:08 -08:00
Shubham Jagtap
cb8002413e
feat(ingestion/transformer): Add dataset dataproduct transformer (#9491)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2024-01-05 17:03:22 -05:00
siddiquebagwan-gslab
0d6a5e5df2
feat(ingestion/transformer): create tag if not exist (#9076) 2023-12-14 10:36:28 -05:00
skrydal
a495d652e0
feat(ingestion): Add typeUrn handling to ownership transformers (#9370) 2023-12-13 14:34:20 -05:00
Aseem Bansal
ad313ad282
feat(transfomer): add transformer to get ownership from tags (#8748) 2023-10-04 14:06:03 +05:30
Aseem Bansal
9df70d7355
ingest(elasticsearch): add basic profiling (#8351) 2023-07-20 08:25:30 +05:30
Harshal Sheth
e99875cac6
chore(ingest): enable flake8 bugbear linting (#7763) 2023-04-10 14:14:42 -07:00
Harshal Sheth
89734587f7
feat(ingest): add urn modification helper (#7440)
Co-authored-by: Tamas Nemeth <treff7es@gmail.com>
2023-03-16 13:27:08 -07:00
Andrew Sikowitz
8901498582
fix(transformers): pattern add domain transformer - enable replace_existing (#7317) 2023-02-13 12:52:44 -08:00
Harshal Sheth
f651646d3d
chore(ingest): remove inferred args to MCPW, part 2 (#6905) 2023-01-04 23:29:56 -05:00
Tamas Nemeth
e41b455e14
fix(ingest): bigquery - sharded table support improvements (#6789) 2022-12-19 18:57:37 +01:00
Harshal Sheth
74cc88f2df
fix(ingest): correctly handle transformer patch semantics (#6505) 2022-11-22 09:29:57 -08:00
Harshal Sheth
1a81d8de6a
feat(ingest): supports MCEs in domain transformer (#6364) 2022-11-05 11:41:43 -07:00
mohdsiddique
2f65e2f226
feat(transformers): Add semantics & transform_aspect support in transformers (#5514)
Co-authored-by: MohdSiddique Bagwan <mohdsiddique.bagwan@gslab.com>
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2022-09-06 14:44:14 -07:00
Harshal Sheth
9790f3cefa
feat(ingest): infer aspectName from aspect type in MCP (#5566) 2022-08-07 07:52:58 -07:00
Harshal Sheth
480fb2e770
test(ingestion): change class names to avoid unittest warnings (#5005) 2022-05-26 22:09:40 +05:30
Gabe Lyons
a8079ca163
feat(transformers): add transformers to provide tags & terms to schema fields based on regex patterns (#4936)
* add tag & term transformers for schemas

* added documentation

* lint fixes

* add clarification that only first set of matching terms is applied
2022-05-18 16:03:34 -07:00
Swaroop Jagadish
35b187a8d4
feat(ingest): transformers - add support for processing MCP-s (#4337) 2022-03-07 13:14:29 -08:00
Sergio Gómez Villamor
3fa22e1138
feat(transform): adds simple add dataset properties transform (#3778) 2021-12-22 08:51:38 -08:00
ecooklin
1a5121a5ae
feat(ingest): adds glossary terms transformer (#3657) 2021-12-07 21:54:15 -08:00
Noé López
cde1ce043d
feat(ingest): add transformer to add properties (#3480) 2021-11-09 20:03:21 -08:00
Swaroop Jagadish
ec406c7928
feat(ingest): read-modify-write capabilities during ingestion, adding ownership patch transformer (#3506) 2021-11-03 21:39:52 -07:00
Sergio Gómez Villamor
696ad2ccc4
feat(ingest): adds replace_existing config to browse path transformer (#3313) 2021-10-01 12:35:16 -07:00
Sergio Gómez Villamor
9e28423c6d
feat(ingest): add optional config for ownership type in ownership transformers (#3159)
Co-authored-by: Sergio Gómez <sergio.gomez.villamor@adevinta.com>
2021-09-01 22:44:03 -07:00
Rahul Jain
0e0198f385
feat(ingest): Added support for "add dataset ownership by regex match" (#2990) 2021-08-10 11:03:16 -07:00
aseembansal-gogo
9ee091f518
feat(ingest): add transformers to clear dataset ownership, mark status, add browse paths (#2986) 2021-08-09 09:43:46 -07:00
Harshal Sheth
220dfe728c
feat(ingest): support dynamic imports for transfomer methods (#2858) 2021-07-12 11:03:53 -07:00
Harshal Sheth
2f921d15e8
fix(ingest): avoid setting timestamps unless source system provides it (#2843) 2021-07-08 12:11:06 -07:00
Harshal Sheth
1d4bcbe4fb
feat(ingest): add dataset tag transformer (#2580) 2021-05-18 14:43:43 -07:00