1646 Commits

Author SHA1 Message Date
Atanu Chatterjee
487d52ad6e
fix(openapi_parser): add ability to parse openapi 3.0+ schemas (#13624) 2025-06-01 18:52:52 +02:00
Hyejin Yoon
a142a9e2d2
feat(sdk): add dataflow and datajob entity (#13551) 2025-05-29 22:53:56 +09:00
david-leifker
492b55322f
feat(tracing): trace error log with timestamp & update system-metadata (#13628) 2025-05-27 17:33:40 -05:00
Andrew R Smith
647fb792de
feat(ingest): add snowflake ingestion config options (#12841)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2025-05-27 09:22:16 -07:00
Sergio Gómez Villamor
d8d1de431f
fix(iceberg): update MinIO client commands for compatibility (#13631)
Co-authored-by: Claude <noreply@anthropic.com>
2025-05-27 11:15:41 +02:00
David Leifker
b956132e3c Revert "feat(tracing): python logging & update system metadata on no-op"
This reverts commit 64f315eb64417467cfef5cec473068dc42886293.
2025-05-26 12:15:18 -05:00
David Leifker
64f315eb64 feat(tracing): python logging & update system metadata on no-op 2025-05-26 10:22:51 -05:00
Tamas Nemeth
2ffa84be5c
fix(ingest/datahub): Create Structured property templates in advance and batch processing (#13355)
Co-authored-by: Pedro Silva <pedro@acryl.io>
2025-05-26 14:05:17 +02:00
Sergio Gómez Villamor
7bd7a33d0b
feat(hex): consider additional context when parsing hex query metadata (#13596) 2025-05-23 08:01:04 +02:00
Harshal Sheth
7e60587dec
fix(cli): strictly validate structured property values (#13576)
Co-authored-by: Chakravarthy Racharla <chakru.racharla@acryl.io>
2025-05-21 08:36:50 -07:00
Tamas Nemeth
9fca1737ff
fix(ingest/dbt): Fix urn validation in ownership type check (#13563) 2025-05-21 13:02:26 +02:00
Tamas Nemeth
0eca4dfde2
fix(ingest/hive): Fix hive storage path formats (#13536) 2025-05-19 16:17:28 +02:00
Sergio Gómez Villamor
8cae980286
tests(ingestion): moving some tests so they are available for sdk users (#13540) 2025-05-19 08:39:53 +02:00
Jonny Dixon
132ff7081f
feat(ingestion/s3): Add externalUrls for datasets in s3 and gcs (#12763) 2025-05-17 17:03:40 +01:00
Harshal Sheth
d3944ded93
feat(ingest/snowflake): generate lineage through temp views (#13517) 2025-05-16 21:27:13 -07:00
Harshal Sheth
9a892c6eca
feat(ingest): improve join extraction (#13502) 2025-05-14 09:50:17 -07:00
Jonny Dixon
c756af31b1
feat(ingestion/looker): extract group_labels from looker and add as tags in datahub (#13503) 2025-05-14 13:08:13 +01:00
Michael Minichino
bc860181d8
fix(ingest/mode): Additional pagination and timing metrics (#13497)
Co-authored-by: NehaGslab <neha.marne@gslab.com>
2025-05-13 08:48:38 -05:00
Sergio Gómez Villamor
184fb09fc0
fix(mssql): improve stored proc lineage + add temporary_tables_pattern config (#13415)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2025-05-13 10:36:52 +02:00
Harshal Sheth
7c791db087
feat(ingest/sql): column logic + join extraction (#13426) 2025-05-12 17:19:44 -07:00
Gabe Lyons
aeda8f4c95
feat(cassandra): Support ssl auth with cassandra (#13465) 2025-05-12 13:15:49 -04:00
Sergio Gómez Villamor
fb7bcbaf17
tests(ingestion): fixes hex and hive docker flakiness (#13476) 2025-05-11 19:06:23 +02:00
Anthony Burdi
f71ff7722a
fix(sdk): use pluralized assertions (#13481) 2025-05-09 16:19:50 -04:00
Tamas Nemeth
60f79153ac
fix(ingest/hive): Fix hive properties with double colon (#13478) 2025-05-09 19:59:40 +02:00
Kevin Karch
4c77c71315
feat(ingest): filter by database in superset and preset (#13409) 2025-05-09 09:51:59 -04:00
Hyejin Yoon
a414bbb798
feat(sdk): add datajob lineage & dataset sql parsing lineage (#13365) 2025-05-09 10:20:48 +09:00
Harshal Sheth
926bb3ceba
chore(ingest): bump bounds on cooperative timeout test (#13449) 2025-05-08 12:12:21 +05:30
Harshal Sheth
287f373a9c
fix(ingest/snowflake): fix previously broken tests (#13428) 2025-05-06 10:19:25 -07:00
Anthony Burdi
294ad23500
feat(sdk): scaffold assertion client (#13362)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2025-05-06 10:51:34 -04:00
skrydal
65d1c2b43c
feat(ingestion): Make jsonProps of schemaMetadata less verbose (#13416) 2025-05-06 16:18:26 +02:00
Harshal Sheth
cb3988a5f3
feat(ingest): associate queries with operations (#13404) 2025-05-05 11:27:33 -07:00
Harshal Sheth
2e3328fce0
chore(ingest): bump sqlglot dep (#13411) 2025-05-05 09:24:50 -07:00
david-leifker
4e7bb3998d
feat(ingestion): refactor api-tracing EmitMode (#13397) 2025-05-05 10:54:31 -05:00
Harshal Sheth
f83460255a
feat(ingest): add urn -> url helpers (#13410) 2025-05-02 19:54:01 -07:00
Harshal Sheth
b7ef234bc7
fix(ingest): fix deps for fivetran (#13385) 2025-05-02 12:31:07 -07:00
Aseem Bansal
03531520ce
fix(ingest/dynamodb): put primary keys correctly (#13373) 2025-05-02 15:25:34 +05:30
David Leifker
e6babc3b81 Revert "feat(ingestion): refactor api-tracing EmitMode"
This reverts commit bf598aed9687e9b08ccfbd72257fc890b505d775.
2025-05-01 21:06:10 -05:00
David Leifker
bf598aed96 feat(ingestion): refactor api-tracing EmitMode
* Created EmitMode to control write guarantees
    * IMMEDIATE, QUEUE, BLOCKING_QUEUE
2025-05-01 20:30:21 -05:00
david-leifker
8919154f02
fix(ingestion): fix cloud vs core logic (#13387) 2025-05-01 16:16:41 -05:00
david-leifker
d25d318233
feat(ingestion-sdk): OpenAPI & Tracing With SDK (#13349)
Co-authored-by: Andrew Sikowitz <andrew.sikowitz@acryl.io>
2025-04-30 21:09:12 -05:00
Harshal Sheth
34e74d826e
fix(ingest): update dremio golden files (#13381) 2025-04-30 17:45:54 -05:00
Harshal Sheth
d264a7afba
feat(ingest/dbt): make catalog.json optional (#13352) 2025-04-29 10:39:53 -07:00
Hyejin Yoon
f986315582
doc: Acryl to DataHub, datahubproject.io to datahub.com (#13252)
Co-authored-by: Jay <159848059+jayacryl@users.noreply.github.com>
2025-04-28 10:34:33 -04:00
Jonny Dixon
9c718c870e
feat(ingestion/neo4j): Add stateful_ingestion and platform_instance capabilities to connector (#12631)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2025-04-26 11:52:20 +01:00
Michael Minichino
fee67788a3
fix(ingest/mode): Add pagination and warn on missing reports (#13322) 2025-04-25 18:21:27 -05:00
david-leifker
9b0634805a
feat(ingestion-openapi): patch support (#13282)
Co-authored-by: Sergio Gómez Villamor <sgomezvillamor@gmail.com>
2025-04-25 13:54:28 -05:00
Sergio Gómez Villamor
1c5b7c18fc
chore(ingestion): removes ignore for SIM117 ruff rule (#13295) 2025-04-23 15:55:46 +02:00
Sergio Gómez Villamor
1563b0e9fb
fix(ingestion): use default generate_browse_path_v2 even if no pipeline_config (#13117) 2025-04-23 13:25:58 +02:00
Aseem Bansal
1de5fb3e6f
fix(cli): redact more secrets (#13287) 2025-04-23 14:49:26 +05:30
Sergio Gómez Villamor
a8637abfe2
tests(kafka): fixing flaky tests (#13171) 2025-04-22 12:58:47 +02:00