1638 Commits

Author SHA1 Message Date
Harshal Sheth
7cbe5ca7f2
feat(ingest): enable use_queries_v2 by default for snowflake/bigquery 2025-05-22 11:14:54 -07:00
Harshal Sheth
7e60587dec
fix(cli): strictly validate structured property values (#13576)
Co-authored-by: Chakravarthy Racharla <chakru.racharla@acryl.io>
2025-05-21 08:36:50 -07:00
Tamas Nemeth
9fca1737ff
fix(ingest/dbt): Fix urn validation in ownership type check (#13563) 2025-05-21 13:02:26 +02:00
Tamas Nemeth
0eca4dfde2
fix(ingest/hive): Fix hive storage path formats (#13536) 2025-05-19 16:17:28 +02:00
Sergio Gómez Villamor
8cae980286
tests(ingestion): moving some tests so they are available for sdk users (#13540) 2025-05-19 08:39:53 +02:00
Jonny Dixon
132ff7081f
feat(ingestion/s3): Add externalUrls for datasets in s3 and gcs (#12763) 2025-05-17 17:03:40 +01:00
Harshal Sheth
d3944ded93
feat(ingest/snowflake): generate lineage through temp views (#13517) 2025-05-16 21:27:13 -07:00
Harshal Sheth
9a892c6eca
feat(ingest): improve join extraction (#13502) 2025-05-14 09:50:17 -07:00
Jonny Dixon
c756af31b1
feat(ingestion/looker): extract group_labels from looker and add as tags in datahub (#13503) 2025-05-14 13:08:13 +01:00
Michael Minichino
bc860181d8
fix(ingest/mode): Additional pagination and timing metrics (#13497)
Co-authored-by: NehaGslab <neha.marne@gslab.com>
2025-05-13 08:48:38 -05:00
Sergio Gómez Villamor
184fb09fc0
fix(mssql): improve stored proc lineage + add temporary_tables_pattern config (#13415)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2025-05-13 10:36:52 +02:00
Harshal Sheth
7c791db087
feat(ingest/sql): column logic + join extraction (#13426) 2025-05-12 17:19:44 -07:00
Gabe Lyons
aeda8f4c95
feat(cassandra): Support ssl auth with cassandra (#13465) 2025-05-12 13:15:49 -04:00
Sergio Gómez Villamor
fb7bcbaf17
tests(ingestion): fixes hex and hive docker flakiness (#13476) 2025-05-11 19:06:23 +02:00
Anthony Burdi
f71ff7722a
fix(sdk): use pluralized assertions (#13481) 2025-05-09 16:19:50 -04:00
Tamas Nemeth
60f79153ac
fix(ingest/hive): Fix hive properties with double colon (#13478) 2025-05-09 19:59:40 +02:00
Kevin Karch
4c77c71315
feat(ingest): filter by database in superset and preset (#13409) 2025-05-09 09:51:59 -04:00
Hyejin Yoon
a414bbb798
feat(sdk): add datajob lineage & dataset sql parsing lineage (#13365) 2025-05-09 10:20:48 +09:00
Harshal Sheth
926bb3ceba
chore(ingest): bump bounds on cooperative timeout test (#13449) 2025-05-08 12:12:21 +05:30
Harshal Sheth
287f373a9c
fix(ingest/snowflake): fix previously broken tests (#13428) 2025-05-06 10:19:25 -07:00
Anthony Burdi
294ad23500
feat(sdk): scaffold assertion client (#13362)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2025-05-06 10:51:34 -04:00
skrydal
65d1c2b43c
feat(ingestion): Make jsonProps of schemaMetadata less verbose (#13416) 2025-05-06 16:18:26 +02:00
Harshal Sheth
cb3988a5f3
feat(ingest): associate queries with operations (#13404) 2025-05-05 11:27:33 -07:00
Harshal Sheth
2e3328fce0
chore(ingest): bump sqlglot dep (#13411) 2025-05-05 09:24:50 -07:00
david-leifker
4e7bb3998d
feat(ingestion): refactor api-tracing EmitMode (#13397) 2025-05-05 10:54:31 -05:00
Harshal Sheth
f83460255a
feat(ingest): add urn -> url helpers (#13410) 2025-05-02 19:54:01 -07:00
Harshal Sheth
b7ef234bc7
fix(ingest): fix deps for fivetran (#13385) 2025-05-02 12:31:07 -07:00
Aseem Bansal
03531520ce
fix(ingest/dynamodb): put primary keys correctly (#13373) 2025-05-02 15:25:34 +05:30
David Leifker
e6babc3b81
Revert "feat(ingestion): refactor api-tracing EmitMode"
This reverts commit bf598aed9687e9b08ccfbd72257fc890b505d775.
2025-05-01 21:06:10 -05:00
David Leifker
bf598aed96
feat(ingestion): refactor api-tracing EmitMode
* Created EmitMode to control write guarantees
    * IMMEDIATE, QUEUE, BLOCKING_QUEUE
2025-05-01 20:30:21 -05:00
david-leifker
8919154f02
fix(ingestion): fix cloud vs core logic (#13387) 2025-05-01 16:16:41 -05:00
david-leifker
d25d318233
feat(ingestion-sdk): OpenAPI & Tracing With SDK (#13349)
Co-authored-by: Andrew Sikowitz <andrew.sikowitz@acryl.io>
2025-04-30 21:09:12 -05:00
Harshal Sheth
34e74d826e
fix(ingest): update dremio golden files (#13381) 2025-04-30 17:45:54 -05:00
Harshal Sheth
d264a7afba
feat(ingest/dbt): make catalog.json optional (#13352) 2025-04-29 10:39:53 -07:00
Hyejin Yoon
f986315582
doc: Acryl to DataHub, datahubproject.io to datahub.com (#13252)
Co-authored-by: Jay <159848059+jayacryl@users.noreply.github.com>
2025-04-28 10:34:33 -04:00
Jonny Dixon
9c718c870e
feat(ingestion/neo4j): Add stateful_ingestion and platform_instance capabilities to connector (#12631)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2025-04-26 11:52:20 +01:00
Michael Minichino
fee67788a3
fix(ingest/mode): Add pagination and warn on missing reports (#13322) 2025-04-25 18:21:27 -05:00
david-leifker
9b0634805a
feat(ingestion-openapi): patch support (#13282)
Co-authored-by: Sergio Gómez Villamor <sgomezvillamor@gmail.com>
2025-04-25 13:54:28 -05:00
Sergio Gómez Villamor
1c5b7c18fc
chore(ingestion): removes ignore for SIM117 ruff rule (#13295) 2025-04-23 15:55:46 +02:00
Sergio Gómez Villamor
1563b0e9fb
fix(ingestion): use default generate_browse_path_v2 even if no pipeline_config (#13117) 2025-04-23 13:25:58 +02:00
Aseem Bansal
1de5fb3e6f
fix(cli): redact more secrets (#13287) 2025-04-23 14:49:26 +05:30
Sergio Gómez Villamor
a8637abfe2
tests(kafka): fixing flaky tests (#13171) 2025-04-22 12:58:47 +02:00
Harshal Sheth
f48c6b53ee
feat(ingest/snowflake): show returned query row counts (#13246) 2025-04-21 14:41:40 -07:00
Chakru
54156ea78a
fix(cli): use patch to update dataset properties (#13226) 2025-04-21 14:41:31 -07:00
Harshal Sheth
08453cfbb1
fix(ingest/hive): support multiline view definitions (#13248) 2025-04-21 11:11:35 -07:00
Harshal Sheth
9f7f3cb886
chore(ingest/snowflake): remove unused query code (#13245) 2025-04-21 10:31:36 -07:00
Harshal Sheth
64bda48b51
feat(sdk): auto-fix bad entity type casing (#13218) 2025-04-20 21:40:00 -07:00
Harshal Sheth
5ba8b7d173
fix(ingest/fivetran): use project id by default for bigquery (#13250) 2025-04-20 21:39:40 -07:00
skrydal
47490ec050
feat(ingestion/iceberg): Add capability to extract namespace properties to the iceberg ingestor (#13238) 2025-04-17 16:29:43 +02:00
Hyejin Yoon
72aab9fe63
feat(sdk): add sdk lineage client (#13244)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2025-04-17 17:44:07 +09:00