905 Commits

Author SHA1 Message Date
Harshal Sheth
7dbb3e60cb
chore(ingest): start using explicit exports (#11899) 2024-11-20 13:33:30 -08:00
Harshal Sheth
85c8e605be
fix(ingest): consider sql parsing fallback as failure (#11896) 2024-11-19 15:06:16 -08:00
Andrew Sikowitz
94f1f39667
fix(ingest/partitionExecutor): Fetch ready items for non-empty batch when _pending is empty (#11885) 2024-11-18 17:25:43 -08:00
skrydal
2527f54972
feat(ingest/iceberg): Iceberg performance improvement (multi-threading) (#11182) 2024-11-18 19:41:45 +01:00
sagar-salvi-apptware
fd2da83ff4
feat(ingest/cassandra): Add support for Cassandra as a source (#11822) 2024-11-15 20:41:21 +05:30
Andrew Sikowitz
5ff6295b0f
fix(ingest/partition-executor): Fix deadlock by recomputing ready items (#11853) 2024-11-14 08:48:30 +01:00
Mayuri Nehate
383a70ac0a
fix(ingest/oracle): fix scheme for sqlalchemy < 2 (#11829) 2024-11-14 12:46:27 +05:30
sid-acryl
6454ff30ab
feat(ingest/powerbi): DatabricksMultiCloud native query support (#11756)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
Co-authored-by: Aseem Bansal <asmbansal2@gmail.com>
2024-11-12 19:26:06 +05:30
Mayuri Nehate
84c677629d
feat(ingest): add stateful ingestion support for file source (#11804) 2024-11-08 16:11:30 +05:30
Andrew Sikowitz
a1e16fc22a
fix(ingest/browsePathsV2): Emit Container aspect first, to avoid BrowsePathsV2 generation race condition (#11813) 2024-11-06 23:07:33 -08:00
Harshal Sheth
e609ff810d
feat(ingest/powerbi): improve reporting around m-query parser (#11763) 2024-10-31 16:27:45 -07:00
Harshal Sheth
143fc011fa
feat(ingest/powerbi): add timeouts for m-query parsing (#11753) 2024-10-30 19:40:45 +01:00
Tamas Nemeth
b33ad0a788
feat(ingest/datahub): Add way to filter soft deleted entities (#11738) 2024-10-30 17:41:45 +01:00
Harshal Sheth
6316e10d48
feat(ingest): check ordering in SqlParsingAggregator tests (#11735) 2024-10-29 17:50:37 +01:00
Aseem Bansal
02f0a3dee7
feat(ingest/transform): extend ownership transformer to other entities (#11700) 2024-10-29 15:28:41 +05:30
sagar-salvi-apptware
bb63cbd9db
fix(ingestion/bigquery): Add lineage extraction for BigQuery with GCS source (#11442) 2024-10-29 09:18:08 +01:00
Mayuri Nehate
87fa5b89e8
feat: multi-query lineage for temp upstreams (#11708) 2024-10-25 16:56:55 +05:30
Jonny Dixon
8b062eb8bd
feat(ingest/oracle): retire deprecated cx_oracle library (#11607)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2024-10-23 14:39:57 -07:00
Mayuri Nehate
eab2ac7a2e
feat(ingest/snowflake): support lineage via rename and swap using que… (#11600) 2024-10-23 14:02:08 +05:30
Julien Jehannet
326afc6308
fix(ingestion/glue): manage table names from resource_links from nearest catalog correctly (#11578) 2024-10-23 11:39:23 +05:30
Harshal Sheth
35f30b7d3c
feat(ingest): use mainline sqlglot (#11693) 2024-10-22 19:57:46 -07:00
Shirshanka Das
3b1b76244d
feat(sdk):platform-resource - complex queries (#11675) 2024-10-19 14:53:28 -07:00
Andrew Sikowitz
72d1236669
feat(ingest/transformer/domain): Add support for on conflict do nothing to dataset domain transformers (#11649) 2024-10-18 12:01:39 -07:00
Andrew Sikowitz
dcf4793c3d
fix(ingest/dbt): Prevent lineage cycles when parsing sql of dbt models (#11666) 2024-10-18 03:13:18 -07:00
Mayuri Nehate
179a6714a6
feat(ingest): ensure sqlite file delete on clean exit (#11612) 2024-10-18 10:33:28 +05:30
Harshal Sheth
6e3724b2da
perf(ingest): streamline CLL generation (#11645) 2024-10-17 17:50:59 -07:00
Harshal Sheth
523a456f5c
fix(ingest/redshift): fix syntax error in temp sql (#11661) 2024-10-17 12:33:04 -07:00
Harshal Sheth
68cd17b34e
fix(ingest/redshift): reduce sequence limit for LISTAGG (#11621)
Co-authored-by: treff7es <treff7es@gmail.com>
Co-authored-by: Aseem Bansal <asmbansal2@gmail.com>
2024-10-17 10:08:37 -07:00
Harshal Sheth
b8144699fd
chore(ingest): reorganize unit tests (#11636) 2024-10-16 19:18:32 -07:00
Harshal Sheth
d34717fd82
fix(ingest): remove default value from DatahubClientConfig.server (#11570) 2024-10-16 13:50:33 -07:00
Tamas Nemeth
e76647dd7a
feat(ingest/bigquery): Generate platform resource entities for BigQuery labels (#11602)
Co-authored-by: Shirshanka Das <shirshanka@apache.org>
2024-10-16 10:04:01 +02:00
Mayuri Nehate
b74ba11d93
fix(ingest/delta-lake): skip file count if require_files is false (#11611) 2024-10-14 18:21:05 +05:30
Mayuri Nehate
d0d09a09f8
fix(ingest): ignore irrelevant urns from % change computation (#11583) 2024-10-11 16:55:27 +05:30
Shuixi Li
f147b51fc8
feat(ingest): add preset source (#10954)
Co-authored-by: MARK CHENG <hcheng@wealthsimple.com>
Co-authored-by: hwmarkcheng <94201005+hwmarkcheng@users.noreply.github.com>
2024-10-09 20:27:31 -07:00
Mayuri Nehate
26bbe02e44
feat(ingest/stateful): omit irrelevant urns for deletion (#11558) 2024-10-09 08:46:58 -07:00
Shirshanka Das
f3a348a231
sdk(platform-resource): add entity type for ease of use (#11541) 2024-10-07 20:30:14 -07:00
skrydal
134ad21afe
fix(ingestion/nifi): Fix for incremental lineage ingestion for nifi (#11517) 2024-10-04 17:29:06 +05:30
skrydal
e1514d5e8e
fix(ingestion/nifi): Improve nifi lineage extraction performance (#11490) 2024-10-01 21:51:00 +02:00
sagar-salvi-apptware
660fbf8e57
fix(ingestion/transformer): Add container support for ownership and domains (#11375) 2024-10-01 11:39:07 -07:00
Harshal Sheth
07034caf09
feat(ingest): support DATAHUB_INCLUDE_ENV_IN_CONTAINER_PROPERTIES (#11476) 2024-09-27 10:24:22 -07:00
Mayuri Nehate
6a58493011
fix(ingest/bq): do not query PARTITIONS for biglake tables (#11463) 2024-09-27 16:46:37 +05:30
sid-acryl
9fb2df11f3
fix(ingest): sort by last modified not working in the UI (#11343) 2024-09-23 10:06:05 -07:00
Harshal Sheth
aec5e1b249
fix(ingest/dbt): handle null index values (#11433) 2024-09-19 16:05:44 -07:00
Sergio Gómez Villamor
31edb46dbc
feat(ingestion): adds env property in ContainerProperties (#11214)
Co-authored-by: siladitya2 <siladitya2@gmail.com>
2024-09-18 14:56:52 +05:30
Harshal Sheth
38bcd9c381
feat(ingest): default to ASYNC_BATCH mode in datahub-rest sink (#11369) 2024-09-17 07:11:58 +01:00
Harshal Sheth
3755731f0e
chore(ingest): improve code formatting (#11326) 2024-09-11 10:48:57 -07:00
Harshal Sheth
311ea10833
feat(ingest): maintain ordering in file-backed dict (#11346) 2024-09-10 13:53:38 -07:00
Mayuri Nehate
837d00d391
fix(ingest/bq): fix ordering of queries for use_queries_v2 (#11333) 2024-09-10 12:17:23 -07:00
Harshal Sheth
f4033707d4
chore(ingest): bump acryl-sqlglot (#11331) 2024-09-09 21:09:44 -07:00
Mayuri Nehate
cf49f80e77
feat(ingest/sql): auto extract and use mode query user metadata (#11307)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2024-09-09 12:38:24 -07:00