1604 Commits

Author SHA1 Message Date
Oleksandr Simonchuk
8b4e302881
feat(ingest): add and use file system abstraction in file source (#8415)
Co-authored-by: oleksandrsimonchuk <oleksandr.si@appsflyer.com>
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
Co-authored-by: Tamas Nemeth <treff7es@gmail.com>
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
2024-07-01 10:47:07 -07:00
Tim Drahn
93616f7869
fix(ingestion): ingest emails as empty if no ldap attribute (#9433)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2024-07-01 11:53:41 +05:30
Teppo Naakka
b223281305
feat(ingest/powerbi): powerbi dataset profiling (#9355)
2024-06-28 14:50:08 -07:00
Harshal Sheth
f4be88d0a9
feat(ingest): set pipeline name in system metadata (#10190)
Co-authored-by: david-leifker <114954101+david-leifker@users.noreply.github.com>
2024-06-27 15:00:35 -07:00
Harshal Sheth
fa2ab1bcee
fix(ingest): add status aspect to dataProcessInstance (#10757)
2024-06-27 12:07:28 -07:00
Harshal Sheth
0d677e4992
fix(ingest/snowflake): fix column batcher (#10781)
2024-06-25 22:21:54 -07:00
Harshal Sheth
724907b8f4
feat(ingest): add async batch mode to the rest sink (#10733)
2024-06-25 15:49:00 -07:00
Harshal Sheth
0dc0bc5761
feat(ingest/snowflake): performance improvements (#10746)
2024-06-25 14:46:55 -07:00
Eric L (CCCS)
79ba0b1720
fix(ingest/iceberg): add support for nested dictionaries when configuring pyiceberg (#10762)
2024-06-21 14:38:01 -07:00
Harshal Sheth
2d727a960b
feat(ingest/snowflake): support more than 10k views in a db (#10718)
2024-06-18 07:37:39 +02:00
ethan-cartwright
c58be155f3
feat(ingest/bigquery): Support for View Labels (#10648)
Co-authored-by: Ethan Cartwright <ethan.cartwright@acryl.io>
Co-authored-by: Andrew Sikowitz <andrew.sikowitz@acryl.io>
2024-06-17 18:36:41 +05:30
Rajasekhar-Vuppala
a2d8a099d8
feat(ingest/vertica): use 3 part naming (#10636)
2024-06-14 17:04:55 -07:00
Harshal Sheth
62c6704f69
feat(ingest/snowflake): refactor + parallel schema extraction (#10653)
2024-06-14 13:23:07 -07:00
Shubham Jagtap
3479d070fe
fix(ingestion/sigma): Fix multiple requests http errors (#10616)
2024-06-14 11:30:45 -07:00
sagar-salvi-apptware
cc8389e431
fix(ingest/kafka-connect): Add lineage extraction for BigQuery Sink Connector in Kafka Connect source (#10647)
2024-06-14 18:51:04 +05:30
sagar-salvi-apptware
d69966074a
fix(ingest/bigquery): Map BigQuery policy tags to datahub column-level tags (#10669)
2024-06-14 16:43:12 +05:30
Harshal Sheth
6329153e36
fix(ingest): fix redshift query urns + reduce memory usage (#10691)
2024-06-13 11:27:06 -07:00
Harshal Sheth
25d48d2d09
fix(ingest/fivetran): fix fivetran bigquery support (#10693)
2024-06-13 11:26:47 -07:00
Harshal Sheth
3a72d92493
feat(ingest/dbt): include package_name in dbt custom props (#10652)
2024-06-12 15:07:42 +02:00
Shubham Jagtap
05aee03f3f
perf(ingestion/fivetran): Connector performance optimization (#10556)
2024-06-11 20:19:57 -07:00
skrydal
b9e71a61b1
feat(ingest/glue): database parameters extraction (#10665)
2024-06-11 11:50:46 -07:00
aabharti-visa
8a905774f7
feat(ingestion/kafka)-Add support for ingesting schemas from schema registry (#10612)
2024-06-11 14:00:12 +02:00
Harshal Sheth
e842161849
feat(ingest): add fast query fingerprinting (#10619)
2024-06-05 13:47:44 -07:00
Eric L (CCCS)
c04b3bc2e4
fix(ingest/iceberg): update iceberg source to support newer versions of pyiceberg at runtime (#10614)
2024-06-04 09:45:29 -07:00
Mayuri Nehate
81b655c82d
feat(open assertion spec): MVP for Snowflake DMF Assertions: update models, add assertions cli with snowflake integration (#10602)
2024-05-31 12:03:22 -07:00
Harshal Sheth
db965d61ea
fix(ingest/dbt): only generate one subtype (#10615)
2024-05-29 17:11:34 -07:00
Harshal Sheth
37bc423b50
feat(ingest): enable stateful ingestion safety threshold (#10516)
2024-05-29 12:01:04 -07:00
Harshal Sheth
e873104b80
feat(ingest): fetch connections from the backend (#10511)
2024-05-29 10:32:29 -07:00
Tony Ouyang
a5515c5d47
feat(ingestion/SageMaker): Remove deprecated apis and add stateful ingestion capability (#10573)
2024-05-28 12:16:28 +02:00
Paul Rogalinski-Pinter
1c1450e1d8
fix(ingest/metabase): Fix for query template expressions and invalid URNs for Text Cards (#10381)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2024-05-24 11:35:29 -07:00
Harshal Sheth
2e14f70864
test(ingest/sql): refactor CLL generator + add tests (#10580)
2024-05-23 18:11:22 -07:00
sid-acryl
e361d28309
feat(ingestion/looker): ingest explore tags into the DataHub (#10547)
2024-05-23 18:36:43 +05:30
dushayntAW
92780e607c
fix(ingest/unity-catalog) upstream lineage for hive_metastore external table with s3 location (#10546)
2024-05-23 18:34:56 +05:30
Harshal Sheth
7a519ac73c
fix(ingest/dbt): resolve more dbt ephemeral node lineage gaps (#10553)
2024-05-22 12:44:54 -07:00
sid-acryl
666de9e4e6
fix(ingestion/powerbi): Databricks support for table lineage (#10416)
2024-05-22 18:57:45 +05:30
Tamas Nemeth
f831518328
fix(ingest/mode): Adding Dashboards into containers (#10563)
2024-05-22 07:38:57 +02:00
Harshal Sheth
b8023a93a4
refactor(ingest): defer ctx.graph initialization (#10504)
2024-05-21 17:01:35 -07:00
Harshal Sheth
2b6c78b776
feat(ingest): bump acryl-sqlglot dep (#10554)
2024-05-21 23:52:33 +02:00
Harshal Sheth
187ef12182
fix(ingest/dbt): improve handling for CLL via ephemeral nodes (#10535)
2024-05-20 13:33:25 -07:00
sid-acryl
40d2ae3b78
fix(ingestion/powerbi): handle special character #(tab) in native query parsing (#10520)
2024-05-20 15:20:39 +05:30
Sergio Gómez Villamor
0059960720
feat(ingestion/glue): delta schemas (#10299)
Co-authored-by: Mayuri Nehate <33225191+mayurinehate@users.noreply.github.com>
2024-05-17 14:21:35 +02:00
Harshal Sheth
3d5735cbc5
chore(ingest): run pyupgrade for python 3.8 (#10513)
2024-05-15 22:31:05 -07:00
Harshal Sheth
bc9250c904
fix(ingest): fix bug in incremental lineage (#10515)
2024-05-15 22:30:47 -07:00
sid-acryl
c55c12c918
fix(ingestion/looker): deduplicate the view field (#10482)
Co-authored-by: Pedro Silva <pedro@acryl.io>
2024-05-15 11:25:07 -07:00
richenc
8e5f17b131
feat(ingest/tableau): support platform instance mapping based off database server hostname (#10254)
Co-authored-by: Richie Chen <richie.chen@hulu.com>
Co-authored-by: Gabe Lyons <itsgabelyons@gmail.com>
2024-05-15 08:23:49 -07:00
dushayntAW
a164b70e1d
chore(ingest/presto-on-hive) Set enable_properties_merge to True by default (#10469)
2024-05-15 18:57:13 +05:30
sagar-salvi-apptware
5fbf781558
fix(ingest/transformer): Add dataset domains based on tags using transformer (#10458)
2024-05-15 14:13:03 +05:30
dushayntAW
bfd3c4bf1c
fix(ingestion/kafka-connect): failure on multiple env substitutes (#10443)
2024-05-09 17:59:55 +05:30
Egemen Berk Galatalı
b1b7cedd8d
feat(ingest/tableau): Fetch Upstreams From Columns (#9874)
2024-05-08 14:21:07 -07:00
dushayntAW
96061be564
fix(ingestion/salesforce): handle the label with none value scenario (#10446)
2024-05-08 14:11:50 +05:30