3089 Commits

Author SHA1 Message Date
Mayuri Nehate
6e50bcc7ea
fix(ingest/nifi): remove duplicate upstream jobs (#10849) 2024-07-04 09:27:06 +02:00
John Joyce
fa3e381f83
refactor(ingest): Refactor structured logging to support infos, warnings, and failures structured reporting to UI (#10828)
Co-authored-by: John Joyce <john@Johns-MBP.lan>
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2024-07-03 19:20:58 -07:00
Julien Jehannet
73cdfeb6c2
feat(ingest/glue): add helper method to permit user or group ownership (#10784) 2024-07-03 10:24:41 -07:00
david-leifker
ea6bc61787
fix(ingestion): fix datajob patcher (#10827) 2024-07-03 11:47:08 -05:00
sagar-salvi-apptware
b8af2b9d69
fix(ingestion/glue): ensure date formatting works on all platforms for aws glue (#10836) 2024-07-03 18:05:37 +05:30
Aseem Bansal
e45f7a4167
feat(cli): add more details to get cli (#10815) 2024-07-03 12:25:15 +05:30
skrydal
099021c7a3
feat(ingest/glue): allow ingestion of empty databases from Glue (#10666)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2024-07-03 11:43:12 +05:30
jordanjeremy
226b059684
fix(ingest): correct profile_day_of_week implementation (#10818) 2024-07-03 11:42:55 +05:30
Chris Collins
b651d81289
feat(SDK) Add StructuredPropertyPatchBuilder in python sdk and provide sample CRUD files (#10824) 2024-07-02 12:39:08 -04:00
dushayntAW
40418d9184
fix(ingestion/unity-catalog): fixed issue with profiling with GE turned on (#10752)
Co-authored-by: Aseem Bansal <asmbansal2@gmail.com>
2024-07-02 15:31:43 +05:30
sagar-salvi-apptware
640d42dc65
feat(ingest/transformer): tags to terms transformer (#10758)
Co-authored-by: Aseem Bansal <asmbansal2@gmail.com>
2024-07-02 15:30:05 +05:30
Chris Collins
6745dfb45e
feat(SDK) Add FormPatchBuilder in python sdk and provide sample CRUD files (#10821) 2024-07-01 17:48:09 -04:00
sid-acryl
3e4749624f
feat(ingest/lookml): ingest field tags (#10792)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2024-07-01 11:15:51 -07:00
Oleksandr Simonchuk
8b4e302881
feat(ingest): add and use file system abstraction in file source (#8415)
Co-authored-by: oleksandrsimonchuk <oleksandr.si@appsflyer.com>
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
Co-authored-by: Tamas Nemeth <treff7es@gmail.com>
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
2024-07-01 10:47:07 -07:00
Tim Drahn
93616f7869
fix(ingestion): ingest emails as empty if no ldap attribute (#9433)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2024-07-01 11:53:41 +05:30
Teppo Naakka
b223281305
feat(ingest/powerbi): powerbi dataset profiling (#9355) 2024-06-28 14:50:08 -07:00
Harshal Sheth
62e6b7ff78
fix(ingest/snowflake): fix error case in column lineage (#10808) 2024-06-28 12:47:59 -07:00
Harshal Sheth
65a1dd1d82
fix(ingest/snowflake): add limits on tables/columns/queries in lineage (#10804) 2024-06-27 18:54:34 -07:00
Harshal Sheth
f4be88d0a9
feat(ingest): set pipeline name in system metadata (#10190)
Co-authored-by: david-leifker <114954101+david-leifker@users.noreply.github.com>
2024-06-27 15:00:35 -07:00
pie1nthesky
5e9afc6d28
fix(ingest/pipeline): catch pipeline exceptions (#10753)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2024-06-27 13:03:36 -07:00
Harshal Sheth
fa2ab1bcee
fix(ingest): add status aspect to dataProcessInstance (#10757) 2024-06-27 12:07:28 -07:00
Harshal Sheth
d63f25faa8
fix(ingest/looker): prevent bad input fields (#10785) 2024-06-27 11:12:01 -07:00
Ellie O'Neil
1ae5bfc1b8
refactor(tags): Use TagUrn class when generating urn (#10786)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2024-06-27 18:00:22 +00:00
Harshal Sheth
0d677e4992
fix(ingest/snowflake): fix column batcher (#10781) 2024-06-25 22:21:54 -07:00
Harshal Sheth
724907b8f4
feat(ingest): add async batch mode to the rest sink (#10733) 2024-06-25 15:49:00 -07:00
Harshal Sheth
0dc0bc5761
feat(ingest/snowflake): performance improvements (#10746) 2024-06-25 14:46:55 -07:00
Harshal Sheth
c6f2839650
fix(ingest): use more aggressive errors with sqlglot (#10769) 2024-06-25 13:27:42 -07:00
Mayuri Nehate
5735d9ec87
feat(data quality): custom assertions models, graphql, sdk (#10761) 2024-06-25 09:16:13 -07:00
Eric L (CCCS)
79ba0b1720
fix(ingest/iceberg): add support for nested dictionaries when configuring pyiceberg (#10762) 2024-06-21 14:38:01 -07:00
ethan-cartwright
b3dccb0b35
fix(bigquery): use get() instead of hassattr for view labels (#10756)
Co-authored-by: Ethan Cartwright <ethan.cartwright@acryl.io>
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2024-06-20 13:56:07 -07:00
Aseem Bansal
f581f70c93
fix(ingest/tableau): warn with better error message (#10749) 2024-06-20 16:56:53 +05:30
pie1nthesky
dfd5bcf5dd
fix(ingest/logging): fix excessive ingestion logging (#10735) 2024-06-19 16:27:39 +05:30
Harshal Sheth
c2b6272730
feat(ingest/snowflake): log queries at info level (#10745) 2024-06-18 15:16:20 -07:00
Harshal Sheth
f1e5a4757c
fix(ingest/dbt): support emitting only model performance (#10714) 2024-06-18 10:31:19 -07:00
Pedro Silva
1e7fec421d
feat(cli): Make ingest deploy create recipe with urn if not exists (#10724) 2024-06-18 14:02:16 +05:30
Harshal Sheth
2d727a960b
feat(ingest/snowflake): support more than 10k views in a db (#10718) 2024-06-18 07:37:39 +02:00
Harshal Sheth
07df5f872a
fix(ingest/looker): fix looker browse paths v2 (#10700) 2024-06-17 22:55:08 +02:00
Harshal Sheth
39081ece15
feat(ingest/bigquery): support using table read permission without profiling (#10699) 2024-06-17 22:54:28 +02:00
ethan-cartwright
c58be155f3
feat(ingest/bigquery): Support for View Labels (#10648)
Co-authored-by: Ethan Cartwright <ethan.cartwright@acryl.io>
Co-authored-by: Andrew Sikowitz <andrew.sikowitz@acryl.io>
2024-06-17 18:36:41 +05:30
Shubham Jagtap
e6246c9d71
fix(ingestion/bigquery): user exceeded quota for concurrent project.lists requests (#10578)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2024-06-14 17:09:13 -07:00
Harshal Sheth
60b09c5dbe
feat(ingest): log http request retries (#10715) 2024-06-14 17:07:53 -07:00
Rajasekhar-Vuppala
a2d8a099d8
feat(ingest/vertica): use 3 part naming (#10636) 2024-06-14 17:04:55 -07:00
noggi
e66726b9cc
Expose get_entities_v2 endpoint in python client (#10694) 2024-06-14 13:46:39 -07:00
Harshal Sheth
62c6704f69
feat(ingest/snowflake): refactor + parallel schema extraction (#10653) 2024-06-14 13:23:07 -07:00
Harshal Sheth
402bf31938
feat(gms): add ingestProposalBatch endpoint (#10706) 2024-06-14 12:30:19 -07:00
Shubham Jagtap
3479d070fe
fix(ingestion/sigma): Fix multiple requests http errors (#10616) 2024-06-14 11:30:45 -07:00
sagar-salvi-apptware
cc8389e431
fix(ingest/kafka-connect): Add lineage extraction for BigQuery Sink Connector in Kafka Connect source (#10647) 2024-06-14 18:51:04 +05:30
sagar-salvi-apptware
d69966074a
fix(ingest/bigquery): Map BigQuery policy tags to datahub column-level tags (#10669) 2024-06-14 16:43:12 +05:30
Harshal Sheth
6329153e36
fix(ingest): fix redshift query urns + reduce memory usage (#10691) 2024-06-13 11:27:06 -07:00
Harshal Sheth
25d48d2d09
fix(ingest/fivetran): fix fivetran bigquery support (#10693) 2024-06-13 11:26:47 -07:00