3904 Commits

Author SHA1 Message Date
dushayntAW
96061be564
fix(ingestion/salesforce): handle the label with none value scenario (#10446) 2024-05-08 14:11:50 +05:30
Tamas Nemeth
897e648eae
fix(ingest/mode): Improve query lineage (#10284)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2024-05-07 22:02:37 -07:00
Mayuri Nehate
f6627efe71
fix(ingest/snowflake): add more reporting for usage aggregation, handle lineage errors (#10279)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2024-05-07 08:42:39 -07:00
Shubham Jagtap
ae3f0fd5ee
feat(ingestion): Copy urns from previous checkpoint state on ingestion failure (#10347) 2024-05-07 17:36:40 +05:30
Shubham Jagtap
ddb38e7448
fix(ingestion/tableau): Fix tableau custom sql lineage gap (#10359) 2024-05-07 13:47:56 +05:30
Harshal Sheth
0e8fc5129f
feat(cli): cache sql parsing intermediates (#10399) 2024-05-06 16:59:00 -07:00
Harshal Sheth
1dae37a8ed
fix(ingest/bigquery): remove last modified timestamp fallback (#10431) 2024-05-06 16:30:04 -07:00
Harshal Sheth
6a24ed2743
feat(ingest/snowflake): use system sampling on very large tables (#10430) 2024-05-06 22:00:18 +02:00
Gabe Lyons
bda609b000
docs(ingest): update datahub sink doc to include an acryl example (#10411) 2024-05-03 08:47:57 -07:00
dushayntAW
c00ddb2a0d
fix(ingest/transformer): new transformer to clean user URN for datasetUsageStatistics aspect (#10398) 2024-05-03 13:24:48 +05:30
Tamas Nemeth
4e47933e55
fix(ingest/bigquery): Fixing double sanitization of urns (#10386) 2024-05-02 21:24:53 -07:00
Ellie O'Neil
d82750b891
DynamoDB IAM auth (#10419) 2024-05-02 16:02:40 -07:00
Pablo Osinaga
0b99145797
feat(metabase): add stateful ingestion (#10360) 2024-05-02 12:04:18 -07:00
sid-acryl
0e795a6614
fix(ingestion/looker): fix lineage for dimension group column (#10382) 2024-05-02 11:48:11 -07:00
Harshal Sheth
8bebf63e3d
fix(ingest): map bigquery nested types properly (#10409) 2024-05-02 12:27:35 +02:00
Shubham Jagtap
77045f9a7f
perf(ingestion/fivetran): Connector performance optimization (#10346) 2024-04-30 09:44:14 -07:00
Hyejin Yoon
704ca650ad
docs: add slack utm component in docs (#10214) 2024-04-30 12:54:54 +09:00
Hyejin Yoon
e3f27c8b91
feat: add keywords for SEO (#10358) 2024-04-30 08:12:32 +09:00
Harshal Sheth
3ab4ec9b44
feat(ingest/dbt): support a datahub section in meta mappings (#10371) 2024-04-26 09:41:03 -07:00
Tamas Nemeth
7e69247a7f
fix(ingest/profiling): Filter tables early based on profile pattern filter (#10378) 2024-04-26 17:35:03 +02:00
Harshal Sheth
4add9b157d
feat(ingest/dbt): use columns from manifest as a fallback (#10374) 2024-04-25 22:29:51 +02:00
Harshal Sheth
4c40a24d76
fix(ingest/bigquery): map date types correctly (#10383) 2024-04-25 22:22:37 +02:00
Harshal Sheth
e64229b036
feat(ingest/dbt): handle complex dbt sql + improve docs (#10323) 2024-04-24 11:13:32 -07:00
mrjefflewis
e4cf4de3e0
feat(ingest/mssql): improve docs on using odbc (#10370)
Co-authored-by: Jeff Lewis <jeff.lewis@acryl.io>
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2024-04-24 13:00:45 -05:00
Camilo Gutierrez
cb6d744ae7
feat(ingestion/bigquery): support for table clones (#10274)
Co-authored-by: Equipo DataOps <cagutierra@unal.edu.co>
2024-04-24 18:04:02 +05:30
Shubham Jagtap
ca2a10e36f
fix(ingestion/qlik): Unable to ingest more than ten spaces (#10228)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2024-04-23 16:04:20 -07:00
dushayntAW
b9a34fe90b
fix(ingest/salesforce): escape markdown char for multiline description (#10351) 2024-04-23 17:05:55 +05:30
Shubham Jagtap
934ab03d16
test(ingestion/sigma): Add integration test cases (#10356) 2024-04-22 16:19:01 -07:00
Sergio
725c85815b
fix(ingest/starburst): parse create_time datetime format (#10345) 2024-04-22 09:12:41 -07:00
dushayntAW
3668a56df7
fix(ingest/transformer): avoid duplicating terms (#10348) 2024-04-22 20:15:58 +05:30
Harshal Sheth
08731055ba
feat(ingest): bump acryl-sqlglot dep (#10343) 2024-04-20 08:37:22 +02:00
Tamas Nemeth
bf8e3a9838
fix(ingest/bigquery): set default max_overflow to -1 (#10342) 2024-04-19 14:44:20 -07:00
Raj Tekal
996e5b0130
chore(metadata) Addressing vulnerabilities (#10296) 2024-04-19 12:53:50 -07:00
Pablo Osinaga
62c7ac706b
feat(ingest/metabase): add ability to exclude other users collections (#10330) 2024-04-19 12:53:17 -07:00
Harshal Sheth
d1cc0af314
feat(ingest/classify): add pip dependency (#10335) 2024-04-19 12:52:51 -07:00
Harshal Sheth
27917ca634
feat(ingest): mark acryl cloud package first-party for logging (#10334) 2024-04-19 10:59:44 -07:00
david-leifker
adffce2f03
feat(openapi-v3): entity-registry openapi spec (#9550)
Co-authored-by: Ajoy Majumdar <ajoymajumdar@hotmail.com>

Adds support for custom aspects in the openapi api
2024-04-18 15:03:41 -05:00
Harshal Sheth
7d31420b69
feat(ingest): materialize terms produced by ingestion (#10249) 2024-04-18 10:48:16 -07:00
Harshal Sheth
8ecafbe802
fix(ingest/kafka): clarify meta-mapping docs (#10320) 2024-04-18 10:47:43 -07:00
Harshal Sheth
f99f73841a
feat(ingest/profiling): allow unique count queries to be combined (#10322) 2024-04-18 10:47:27 -07:00
Aseem Bansal
d3fb698d8d
fix(ingest): make gms url configuration resilient in rest emitter (#10316) 2024-04-18 14:46:32 +05:30
Andrew Sikowitz
a041a2ee52
fix(ingest/transformers): Use set to store tags in AddDatasetTags (#10317) 2024-04-18 13:11:18 +05:30
Harshal Sheth
77f1a0c60e
fix(ingest/profiling): compute sample row count correctly (#10319) 2024-04-18 08:40:11 +02:00
Harshal Sheth
4e2cec86b3
feat(ingest/sigma): fix stateful ingestion (#10321) 2024-04-17 20:09:30 -07:00
Mayuri Nehate
529710ab9d
fix(ingest/databricks): handle and report config parse failure, updat… (#10261) 2024-04-17 12:14:16 -07:00
Mayuri Nehate
0613234d2a
fix(ingest/tableau): handle very large filter queries (#10295) 2024-04-17 12:14:02 -07:00
david-leifker
25ba1e1a8b
chore(pyiceburg): set minimum version (#10318) 2024-04-17 11:47:13 -07:00
Harshal Sheth
3cdc462a7b
fix(ingest): disallow src.* imports, fix powerbi/sigma (#10292) 2024-04-16 15:04:51 -07:00
Tamas Nemeth
d463a16b49
chore(ingest/presto-on-hive): Renaming presto-on-hive to hive-metastore source (#10278) 2024-04-16 23:35:16 +02:00
Felix Lüdin
9eb6b2d68d
fix(ingest): improve performance of get_allowed_list in AllowDenyPattern when dealing with large lists (#10219) 2024-04-16 12:48:48 -07:00