905 Commits

Author SHA1 Message Date
Shubham Jagtap
3635c1c221
feat(ingestion/bigquery): Use sqlglot_lineage for usage and add more perf timers (#9247)
Co-authored-by: Andrew Sikowitz <andrew.sikowitz@acryl.io>
2023-12-28 04:54:26 -05:00
Harshal Sheth
52687f3eea
feat(ingest): support CLL for redshift materialized views with auto refresh (#9508) 2023-12-22 08:18:22 +01:00
Harshal Sheth
a18c72083d
feat(ingest): handle multiline string coercion (#9484) 2023-12-21 13:50:39 -05:00
Harshal Sheth
ecda3e6187
feat(ingest): pydantic v2 compatibility (#9434) 2023-12-18 18:26:33 -05:00
Tamas Nemeth
03590a1948
fix(ingest/snowflake) - Fixing snowflake url with default region (#9443) 2023-12-18 13:54:31 -05:00
Shubham Jagtap
1741c07d76
feat(ingestion): Add test_connection methods for important sources (#9334) 2023-12-14 12:31:51 -05:00
Shirshanka Das
ecef50f8fc
fix(ingest): make user_urn and group_urn generation consider user and… (#9026)
Co-authored-by: Aseem Bansal <asmbansal2@gmail.com>
2023-12-14 21:33:36 +05:30
siddiquebagwan-gslab
0d6a5e5df2
feat(ingestion/transformer): create tag if not exist (#9076) 2023-12-14 10:36:28 -05:00
Mayuri Nehate
aac1c55a14
feat(ingest/unity): enable hive metastore ingestion (#9416) 2023-12-14 10:35:06 -05:00
skrydal
a495d652e0
feat(ingestion): Add typeUrn handling to ownership transformers (#9370) 2023-12-13 14:34:20 -05:00
RyanHolstien
8a1122049c
feat(patch): support fine grained lineage patches (#9408)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2023-12-11 16:25:43 -06:00
Andrew Sikowitz
16fe22aafa
feat(ui/last-updated): Calculate last updated time as max(properties time, operation time) (#9242) 2023-12-06 15:21:56 -05:00
Andrew Sikowitz
27f23ecdd5
feat(ingest/unity): GE Profiling (#8951) 2023-12-06 13:59:23 -05:00
Andrew Sikowitz
806f09ae23
test(ingest/unity): Unity catalog data generation (#8949) 2023-12-05 12:33:00 -05:00
Harshal Sheth
7517c77ffd
fix(ingest): resolve issue with caplog and asyncio (#9377) 2023-12-04 20:00:11 -05:00
Harshal Sheth
4ec3208918
feat(ingest): support stdin in datahub put (#9359) 2023-12-04 14:31:58 -05:00
Andrew Sikowitz
c0ef728868
fix(ingest/powerbi): Allow old parser to parse [db].[schema].[table] table references (#9360) 2023-12-04 13:21:42 -05:00
Harshal Sheth
4d9eb12cba
feat(ingest/dbt): support custom ownership types in dbt meta (#9332) 2023-12-01 14:33:10 +05:30
Harshal Sheth
f9fd9467ef
feat(ingest): clean up DataHubRestEmitter return type (#9286)
Co-authored-by: Andrew Sikowitz <andrew.sikowitz@acryl.io>
2023-11-30 21:00:43 -05:00
Harshal Sheth
a7dc9c9d22
feat(sdk): autogenerate urn types (#9257) 2023-11-30 18:11:36 -05:00
Adriano Vega Llobell
3d7962cf17
fix(ingest/sql): improve handling of views with dots in their names (#9183) 2023-11-28 17:58:42 -05:00
Mayuri Nehate
08fb730676
fix(ingest): drop deprecated database_alias from sql sources (#9299)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2023-11-28 15:49:49 -05:00
Tamas Nemeth
f794a90515
fix(ingest/snowflake): Apply email filter on all usage metrics (#9269) 2023-11-23 09:45:25 +01:00
RyanHolstien
a62d52e6b1
feat(mysql): upgrade to version 8.2 for quickstart (#9241)
Co-authored-by: david-leifker <114954101+david-leifker@users.noreply.github.com>
2023-11-22 13:54:12 -06:00
Tim
01874808af
fix(ingest/athena): detect decimal type correctly (#9270) 2023-11-20 16:57:48 +01:00
Adriano Vega Llobell
d0fa5de9f0
fix(sql-parser): convert platform instance to lowercase when building table urns (#9181) 2023-11-16 13:58:49 -05:00
Harshal Sheth
19aa215068
feat(ingest/dbt): dbt column-level lineage (#8991) 2023-11-13 19:00:30 -05:00
Shubham Jagtap
a187127ac5
feat(ingestion): file-based state checkpoint provider (#9029) 2023-11-10 14:36:00 -08:00
Andrew Sikowitz
2c58c63780
fix(ingest/datahub-source): Order by version in memory (#9185) 2023-11-06 15:58:57 -05:00
Harshal Sheth
ec9725026d
chore(ingest): remove legacy memory_leak_detector (#9158) 2023-11-02 09:39:08 -07:00
Harshal Sheth
bab9d1c931
fix(ingest): fully support MCPs in urn_iter primitive (#9157) 2023-11-01 20:47:18 -07:00
Harshal Sheth
55f14530a3
feat(ingest): bump sqlglot parser (#9155) 2023-11-01 12:42:52 +05:30
Harshal Sheth
ea1273281e
feat(ingest): sql parser perf + asyncio fixes (#9119) 2023-10-30 20:57:59 -07:00
Harshal Sheth
ce0f36b8bc
feat(sqlparser): support more update syntaxes + fix bug with subqueries (#9105) 2023-10-30 14:23:19 -07:00
Harshal Sheth
0bd2d9a36c
feat(ingest): add entity_supports_aspect helper (#9120) 2023-10-30 14:22:05 -07:00
Mayuri Nehate
f402090c1e
feat(ingest): support view lineage for all sqlalchemy sources (#9039) 2023-10-26 09:14:32 -07:00
Andrew Sikowitz
6c932e8afe
feat(ingest/bigquery): Attempt to support raw dataset pattern (#9109) 2023-10-25 13:17:09 -07:00
Andrew Sikowitz
8a80e858a7
build(ingest): Bump avro pin: security vulnerability (#9042) 2023-10-25 13:06:12 -04:00
siddiquebagwan-gslab
b612545220
feat(ingestion/redshift): support auto_incremental_lineage (#9010) 2023-10-25 15:26:06 +05:30
skrydal
378d84a346
fix(ui): Fixes handling of resources filters in UI (#9087) 2023-10-24 21:12:11 +01:00
Harshal Sheth
d13553f53a
feat(sqlparser): extract CLL from updates (#9078) 2023-10-24 12:24:50 -07:00
Andrew Sikowitz
adf8c8db38
refactor(ingest): Move sqlalchemy import out of sql_types.py (#9065) 2023-10-24 08:59:56 +02:00
Harshal Sheth
8fb95e88a1
feat(sqlparser): parse create DDL statements (#9002) 2023-10-23 12:40:42 -07:00
Tim
1eaf9c8c5f
feature(ingest/athena): introduce support for complex and nested schemas in Athena (#8137)
Co-authored-by: dnks23 <dominik.s23@live.de>
Co-authored-by: Tamas Nemeth <treff7es@gmail.com>
Co-authored-by: Tim <tim@MBP-von-Tim.fritz.box>
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2023-10-18 09:39:59 -07:00
Andrew Sikowitz
d2eb42373f
fix(ingest/sqlalchemy): Fix URL parsing when sqlalchemy_uri provided (#9032) 2023-10-18 17:34:45 +02:00
Mayuri Nehate
c81a339bfc
build(ingest): remove ratelimiter dependency (#9008) 2023-10-16 09:27:57 -07:00
Andrew Sikowitz
6bc7425353
feat(cli/datacontract): Add data quality assertion support (#8968)
Co-authored-by: Mayuri Nehate <33225191+mayurinehate@users.noreply.github.com>
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
Co-authored-by: Aseem Bansal <asmbansal2@gmail.com>
2023-10-13 12:36:18 -04:00
Tamas Nemeth
a8f0080c08
feat(ingest/teradata): Teradata source (#8977) 2023-10-12 15:14:45 -07:00
Tamas Nemeth
c381806110
feat(ingestion): Adding config option to auto lowercase dataset urns (#8928) 2023-10-12 13:56:30 +02:00
Tamas Nemeth
dd418de76d
fix(ingest/bigquery): Fix shard regexp to match without underscore as well (#8934) 2023-10-12 13:10:59 +02:00