1703 Commits

Author SHA1 Message Date
Jonny Dixon
010da3c480
fix(ingestion/abs): updated deprecated azure sdk parameter to supported parameter and uri prefix support of https (#14106) 2025-07-16 18:44:31 +01:00
Sergio Gómez Villamor
6c82d70642
fix(snowflake): correct additional_database_names OR logic in access_history filter (#14104)
Co-authored-by: Claude <noreply@anthropic.com>
2025-07-16 12:10:12 +02:00
Chakru
3ab354eac4
fix(quickstart): update commit hash with resolved quickstart profile (#14089) 2025-07-15 22:19:29 +05:30
Sergio Gómez Villamor
7c8c55bc4e
feat(snowflake): add database pattern filtering to access history query for improved performance (#14021)
Co-authored-by: Claude <noreply@anthropic.com>
2025-07-14 18:15:13 +02:00
Aseem Bansal
8d332b68b8
fix(ingest): better experimental contract (#14071) 2025-07-14 19:45:55 +05:30
Aseem Bansal
9e205ccc53
fix(ingest/report): fix bug w.r.t. aspect count (#14070) 2025-07-14 16:17:16 +05:30
RyanHolstien
c694168991
feat(rest_emitter): support delete emit mcp (#14033) 2025-07-11 15:41:42 -05:00
Harshal Sheth
84ad365b37
chore(ingest/teradata): fix lint errors (#14055) 2025-07-11 13:01:10 -07:00
Harshal Sheth
47d40dc1fd
fix(sdk): deduplicate entity types in search sdk (#14041) 2025-07-11 11:42:33 -07:00
Tamas Nemeth
3c748c1b3c
fix(ingest/teradata): Teradata perf improvements (#13967) 2025-07-11 17:47:11 +02:00
Harshal Sheth
c3dd9707bb
fix(ingest/tableau): optimize tableau test performance (#14036) 2025-07-11 08:37:27 -07:00
Chakru
21726bc334
feat(quickstart): migrate to compose profile and other improv. (#13566) 2025-07-11 20:12:10 +05:30
skrydal
c0d651362b
feat(ingestion): Introduce query dedup strategy (#13915) 2025-07-11 15:51:41 +02:00
skrydal
f0a6e016e8
fix(ingestion/snowflake): Address diamond lineage problem + performance improvements (#13918) 2025-07-11 15:50:52 +02:00
Harshal Sheth
8758e85739
refactor(ingest/tableau): cleanup duplicate lineage calls (#14018) 2025-07-10 21:06:09 -07:00
Harshal Sheth
aa4f8aca3e
chore(ingest): simplify MetadataChangeProposalWrapper usages (#14019)
Co-authored-by: Cursor Agent <cursoragent@cursor.com>
2025-07-10 08:45:01 -07:00
Benjamin Maquet
0eb2d1b2e2
feat(superset/ingest): add metrics to dataset columns (#13894)
Co-authored-by: Hyejin Yoon <0327jane@gmail.com>
2025-07-10 18:32:28 +09:00
Sergio Gómez Villamor
904a43e1c9
fix(ingest/avro): expand record fields consistently in arrays, maps, and direct references (#13961)
Co-authored-by: Claude <noreply@anthropic.com>
2025-07-10 11:00:59 +02:00
Tamas Nemeth
b1354abcba
fix(ingest/gcs): Fix GCS URI mismatch causing file filtering during ingestion (#14006) 2025-07-09 22:22:22 +02:00
Aseem Bansal
4509add782
chore(python): drop python 3.8 follow ups (#14004) 2025-07-09 13:24:33 +05:30
Aseem Bansal
c0482c0e85
feat(ingest): add better urn samples in report (#13977) 2025-07-09 12:49:08 +05:30
Sergio Gómez Villamor
5da40ec91e
fix(kafka-connect): handle SQL Server 3-level topic patterns and add Debezium integration tests (#13970)
Co-authored-by: Claude <noreply@anthropic.com>
2025-07-08 13:09:19 +02:00
Sergio Gómez Villamor
d85e2e5f28
fix(kafka-connect): escape dots in topic.prefix for regex patterns (#13955) 2025-07-08 09:59:03 +02:00
Harshal Sheth
6d2796a1c1
feat(ingest/snowflake): support TLL for stored procs (#13890) 2025-07-07 14:59:42 -07:00
Tamas Nemeth
5b8d4bad7c
feat(ingest/athena): Iceberg partition columns extraction (#13607) 2025-07-07 14:46:40 +02:00
Tamas Nemeth
e331e807c6
fix(ingest/s3): Fix ingestion when path_spec had a wildcard character in the path (#13940) 2025-07-07 13:12:21 +02:00
Aseem Bansal
a7c5895d98
feat(ingest): add aspects by subtype in report, telemetry (#13921) 2025-07-03 17:07:39 +05:30
Sergio Gómez Villamor
1561a6c8ca
feat(hex): add retry logic with exponential backoff for 429 rate limiting (#13905)
Co-authored-by: Claude <noreply@anthropic.com>
2025-07-03 11:12:31 +02:00
Tamas Nemeth
eaf2bf6dec
feat(ingest/kafka-connect): Add more connectors the regexp transformation support (#13748) 2025-07-03 08:57:50 +02:00
Hyejin Yoon
fedbfa3f7e
feat: update fivetran connector with new sdk (#13859) 2025-07-02 21:47:02 +09:00
sleeperdeep
70a39b70f2
fix(ingest): support ownership types in AddDatasetOwnership transformer (#13081) 2025-07-01 11:34:29 -07:00
Aseem Bansal
92784ec3a4
feat(ingest/lineage): generate static json lineage file (#13906) 2025-07-01 20:51:18 +05:30
Aseem Bansal
03309b7ffa
feat(mock-data-source): add first seen urn in report (#13889) 2025-06-30 15:15:50 +05:30
Harshal Sheth
05d029d690
feat(ingest/snowflake): add extra_info for snowflake (#13539) 2025-06-27 12:23:28 -07:00
Aseem Bansal
5759711992
fix(ingest/rest): out-of-date structured report being sent (#13866)
Co-authored-by: Sergio Gómez Villamor <sgomezvillamor@gmail.com>
2025-06-27 15:07:28 +05:30
RyanHolstien
52e49eb79b
fix(emitter): fix emitter handling of unicode characters (#13867) 2025-06-26 10:43:55 -05:00
Sergio Gómez Villamor
9a32dd7f7f
feat(dremio): add configurable time range for query lineage extraction, sql aggregator report and fix schema_pattern filtering (#13613)
Co-authored-by: Claude <noreply@anthropic.com>
2025-06-26 15:28:42 +02:00
Michael Maltese
0f0119f219
feat(ingestion): use approx_distinct when profiling Athena and Trino (#13671) 2025-06-25 16:29:26 -04:00
Aseem Bansal
f8c6db07d8
ingest(snowflake): remove email_as_user_identifier support (#13827) 2025-06-20 19:38:16 +05:30
Aseem Bansal
b3a25d6fbd
fix(ingest/bigquery): use email as user urn (#13831) 2025-06-20 18:45:41 +05:30
Aseem Bansal
614e627720
feat(cli): --streaming-batch option delete large hierarchy (#13824) 2025-06-19 17:53:56 +05:30
Gabe Lyons
efa8d7dc27
fix(snowflake summary): fixing snowflake summary source (#13785) 2025-06-17 19:17:09 -04:00
Shuixi Li
28d58e8973
fix(ingest/preset): url for view in source from edit view to explore view (#13666) 2025-06-13 15:20:58 +05:30
Hyejin Yoon
45e5ef84be
docs: update the example scripts with the new sdk (#13717) 2025-06-12 14:00:26 +09:00
Tamas Nemeth
606439cafe
fix(ingest/bigquery): Set qualified name for bigquery containers (#13747) 2025-06-12 05:53:59 +02:00
Tamas Nemeth
6b8dfb0aa8
feat(ingest/glue): Lake formation tags ingestion (#13693) 2025-06-11 12:26:27 +02:00
david-leifker
794a2ab317
feat(config): update configuration caching (#13740) 2025-06-10 15:17:05 -05:00
Harshal Sheth
78cfc49703
chore(ingest): bump sqlglot dep (#13730) 2025-06-10 11:09:37 -07:00
Jonny Dixon
0fa88189c8
feat(ingestion/mssql): detection of rds or managed sql server for jobs history (#13731) 2025-06-10 18:41:26 +01:00
david-leifker
4f5e9c7508
feat(rest-emitter): set 60s ttl on gms config cache (#13729) 2025-06-10 09:21:29 -05:00