1004 Commits

Author SHA1 Message Date
Jonny Dixon
c7689142aa Update test_tableau_source.py 2025-07-30 18:17:26 +01:00
Jonny Dixon
201d054a38 Update test_tableau_source.py 2025-07-30 18:16:17 +01:00
Jonny Dixon
033d19479e test coverage 2025-07-30 18:14:25 +01:00
Jonny Dixon
6aee5dc6ae tests and linting 2025-07-30 12:01:36 +01:00
Harshal Sheth
fe3ffc1c27
revert(ingest): Extending file sink to support writing to S3 (#14160) (#14248) 2025-07-29 12:13:26 +02:00
John Joyce
9ff5a2cf9d
feat(ingest): Extending file sink to support writing to S3 (#14160)
Co-authored-by: John Joyce <john@Mac-2290.lan>
Co-authored-by: John Joyce <john@Mac-2536.lan>
2025-07-28 09:18:44 -07:00
Sergio Gómez Villamor
7a27e8e7fd
fix(ingest/sql-queries): handle string timestamps and empty table names (#14205)
Co-authored-by: Claude <noreply@anthropic.com>
2025-07-25 22:33:49 +02:00
Harshal Sheth
9490eba8dc
fix(sdk): make Filter type parsing more permissive (#14212) 2025-07-25 09:37:23 -07:00
Sergio Gómez Villamor
e24fc39966
feat(sql-parsing-aggregator): add override_dialect for observed query (#14201) 2025-07-25 08:21:44 +02:00
mihai103
8e1fbaffad
Adding support for domains when upserting datasets with the cli (#14152)
Co-authored-by: Mihai Ciocirdel <mihai.ciocirdel@swisscom.com>
Co-authored-by: Hyejin Yoon <0327jane@gmail.com>
2025-07-24 17:35:17 -04:00
Harshal Sheth
446f31f1bd
feat(sdk): expose view definition in dataset sdk (#14197) 2025-07-24 10:14:30 -07:00
Jonny Dixon
836e301e3b
feat(ingestion/grafana): Add datasets and charts to dashboards with lineage and tags. Lineage back to source (#12417)
Co-authored-by: Sergio Gómez Villamor <sgomezvillamor@gmail.com>
2025-07-24 17:05:09 +01:00
Harshal Sheth
658930e887
feat(ingest/dbt): add drop_duplicate_sources flag (#14161) 2025-07-23 22:05:35 -07:00
Jonny Dixon
a679ffca73
fix(ingestion/abs): align add_partition_columns_to_schema functionality with working s3/gcs functionality in abs connector (#14153) 2025-07-22 18:36:17 +01:00
Harshal Sheth
648983a710
feat(sdk): add data products to suggested entity type filters (#14159) 2025-07-22 08:23:59 -07:00
Harshal Sheth
1d65c454df
feat(sdk): add container filter to search sdk (#14130) 2025-07-21 12:05:51 -07:00
Harshal Sheth
c92aa0842e
feat(sdk): use discriminated unions for Filter types (#14127) 2025-07-21 08:23:07 -07:00
Aseem Bansal
14e7d367e3
feat(ingest/mock-data): misc changes (#14133) 2025-07-18 14:20:23 +05:30
Aseem Bansal
9e57fa7f9f
feat(mock-data): allow throwing uncaught exception, errors, warning (#14105) 2025-07-17 14:42:58 +05:30
Tamas Nemeth
d63a35ab00
fix(ingest/athena): Make Athena simple column v1 conversion optional (#14112)
Co-authored-by: Sergio Gómez Villamor <sgomezvillamor@gmail.com>
2025-07-17 09:15:48 +02:00
Jonny Dixon
010da3c480
fix(ingestion/abs): updated deprecated azure sdk parameter to supported parameter and uri prefix support of https (#14106) 2025-07-16 18:44:31 +01:00
Sergio Gómez Villamor
6c82d70642
fix(snowflake): correct additional_database_names OR logic in access_history filter (#14104)
Co-authored-by: Claude <noreply@anthropic.com>
2025-07-16 12:10:12 +02:00
Chakru
3ab354eac4
fix(quickstart): update commit hash with resolved quickstart profile (#14089) 2025-07-15 22:19:29 +05:30
Sergio Gómez Villamor
7c8c55bc4e
feat(snowflake): add database pattern filtering to access history query for improved performance (#14021)
Co-authored-by: Claude <noreply@anthropic.com>
2025-07-14 18:15:13 +02:00
Aseem Bansal
8d332b68b8
fix(ingest): better experimental contract (#14071) 2025-07-14 19:45:55 +05:30
Aseem Bansal
9e205ccc53
fix(ingest/report): fix bug w.r.t. aspect count (#14070) 2025-07-14 16:17:16 +05:30
RyanHolstien
c694168991
feat(rest_emitter): support delete emit mcp (#14033) 2025-07-11 15:41:42 -05:00
Harshal Sheth
84ad365b37
chore(ingest/teradata): fix lint errors (#14055) 2025-07-11 13:01:10 -07:00
Harshal Sheth
47d40dc1fd
fix(sdk): deduplicate entity types in search sdk (#14041) 2025-07-11 11:42:33 -07:00
Tamas Nemeth
3c748c1b3c
fix(ingest/teradata): Teradata perf improvements (#13967) 2025-07-11 17:47:11 +02:00
Chakru
21726bc334
feat(quickstart): migrate to compose profile and other improv. (#13566) 2025-07-11 20:12:10 +05:30
skrydal
c0d651362b
feat(ingestion): Introduce query dedup strategy (#13915) 2025-07-11 15:51:41 +02:00
skrydal
f0a6e016e8
fix(ingestion/snowflake): Address diamond lineage problem + performance improvements (#13918) 2025-07-11 15:50:52 +02:00
Harshal Sheth
8758e85739
refactor(ingest/tableau): cleanup duplicate lineage calls (#14018) 2025-07-10 21:06:09 -07:00
Harshal Sheth
aa4f8aca3e
chore(ingest): simplify MetadataChangeProposalWrapper usages (#14019)
Co-authored-by: Cursor Agent <cursoragent@cursor.com>
2025-07-10 08:45:01 -07:00
Sergio Gómez Villamor
904a43e1c9
fix(ingest/avro): expand record fields consistently in arrays, maps, and direct references (#13961)
Co-authored-by: Claude <noreply@anthropic.com>
2025-07-10 11:00:59 +02:00
Tamas Nemeth
b1354abcba
fix(ingest/gcs): Fix GCS URI mismatch causing file filtering during ingestion (#14006) 2025-07-09 22:22:22 +02:00
Aseem Bansal
4509add782
chore(python): drop python 3.8 follow ups (#14004) 2025-07-09 13:24:33 +05:30
Aseem Bansal
c0482c0e85
feat(ingest): add better urn samples in report (#13977) 2025-07-09 12:49:08 +05:30
Harshal Sheth
6d2796a1c1
feat(ingest/snowflake): support TLL for stored procs (#13890) 2025-07-07 14:59:42 -07:00
Tamas Nemeth
5b8d4bad7c
feat(ingest/athena): Iceberg partition columns extraction (#13607) 2025-07-07 14:46:40 +02:00
Tamas Nemeth
e331e807c6
fix(ingest/s3): Fix ingestion when path_spec had a wildcard character in the path (#13940) 2025-07-07 13:12:21 +02:00
Aseem Bansal
a7c5895d98
feat(ingest): add aspects by subtype in report, telemetry (#13921) 2025-07-03 17:07:39 +05:30
Sergio Gómez Villamor
1561a6c8ca
feat(hex): add retry logic with exponential backoff for 429 rate limiting (#13905)
Co-authored-by: Claude <noreply@anthropic.com>
2025-07-03 11:12:31 +02:00
Tamas Nemeth
eaf2bf6dec
feat(ingest/kafka-connect): Add more connectors the regexp transformation support (#13748) 2025-07-03 08:57:50 +02:00
Hyejin Yoon
fedbfa3f7e
feat: update fivetran connector with new sdk (#13859) 2025-07-02 21:47:02 +09:00
sleeperdeep
70a39b70f2
fix(ingest): support ownership types in AddDatasetOwnership transformer (#13081) 2025-07-01 11:34:29 -07:00
Aseem Bansal
92784ec3a4
feat(ingest/lineage): generate static json lineage file (#13906) 2025-07-01 20:51:18 +05:30
Aseem Bansal
03309b7ffa
feat(mock-data-source): add first seen urn in report (#13889) 2025-06-30 15:15:50 +05:30
Harshal Sheth
05d029d690
feat(ingest/snowflake): add extra_info for snowflake (#13539) 2025-06-27 12:23:28 -07:00