4067 Commits

Author SHA1 Message Date
treff7es
7aa5113ae0 Add additional test case 2025-07-07 13:47:14 +02:00
Tamas Nemeth
cce6b9a24e Update metadata-ingestion/src/datahub/ingestion/source/sql/athena.py
Co-authored-by: Sergio Gómez Villamor <sgomezvillamor@gmail.com>
2025-07-07 12:29:07 +01:00
treff7es
5f7a14dd92 Fix comment 2025-07-07 12:29:07 +01:00
Tamas Nemeth
58e88dbe40 Update metadata-ingestion/src/datahub/ingestion/source/sql/athena.py
Co-authored-by: Sergio Gómez Villamor <sgomezvillamor@gmail.com>
2025-07-07 12:29:07 +01:00
treff7es
f1b70c4030 Add test fix 2025-07-07 12:29:07 +01:00
treff7es
3729ea26e7 Fix wrong property name 2025-07-07 12:29:07 +01:00
treff7es
65a28f6fae - Add config option to enable/disable partition extraction using sql parsing
- Couple of improvements on column definition parsing
2025-07-07 12:29:07 +01:00
treff7es
73921d8be6 Update Athena golden files 2025-07-07 12:29:07 +01:00
treff7es
a74cb769c2 Couple of lint fixes and test fixes 2025-07-07 12:29:07 +01:00
treff7es
39a0167e94 Fix column type extraction 2025-07-07 12:29:07 +01:00
treff7es
d17c4809ff Fix formatting 2025-07-07 12:29:07 +01:00
treff7es
41158efd96 Extract athena partition create table statement to extract Iceberg table partitions columns as well. 2025-07-07 12:29:07 +01:00
Tamas Nemeth
e331e807c6
fix(ingest/s3): Fix ingestion when path_spec had a wildcard character in the path (#13940) 2025-07-07 13:12:21 +02:00
Aseem Bansal
ae7e58d8a8
feat(ingest): add modifiers on capability on sources (#13954) 2025-07-07 15:32:55 +05:30
Sergio Gómez Villamor
2388d770dd
docs(dbt): incremental-lineage (#13959) 2025-07-04 11:16:00 +01:00
Michael Maltese
9d79914295
fix(graph/client): use fixed GMS URL consistently (#13945) 2025-07-03 11:57:11 -06:00
Aseem Bansal
ef3446f066
feat(cli): add kafka helper, improve restore indices helper (#13951) 2025-07-03 20:07:25 +05:30
Aseem Bansal
a7c5895d98
feat(ingest): add aspects by subtype in report, telemetry (#13921) 2025-07-03 17:07:39 +05:30
Sergio Gómez Villamor
1561a6c8ca
feat(hex): add retry logic with exponential backoff for 429 rate limiting (#13905)
Co-authored-by: Claude <noreply@anthropic.com>
2025-07-03 11:12:31 +02:00
Tamas Nemeth
eaf2bf6dec
feat(ingest/kafka-connect): Add more connectors to the regexp transformation support (#13748) 2025-07-03 08:57:50 +02:00
Aseem Bansal
661a4ae9f1
fix(ingest): mypy lint python 3.8 (#13939)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2025-07-02 12:14:39 -07:00
Aseem Bansal
7b7038d8d3
refactor(ingest): centralise subtype strings (#13935) 2025-07-02 19:06:49 +05:30
Hyejin Yoon
fedbfa3f7e
feat: update fivetran connector with new sdk (#13859) 2025-07-02 21:47:02 +09:00
Tamas Nemeth
ecc24da0fa
fix(ingest/bigquery): Emit dataset profile when table does not have rows (#13919) 2025-07-02 13:53:12 +02:00
Michael Maltese
f7aa9ba1c2
fix(ingest/unity): don't crash when processing Platform Resources hits an error (#13877) 2025-07-02 08:03:11 +02:00
gabriel-morais-rokos
242fc1e50d
feat(ingestion): add patch structured properties to data product (#13813)
Co-authored-by: Hyejin Yoon <0327jane@gmail.com>
2025-07-02 08:29:10 +09:00
sleeperdeep
70a39b70f2
fix(ingest): support ownership types in AddDatasetOwnership transformer (#13081) 2025-07-01 11:34:29 -07:00
Aseem Bansal
92784ec3a4
feat(ingest/lineage): generate static json lineage file (#13906) 2025-07-01 20:51:18 +05:30
Aseem Bansal
1dab349517
feat(ingest): add source aspect number in telemetry (#13914) 2025-07-01 19:08:23 +05:30
Sergio Gómez Villamor
96bb33bed6
chore: aggregator_generate_timer for snowflake (#13913) 2025-07-01 11:16:28 +02:00
Benjamin Maquet
34b340e3b9
feat(preset): add preset to the list of platforms (#13896) 2025-07-01 10:33:56 +02:00
Aseem Bansal
d567c5d4bb
tests(doc): add tests for doc gen (#13903)
Co-authored-by: Sergio Gómez Villamor <sgomezvillamor@gmail.com>
2025-06-30 20:16:56 +05:30
skrydal
e8b5a60d1d
fix(cli): explicit default field value for optional field (pydantic v2) (#13901) 2025-06-30 14:16:42 +02:00
Aseem Bansal
7345af898d
feat(ingest): generate capability summary (#13881) 2025-06-30 15:16:08 +05:30
Aseem Bansal
03309b7ffa
feat(mock-data-source): add first seen urn in report (#13889) 2025-06-30 15:15:50 +05:30
Harshal Sheth
05d029d690
feat(ingest/snowflake): add extra_info for snowflake (#13539) 2025-06-27 12:23:28 -07:00
sleeperdeep
f3c8bf9cb4
fix(ingest/sql_server): switch to engine inspector instead of connection (#13104) 2025-06-27 11:15:33 -07:00
Aseem Bansal
8c1aaaf02f
doc(ingest/azuread): remove outdated information (#13885) 2025-06-27 19:40:52 +05:30
Aseem Bansal
5759711992
fix(ingest/rest): out-of-date structured report being sent (#13866)
Co-authored-by: Sergio Gómez Villamor <sgomezvillamor@gmail.com>
2025-06-27 15:07:28 +05:30
Hyejin Yoon
827a2308cd
docs: add sdk entity guides (#13870) 2025-06-27 15:28:20 +09:00
Shirshanka Das
820a449b2a
docs(ingestion): preset - update source certification status (#13641) 2025-06-26 11:29:20 -07:00
Sergio Gómez Villamor
7ed7dd25c5
feat(docs): add subscription client docs to assertions tutorial (#13872) 2025-06-26 18:24:01 +02:00
RyanHolstien
52e49eb79b
fix(emitter): fix emitter handling of unicode characters (#13867) 2025-06-26 10:43:55 -05:00
Sergio Gómez Villamor
9a32dd7f7f
feat(dremio): add configurable time range for query lineage extraction, sql aggregator report and fix schema_pattern filtering (#13613)
Co-authored-by: Claude <noreply@anthropic.com>
2025-06-26 15:28:42 +02:00
Michael Maltese
0f0119f219
feat(ingestion): use approx_distinct when profiling Athena and Trino (#13671) 2025-06-25 16:29:26 -04:00
Aseem Bansal
468e62b8cc
deprecate(ingest): match_fully_qualified_names for redshift,bigquery (#13858) 2025-06-25 16:25:29 +05:30
Aseem Bansal
a9e9ac9808
fix(ingest/dremio): fix report, mark usage stats capability (#13851) 2025-06-25 15:09:52 +05:30
Aseem Bansal
3b44ed847c
doc(ingest): mark for usage (#13850) 2025-06-25 14:48:28 +05:30
Aseem Bansal
40452f7c54
docs(ingest): docs for lineage (#13847) 2025-06-25 14:35:39 +05:30
Aseem Bansal
9fe319bc4d
fix(ingest): add fineGrainedLineages as a special case for aspects (#13844) 2025-06-24 16:31:38 +05:30