455 Commits

Author SHA1 Message Date
Mayuri Nehate
0361f2463d
feat(ingest/dynamodb): add support for classification (#10138) 2024-03-27 11:28:58 -07:00
Harshal Sheth
e97e6822ad
feat(ingest): loosen pyarrow dep (#10141) 2024-03-26 12:39:57 -07:00
dushayntAW
dd502ae662
fix(ingest): added new transformer to cleanup suffix/prefix in owner URN (#10067) 2024-03-22 15:23:03 +05:30
Aseem Bansal
9659d60867
feat(ingest/datahub-gc): gc source to cleanup things (#10085) 2024-03-21 15:21:17 +05:30
Mayuri Nehate
77c72dad01
feat(ingest): add classification to bigquery, redshift (#10031) 2024-03-13 22:45:28 -07:00
Harshal Sheth
b0163c4885
feat(ingest): utilities for query logs (#10036) 2024-03-12 23:20:46 -07:00
Mayuri Nehate
2de0e62ac4
feat(ingest): add classification for sql sources (#10013)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2024-03-12 09:23:20 -07:00
skrydal
2265ae9257
feat(ingestion): Support for Server-less Redshift (#9998)
Co-authored-by: Tamas Nemeth <treff7es@gmail.com>
2024-03-12 09:32:20 +01:00
Harshal Sheth
b6956f9a5c
feat(ingest): update sqlglot fork (#10022) 2024-03-11 15:22:30 +01:00
Harshal Sheth
d987707cde
feat(ingest): speed up to_obj() and validate() (#9969) 2024-03-04 13:31:39 +01:00
Harshal Sheth
1bcc9a67eb
feat(ingest): loosen sentry requirements (#9936) 2024-02-27 12:36:24 -08:00
Harshal Sheth
02f41b74b6
feat(ingest): use uv for python package installs (#9885) 2024-02-26 15:02:47 -08:00
Shubham Jagtap
a1f2216da7
feat(ingest/qlik): Qlik cloud connector integration (#9682)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2024-02-26 11:27:37 -08:00
Harshal Sheth
b01416c11b
feat(ingest): add timeout to sqlglot parser (#9868) 2024-02-23 11:11:02 -08:00
Harshal Sheth
ef3a8148ec
fix(ingest): bump sqlglot + add some debug info to tests (#9867) 2024-02-21 13:07:59 -08:00
Harshal Sheth
24da7a573b
test(ingest): use pytest-random-order for unit tests (#9753) 2024-02-09 18:20:05 -05:00
Aseem Bansal
45511644e7
feat(ingest/slack): source to get user info from slack (#9776) 2024-02-08 14:05:26 +05:30
cburroughs
3b898dbae9
fix(ingest/snowflake): stop pinning transitive spacy dependency (#9768) 2024-02-05 08:28:24 -08:00
cburroughs
eb8017d6eb
feat(ingestion/feast): loosen feast dependency (#9769) 2024-02-02 14:47:26 -08:00
cburroughs
b4d70dd13f
chore(ingest/feast): drop redundant typeguard constraint (#9770) 2024-02-02 14:46:54 -08:00
Harshal Sheth
0e418b527e
fix(ingest): upgrade pytest-docker (#9765) 2024-02-01 16:33:15 -08:00
Harshal Sheth
1498c36875
chore(cli): drop support for python 3.7 (#9731) 2024-01-29 10:50:47 -08:00
Tamas Nemeth
5735eb3a55
fix(ingest/test): Fixing breaking change in moto 5.0 library (#9736) 2024-01-29 19:42:30 +05:30
Harshal Sheth
2bb4b73f98
fix(ingest/metabase): add missing sql parser dep (#9725) 2024-01-26 23:03:16 +01:00
Harshal Sheth
b94d463fe0
feat(ingest/lookml): support complex lookml manifests (#9688) 2024-01-22 16:23:56 -08:00
ethan-cartwright
dfb2f7e596
feat(classifier): Add support for excluding list of exact column names (#9472)
Co-authored-by: Ethan Cartwright <ethan.cartwright@acryl.io>
2024-01-16 19:12:23 -08:00
cburroughs
acfc315b57
build(ingest/feast): upgrade to latest feast version (0.35.x) (#9627) 2024-01-16 17:28:56 -08:00
Shubham Jagtap
0486319bc8
feat(ingestion/fivetran): Add fivetran bigquery destination support (#9531)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2024-01-10 14:18:36 -05:00
Shubham Jagtap
cb8002413e
feat(ingestion/transformer): Add dataset dataproduct transformer (#9491)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2024-01-05 17:03:22 -05:00
Mayuri Nehate
29f2142a2c
feat(databricks): add hive metastore analyze profiling (#9511) 2024-01-02 23:18:43 -05:00
Shubham Jagtap
3635c1c221
feat(ingestion/bigquery): Use sqlglot_lineage for usage and add more perf timers (#9247)
Co-authored-by: Andrew Sikowitz <andrew.sikowitz@acryl.io>
2023-12-28 04:54:26 -05:00
cburroughs
754d881447
build(ingest/feast): upgrade to latest feast version (#9439) 2023-12-27 19:33:41 -05:00
Harshal Sheth
52687f3eea
feat(ingest): support CLL for redshift materialized views with auto refresh (#9508) 2023-12-22 08:18:22 +01:00
Harshal Sheth
ca518d6c78
feat(ingest): remove librdkafka hacks (#9507) 2023-12-21 20:28:45 -05:00
Tamas Nemeth
cfc641f0d0
fix(ingest/databricks): Pinning databricks sdk to not fail on mypy issues (#9500) 2023-12-21 20:32:51 +01:00
Harshal Sheth
ecda3e6187
feat(ingest): pydantic v2 compatibility (#9434) 2023-12-18 18:26:33 -05:00
Tamas Nemeth
03590a1948
fix(ingest/snowflake) - Fixing snowflake url with default region (#9443) 2023-12-18 13:54:31 -05:00
Harshal Sheth
caef6771b8
feat(ingest/redshift): drop repeated operations (#9440) 2023-12-15 15:07:56 -05:00
Mayuri Nehate
aac1c55a14
feat(ingest/unity): enable hive metastore ingestion (#9416) 2023-12-14 10:35:06 -05:00
Aseem Bansal
cff32e9c74
fix(ingest/transformer): correct registration (#9418) 2023-12-08 16:07:00 -08:00
Andrew Sikowitz
27f23ecdd5
feat(ingest/unity): GE Profiling (#8951) 2023-12-06 13:59:23 -05:00
Andrew Sikowitz
806f09ae23
test(ingest/unity): Unity catalog data generation (#8949) 2023-12-05 12:33:00 -05:00
Tamas Nemeth
3ee82590cd
fix(ingest/redshift): Identify materialized views properly + fix connection args support (#9368) 2023-12-05 17:42:29 +01:00
Harshal Sheth
4dd6738ae7
fix(ingest): bump pyhive to fix headers issue (#9328) 2023-11-29 10:25:33 +01:00
terratrue-daniel
ab10e6bc58
feat(ingest/mssql): enable TLS encryption for SQLServer using pytds (#9256) 2023-11-29 13:32:26 +05:30
Andrew Sikowitz
b778def10a
build(ingest/hive): Update thrift pin (#8964)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2023-11-16 17:19:25 -05:00
Harshal Sheth
9d41a8f9f0
fix(ingest): drop redshift-legacy and redshift-usage-legacy sources (#9244) 2023-11-16 13:33:35 -05:00
Harshal Sheth
19aa215068
feat(ingest/dbt): dbt column-level lineage (#8991) 2023-11-13 19:00:30 -05:00
david-leifker
ebd2e2312b
fix(datahub-ingestion): remove old jars, sync pyspark version (#9217) 2023-11-10 22:10:00 -06:00
Shubham Jagtap
a187127ac5
feat(ingestion): file-based state checkpoint provider (#9029) 2023-11-10 14:36:00 -08:00