595 Commits

Author SHA1 Message Date
Harshal Sheth
a4bce6af1c
feat(ingest): add snowflake-queries source (#10835) 2024-07-12 15:08:51 -07:00
Harshal Sheth
89bda5bdd9
fix(ingest/redshift): handle multiline alter table commands (#10727) 2024-07-11 16:32:37 -07:00
Harshal Sheth
44930dfd1e
fix(ingest/looker): add missing dependency (#10876) 2024-07-11 13:25:13 -07:00
sid-acryl
609847fa59
fix(ingestion/looker): Add sqlglot dependency and remove unused sqlparser (#10874) 2024-07-09 12:51:08 -07:00
Tamas Nemeth
1c8e8c32b5
chore(ingest): Mypy 1.10.1 pin (#10867) 2024-07-08 19:43:45 +02:00
sid-acryl
43bac365bc
fix(ingestion/lookml): liquid template resolution and view-to-view cll (#10542) 2024-07-08 09:26:39 -07:00
Mayuri Nehate
54b9d98177
ci(ingest): pin dask dependency for feast (#10865) 2024-07-08 18:52:11 +05:30
cburroughs
c24d7805bd
chore(ingest): update acryl-datahub-classify version (#10844) 2024-07-03 16:32:46 -07:00
sagar-salvi-apptware
640d42dc65
feat(ingest/transformer): tags to terms transformer (#10758)
Co-authored-by: Aseem Bansal <asmbansal2@gmail.com>
2024-07-02 15:30:05 +05:30
Oleksandr Simonchuk
8b4e302881
feat(ingest): add and use file system abstraction in file source (#8415)
Co-authored-by: oleksandrsimonchuk <oleksandr.si@appsflyer.com>
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
Co-authored-by: Tamas Nemeth <treff7es@gmail.com>
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
2024-07-01 10:47:07 -07:00
Harshal Sheth
45a8cc9ecf
feat(ingest): bump sqlglot (#10770) 2024-06-26 14:31:57 -07:00
Harshal Sheth
0816e7590c
fix(ingest): pin numpy<2 for classification (#10725) 2024-06-17 22:50:08 +02:00
Rajasekhar-Vuppala
a2d8a099d8
feat(ingest/vertica): use 3 part naming (#10636) 2024-06-14 17:04:55 -07:00
Harshal Sheth
62c6704f69
feat(ingest/snowflake): refactor + parallel schema extraction (#10653) 2024-06-14 13:23:07 -07:00
sagar-salvi-apptware
d69966074a
fix(ingest/bigquery): Map BigQuery policy tags to datahub column-level tags (#10669) 2024-06-14 16:43:12 +05:30
Harshal Sheth
6329153e36
fix(ingest): fix redshift query urns + reduce memory usage (#10691) 2024-06-13 11:27:06 -07:00
Harshal Sheth
25d48d2d09
fix(ingest/fivetran): fix fivetran bigquery support (#10693) 2024-06-13 11:26:47 -07:00
Harshal Sheth
894e25680b
feat(ingest): add snowflake-summary source (#10642) 2024-06-12 10:04:22 -07:00
Andrew Sikowitz
46dbb10940
docs(ingest): Rename csv / s3 / file source and sink (#10675) 2024-06-11 11:44:13 -07:00
Eric L (CCCS)
c04b3bc2e4
fix(ingest/iceberg): update iceberg source to support newer versions of pyiceberg at runtime (#10614) 2024-06-04 09:45:29 -07:00
Harshal Sheth
2e14f70864
test(ingest/sql): refactor CLL generator + add tests (#10580) 2024-05-23 18:11:22 -07:00
Harshal Sheth
2b6c78b776
feat(ingest): bump acryl-sqlglot dep (#10554) 2024-05-21 23:52:33 +02:00
sagar-salvi-apptware
5fbf781558
fix(ingest/transformer): Add dataset domains based on tags using transformer (#10458) 2024-05-15 14:13:03 +05:30
skrydal
9debbdd4a9
fix(ingestion): Explicitly set requirement on snowflake-connector-python to be newer or equal to 3.4.0 (#10445)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2024-05-08 14:12:30 +05:30
Tamas Nemeth
897e648eae
fix(ingest/mode): Improve query lineage (#10284)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2024-05-07 22:02:37 -07:00
Mayuri Nehate
f6627efe71
fix(ingest/snowflake): add more reporting for usage aggregation, handle lineage errors (#10279)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2024-05-07 08:42:39 -07:00
dushayntAW
c00ddb2a0d
fix(ingest/transformer): new transformer to clean user URN for datasetUsageStatistics aspect (#10398) 2024-05-03 13:24:48 +05:30
mrjefflewis
e4cf4de3e0
feat(ingest/mssql): improve docs on using odbc (#10370)
Co-authored-by: Jeff Lewis <jeff.lewis@acryl.io>
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2024-04-24 13:00:45 -05:00
Harshal Sheth
08731055ba
feat(ingest): bump acryl-sqlglot dep (#10343) 2024-04-20 08:37:22 +02:00
Harshal Sheth
d1cc0af314
feat(ingest/classify): add pip dependency (#10335) 2024-04-19 12:52:51 -07:00
david-leifker
25ba1e1a8b
chore(pyiceburg): set minimum version (#10318) 2024-04-17 11:47:13 -07:00
Harshal Sheth
3cdc462a7b
fix(ingest): disallow src.* imports, fix powerbi/sigma (#10292) 2024-04-16 15:04:51 -07:00
Tamas Nemeth
d463a16b49
chore(ingest/presto-on-hive): Renaming presto-on-hive to hive-metastore source (#10278) 2024-04-16 23:35:16 +02:00
Shubham Jagtap
90c1249e7d
feat(ingest/sigma): Sigma connector integration (#10037)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2024-04-15 20:18:31 -07:00
dushayntAW
f860f7907d
fix(ingest/transformer): replace externalUrl in dataset properties (#10281) 2024-04-15 20:14:42 +05:30
Tamas Nemeth
8ed87d6a90
feat(ingest/mode): Mode improvements (#10273) 2024-04-12 09:01:16 +02:00
Dotan Mor
fa0c1b3fa9
feat(ingest/cockroachdb): add cockroachdb ingestion (#10226) 2024-04-09 18:36:51 -07:00
Mayuri Nehate
0949d8ca8b
fix(ingest/databricks): pin pandas for databricks ingestion (#10204) 2024-04-04 09:36:44 -07:00
Harshal Sheth
db33c8646a
fix(ingest): add classification dep for dynamodb (#10162) 2024-04-02 09:28:43 -05:00
Harshal Sheth
61c21e1a73
feat(ingest): bump sqlglot dep (#10144)
Co-authored-by: Tamas Nemeth <treff7es@gmail.com>
2024-03-28 17:53:28 -07:00
Mayuri Nehate
0361f2463d
feat(ingest/dynamodb): add support for classification (#10138) 2024-03-27 11:28:58 -07:00
Harshal Sheth
e97e6822ad
feat(ingest): loosen pyarrow dep (#10141) 2024-03-26 12:39:57 -07:00
dushayntAW
dd502ae662
fix(ingest): added new transformer to cleanup suffix/prefix in owner URN (#10067) 2024-03-22 15:23:03 +05:30
Aseem Bansal
9659d60867
feat(ingest/datahub-gc): gc source to cleanup things (#10085) 2024-03-21 15:21:17 +05:30
Mayuri Nehate
77c72dad01
feat(ingest): add classification to bigquery, redshift (#10031) 2024-03-13 22:45:28 -07:00
Harshal Sheth
b0163c4885
feat(ingest): utilities for query logs (#10036) 2024-03-12 23:20:46 -07:00
Mayuri Nehate
2de0e62ac4
feat(ingest): add classification for sql sources (#10013)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2024-03-12 09:23:20 -07:00
skrydal
2265ae9257
feat(ingestion): Support for Server-less Redshift (#9998)
Co-authored-by: Tamas Nemeth <treff7es@gmail.com>
2024-03-12 09:32:20 +01:00
Harshal Sheth
b6956f9a5c
feat(ingest): update sqlglot fork (#10022) 2024-03-11 15:22:30 +01:00
Harshal Sheth
d987707cde
feat(ingest): speed up to_obj() and validate() (#9969) 2024-03-04 13:31:39 +01:00