3975 Commits

Author SHA1 Message Date
Mayuri Nehate
983a8ca675
feat(classification): support for regex based custom infotypes (#8177) 2023-06-06 14:41:51 +02:00
Mayuri Nehate
9fc1ef0111
fix(ingest/snowflake): set use_quoted_name to profile lowercase tables (#8168)
Co-authored-by: Tamas Nemeth <treff7es@gmail.com>
2023-06-06 14:40:58 +02:00
Tamas Nemeth
279f9eb455
fix(ingest/kafka): Fixing error printing in Kafka properties get call (#8145) 2023-06-06 11:17:14 +02:00
Andrew Sikowitz
fcaaedd83f
feat(ingest/snowflake): Deprecate legacy lineage and optimize query history joins (#8176) 2023-06-06 10:14:28 +02:00
Adrián Pertíñez
743439c11d
feat(ingest/bigquery_v2): enable platform instance using project id (#8142)
Co-authored-by: Andrew Sikowitz <andrew.sikowitz@acryl.io>
2023-06-05 15:17:40 -07:00
Andrew Sikowitz
3022c2d12e
feat(ingest/unity): Add qualified name to dataset properties (#8164) 2023-06-05 11:20:13 -07:00
mohdsiddique
e7d1b900ec
fix(ingestion/looker): set project-name for imported_projects views (#8086) 2023-06-02 17:04:34 -07:00
Andrew Sikowitz
802c91a0a7
feat(ingest): Create Browse Paths V2 under flag (#8120) 2023-06-02 12:50:38 -07:00
Mayuri Nehate
fe1ff71318
fix(ingest/nifi): allow nifi site url with context path (#8156) 2023-06-02 15:43:33 +02:00
Mayuri Nehate
fb44cbdb15
feat(ingest/snowflake): support for more operation types (#8158) 2023-06-01 15:47:39 +02:00
Harshal Sheth
690ed083d9
feat(ingest): add more fail-safes to stateful ingestion (#8111) 2023-05-31 18:49:48 -07:00
Harshal Sheth
60dd9ef187
fix(ingest): remove original_table_name logic in sql source (#8130) 2023-05-31 15:58:09 -07:00
Harshal Sheth
a29b576daa
fix(ingest/json-schema): handle property inheritance in unions (#8121) 2023-05-30 22:59:28 -07:00
Mayuri Nehate
abc2f85331
docs(ingest/nifi): fix broken links (#8143) 2023-05-30 11:04:15 -07:00
Tamas Nemeth
d50a99935b
fix(ingest/s3): Path spec aware folder traversal (#8095) 2023-05-30 16:20:49 +02:00
Gabe Lyons
ada6ea5a45
docs(csv-enricher): add example csv file & recipe (#8141) 2023-05-29 19:02:26 +05:30
Serhii Dimchenko
6adb496581
feat: add dbt-athena adapter support for column types mapping (#8116)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2023-05-27 09:22:55 -05:00
Pedro Silva
ea92b86e6a
feat(custom ownership): Adds Custom ownership types as a top level entity (#8045)
Co-authored-by: david-leifker <114954101+david-leifker@users.noreply.github.com>
2023-05-26 17:54:09 +01:00
Mayuri Nehate
3e727c5e9c
docs(glue): fix broken link (#8135)
Co-authored-by: Pedro Silva <pedro@acryl.io>
2023-05-26 09:25:59 -05:00
Aseem Bansal
96f364802b
feat(lineage source): add fine grained lineage support (#7904) 2023-05-26 17:09:32 +05:30
Mayuri Nehate
f2c53a3660
feat(ingest/glue): report glue job lineage failures, update doc (#8126) 2023-05-26 10:30:03 +02:00
Andrew Sikowitz
668a09ffd4
fix(ingest/bigquery): Deduplicate LogEntries based on insertId, timestamp, logName (#8132) 2023-05-25 17:53:47 -07:00
Harshal Sheth
2d442161c4
ci(ingest/kafka): improve kafka integration test reliability (#8085) 2023-05-25 15:40:56 -07:00
Tamas Nemeth
b216caa83b
fix(ingest/redshift): Fixing Redshift subtypes (#8125) 2023-05-25 19:45:07 +02:00
Vinícius Mello
7059874dec
feat(ingest/bigquery): Add BigQuery Views lineage extraction from Google Data Catalog API (#8100) 2023-05-25 08:37:46 -07:00
Andrew Sikowitz
d3cd4dbb0c
feat(ingest/unity): Allow ingestion without metastore admin role (#8091)
- Adds more detailed docs and connection test
- Fixes empty username queries
2023-05-24 15:36:22 -07:00
Tamas Nemeth
fb087c5e35
fix(ingest/redshift):fixing schema filter (#8119) 2023-05-24 22:35:28 +00:00
Mayuri Nehate
84270bcac8
feat(ingest/nifi): kerberos authentication (#8097)
Co-authored-by: david-leifker <114954101+david-leifker@users.noreply.github.com>
Co-authored-by: Andrew Sikowitz <andrew.sikowitz@acryl.io>
Co-authored-by: Indy Prentice <iprentic@users.noreply.github.com>
2023-05-24 15:09:01 -07:00
Andrew Sikowitz
fdbc4de695
refactor(ingest): Call source_helpers via new WorkUnitProcessors in base Source (#8101) 2023-05-24 13:36:19 -07:00
Amanda Hernando
0e0d8934ea
feat(ingest): Add GenericAspectTransformer (#7994)
Co-authored-by: Adrián Pertíñez <khurzak92@gmail.com>
2023-05-24 13:31:33 -07:00
Mayuri Nehate
b3d80e57e8
feat(ingest/bigquery): usage for views (#8046)
Co-authored-by: Tamas Nemeth <treff7es@gmail.com>
Co-authored-by: Andrew Sikowitz <andrew.sikowitz@acryl.io>
2023-05-24 09:48:58 -07:00
Mayuri Nehate
798ce3d6c8
feat(classification): configurable sample size (#8096)
Co-authored-by: david-leifker <114954101+david-leifker@users.noreply.github.com>
2023-05-24 00:07:01 -07:00
Andrew Sikowitz
8357fc8d64
feat(ingest): Browse Path v2 helper (#8012) 2023-05-23 23:46:46 -07:00
Tamas Nemeth
287a292fe7
fix(ingest/redshift): Making Redshift source more verbose (#8109) 2023-05-24 08:19:37 +02:00
Harshal Sheth
b0f8c3de1e
refactor(ingest): simplify stateful ingestion provider interface (#8104)
Co-authored-by: Tamas Nemeth <treff7es@gmail.com>
2023-05-23 12:57:57 -07:00
Harshal Sheth
afd65e16fb
feat(cli): delete cli v2 (#8068) 2023-05-23 14:43:44 -05:00
Indy Prentice
0612c70a63
fix(sample data): Update timestamps in bootstrap_mce.json to more recent (#8103)
Co-authored-by: Indy Prentice <indy@Indys-MBP.hsd1.ca.comcast.net>
2023-05-23 14:09:27 -05:00
Harshal Sheth
4873a32e4a
fix(ingest): emitter bug fixes (#8093)
Co-authored-by: Tamas Nemeth <treff7es@gmail.com>
2023-05-23 12:04:16 -07:00
Tamas Nemeth
4ca7a9b50e
fix(ingest/build): setting typing extension <4.6.0 because it breaks tests (#8108) 2023-05-23 18:55:28 +05:30
Tamas Nemeth
f8be9f6aee
feat(ingest/s3): type aware directory sorting (#8089) 2023-05-23 08:59:46 +02:00
Harshal Sheth
fb9a35b58a
feat(cli): support datahub ingest mcps (#7871) 2023-05-22 17:48:51 -07:00
Andrew Sikowitz
a43903bf6d
refactor(ingest): Auto report workunits (#8061) 2023-05-22 17:06:31 -07:00
Harshal Sheth
4e9c652707
feat(ingest): add env to container properties (#8027) 2023-05-22 12:07:16 -07:00
Shubham Jagtap
e6371c8e94
fix(ingestion/powerbi): skip erroneous pages of a report (#8021)
Co-authored-by: mohdsiddique <mohdsiddiquebagwan@gmail.com>
Co-authored-by: MohdSiddiqueBagwan <mohdsiddique.bagwan@gslab.com>
2023-05-19 18:02:55 -07:00
Andrew Sikowitz
2ca362e96c
fix(ingest/delta-lake): Walk through directory structure with full path; reduce resource creation (#8072) 2023-05-19 12:42:48 -07:00
Harshal Sheth
d79b28c147
feat(sdk): better auth error messages in the rest emitter (#8025) 2023-05-19 16:16:03 +02:00
Harshal Sheth
98bba52c20
test(sdk): move cli tests into the unit dir (#8028) 2023-05-19 16:13:39 +02:00
alplatonov
999b997223
fix(ingest/s3_data_lake)_ingestor_skips_directories_with_similar_prefix (#8078)
Co-authored-by: Tamas Nemeth <treff7es@gmail.com>
2023-05-19 15:32:19 +02:00
Harshal Sheth
bd5f99b6ab
docs(ingest/snowflake): fix authentication type docs (#8059) 2023-05-19 13:19:10 +02:00
Tamas Nemeth
bdd4bc7b92
feat(ingest/s3) - Stateful ingestion and last-updated support (#8022) 2023-05-19 13:10:15 +02:00