26 Commits

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| Hyejin Yoon | f986315582 | doc: Acryl to DataHub, datahubproject.io to datahub.com (#13252) Co-authored-by: Jay <159848059+jayacryl@users.noreply.github.com> | 2025-04-28 10:34:33 -04:00 |
| Andrew Sikowitz | d138a64a6a | ci(graphql,workflows): Format .md, .graphql, and workflow .yml files via prettier (#13220) | 2025-04-16 16:55:51 -07:00 |
| Harshal Sheth | 35e2bbb960 | fix(ingest): avoid multiprocessing "fork" start method (#12543) | 2025-02-03 22:19:37 -08:00 |
| skrydal | 30a77c022a | fix(ingestion/classifier): temporary measure to avoid deadlocks for classifier (#12261) | 2025-01-06 22:12:56 +01:00 |
| Mayuri Nehate | ff1c6b895e | feat(ingest/BigQuery): refactor+parallelize dataset metadata extraction (#10884) | 2024-07-16 11:46:42 -07:00 |
| Mayuri Nehate | b6ec52bf5c | docs(classification): correct the casing for full name infotype (#10782) | 2024-07-01 10:45:25 -07:00 |
| Hyejin Yoon | 10cbbd7619 | docs: fix docs utms & slack footer (#10481) | 2024-05-12 16:39:33 -07:00 |
| Hyejin Yoon | 704ca650ad | docs: add slack utm component in docs (#10214) | 2024-04-30 12:54:54 +09:00 |
| Mayuri Nehate | 77c72dad01 | feat(ingest): add classification to bigquery, redshift (#10031) | 2024-03-13 22:45:28 -07:00 |
| ethan-cartwright | dfb2f7e596 | feat(classifier): Add support for excluding list of exact column names (#9472) Co-authored-by: Ethan Cartwright <ethan.cartwright@acryl.io> | 2024-01-16 19:12:23 -08:00 |
| Harshal Sheth | 2031bd4de1 | docs(ingest): update docs on adding stateful ingestion (#9327) | 2023-11-28 18:31:56 -05:00 |
| Pedro Silva | 86e0023a4e | feat(ingestion): Adds more advanced configurations for runtime debugging (#8998) | 2023-10-21 16:20:59 +01:00 |
| Pedro Silva | f6e1312063 | feat(ingestion): Adds support for memory profiling (#8856) Co-authored-by: Harshal Sheth <hsheth2@gmail.com> | 2023-10-12 18:43:14 +01:00 |
| Hyejin Yoon | 31abf383d1 | ci: add markdown-link-check (#8771) | 2023-09-14 11:34:21 +09:00 |
| Mayuri Nehate | cc94ffbf6c | fix(ingest): stateful redundant run skip handler (#8467) | 2023-08-28 15:03:31 +05:30 |
| Hyejin Yoon | 04ecf4f75a | docs(docs): add native versioning (#8714) | 2023-08-25 14:10:13 -07:00 |
| Mayuri Nehate | e67f811034 | feat(classification): allow parallelisation to reduce time (#8368) | 2023-08-02 09:53:39 +05:30 |
| Mayuri Nehate | ac06cf3d3f | feat(classification): configurable minimum values threshold (#8186) | 2023-06-07 21:28:13 -07:00 |
| Harshal Sheth | 690ed083d9 | feat(ingest): add more fail-safes to stateful ingestion (#8111) | 2023-05-31 18:49:48 -07:00 |
| Mayuri Nehate | 798ce3d6c8 | feat(classification): configurable sample size (#8096) Co-authored-by: david-leifker <114954101+david-leifker@users.noreply.github.com> | 2023-05-24 00:07:01 -07:00 |
| Mayuri Nehate | 7a8e36d57d | feat(ingest): refactor classification mixin interface, support new info types (#6545) | 2022-11-25 18:48:42 +05:30 |
| Mayuri Nehate | 22847a987a | feat(ingest): automated term classification for snowflake (#6376) | 2022-11-23 00:43:30 -05:00 |
| Harshal Sheth | 817406eadb | refactor(ingest): simplify stateful ingestion config (#6454) Co-authored-by: Tamas Nemeth <treff7es@gmail.com> | 2022-11-18 00:09:24 -05:00 |
| Ravindra Lanka | b638bcf442 | feat(ingestion): Documentation on adding stateful ingestion use-cases to new sources (#5985) | 2022-09-21 09:02:50 -07:00 |
| Mayuri Nehate | 2558129391 | refactor(snowflake): move snowflake-beta to certified snowflake source (#5923) Co-authored-by: Shirshanka Das <shirshanka@apache.org> | 2022-09-15 09:53:54 -07:00 |
| Shirshanka Das | a9ad138172 | feat(ingest): docs - overhaul source connector docs to make it code driven (#4798) Co-authored-by: MugdhaHardikar-GSLab <mugdha.hardikar@gslab.com> | 2022-05-02 00:18:15 -07:00 |