3089 Commits

Author SHA1 Message Date
Amanda Hernando
0e0d8934ea
feat(ingest): Add GenericAspectTransformer (#7994)
Co-authored-by: Adrián Pertíñez <khurzak92@gmail.com>
2023-05-24 13:31:33 -07:00
Mayuri Nehate
b3d80e57e8
feat(ingest/bigquery): usage for views (#8046)
Co-authored-by: Tamas Nemeth <treff7es@gmail.com>
Co-authored-by: Andrew Sikowitz <andrew.sikowitz@acryl.io>
2023-05-24 09:48:58 -07:00
Mayuri Nehate
798ce3d6c8
feat(classification): configurable sample size (#8096)
Co-authored-by: david-leifker <114954101+david-leifker@users.noreply.github.com>
2023-05-24 00:07:01 -07:00
Andrew Sikowitz
8357fc8d64
feat(ingest): Browse Path v2 helper (#8012) 2023-05-23 23:46:46 -07:00
Tamas Nemeth
287a292fe7
fix(ingest/redshift): Making Redshift source more verbose (#8109) 2023-05-24 08:19:37 +02:00
Harshal Sheth
b0f8c3de1e
refactor(ingest): simplify stateful ingestion provider interface (#8104)
Co-authored-by: Tamas Nemeth <treff7es@gmail.com>
2023-05-23 12:57:57 -07:00
Harshal Sheth
afd65e16fb
feat(cli): delete cli v2 (#8068) 2023-05-23 14:43:44 -05:00
Harshal Sheth
4873a32e4a
fix(ingest): emitter bug fixes (#8093)
Co-authored-by: Tamas Nemeth <treff7es@gmail.com>
2023-05-23 12:04:16 -07:00
Tamas Nemeth
f8be9f6aee
feat(ingest/s3): type aware directory sorting (#8089) 2023-05-23 08:59:46 +02:00
Harshal Sheth
fb9a35b58a
feat(cli): support datahub ingest mcps (#7871) 2023-05-22 17:48:51 -07:00
Andrew Sikowitz
a43903bf6d
refactor(ingest): Auto report workunits (#8061) 2023-05-22 17:06:31 -07:00
Harshal Sheth
4e9c652707
feat(ingest): add env to container properties (#8027) 2023-05-22 12:07:16 -07:00
Shubham Jagtap
e6371c8e94
fix(ingestion/powerbi): skip erroneous pages of a report (#8021)
Co-authored-by: mohdsiddique <mohdsiddiquebagwan@gmail.com>
Co-authored-by: MohdSiddiqueBagwan <mohdsiddique.bagwan@gslab.com>
2023-05-19 18:02:55 -07:00
Andrew Sikowitz
2ca362e96c
fix(ingest/delta-lake): Walk through directory structure with full path; reduce resource creation (#8072) 2023-05-19 12:42:48 -07:00
Harshal Sheth
d79b28c147
feat(sdk): better auth error messages in the rest emitter (#8025) 2023-05-19 16:16:03 +02:00
alplatonov
999b997223
fix(ingest/s3_data_lake)_ingestor_skips_directories_with_similar_prefix (#8078)
Co-authored-by: Tamas Nemeth <treff7es@gmail.com>
2023-05-19 15:32:19 +02:00
Harshal Sheth
bd5f99b6ab
docs(ingest/snowflake): fix authentication type docs (#8059) 2023-05-19 13:19:10 +02:00
Tamas Nemeth
bdd4bc7b92
feat(ingest/s3) - Stateful ingestion and last-updated support (#8022) 2023-05-19 13:10:15 +02:00
Harshal Sheth
48b856ccf4
fix(ingest/looker): don't prematurely show connectivity success (#8070) 2023-05-17 22:27:56 -07:00
Mayuri Nehate
43bb84738b
fix(ingest/tableau): don't use unsupported sql condition field (#8065)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2023-05-17 22:24:53 -07:00
matwalk
f2683203ab
fix(ingest/snowflake): lowercase user urn when using email (#7767)
Co-authored-by: mathilde.oustlant <mathilde.oustlant@cdiscount.com>
2023-05-17 22:09:36 -07:00
Andrew Sikowitz
2e1c3981aa
refactor(ingest): Move source_helpers.py from datahub/utilities -> datahub/api (#8052) 2023-05-17 20:51:06 -07:00
mohdsiddique
ae30be9c25
fix(ingestion/tableau): ingest parent project name in container properties (#8030)
Co-authored-by: MohdSiddiqueBagwan <mohdsiddique.bagwan@gslab.com>
2023-05-17 14:19:41 -07:00
Shubham Jagtap
8cc6606e68
feat(ingestion/kafka): add description in dataset properties (#7974)
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
Co-authored-by: MohdSiddiqueBagwan <mohdsiddique.bagwan@gslab.com>
Co-authored-by: mohdsiddique <mohdsiddiquebagwan@gmail.com>
2023-05-17 11:03:08 -07:00
Shirshanka Das
b3c790aab6
feat: Add support for Data Products (#8039)
Co-authored-by: Chris Collins <chriscollins3456@gmail.com>
2023-05-17 07:17:25 +00:00
Mayuri Nehate
20927e84c9
refractor(classification): simplify classification handler (#8056) 2023-05-16 23:27:02 -07:00
Andrew Sikowitz
7ba2d13087
refactor(ingest): Make get_workunits() return MetadataWorkUnits (#8051)
- Deprecates UsageAggregationClass, /usageStats?action=batchIngest, UsageStatsWorkUnit
- Removes parsing of UsageAggregationClass in file source, all sinks, and WorkUnitRecordExtractor
2023-05-17 00:01:57 -04:00
Mayuri Nehate
a06c5aee2c
fix(ingest/bigquery): update usage audit log query to include create/drop operations (#7995) 2023-05-16 11:58:20 -07:00
Harshal Sheth
e70d0b3859
fix(ingest/dbt): fix dbt subtypes for sources (#8048) 2023-05-16 11:11:26 -07:00
xiphl
d619cc6b3f
feat(ingest): Allow csv-enricher to update more types (#7932)
Co-authored-by: xiphl <xiphlerl9@gmail.com>
2023-05-15 10:38:19 -07:00
Shubham Jagtap
7483d9a4de
fix(ingestion/metabase): metabase connector bigquery lineage fix (#8042)
Co-authored-by: mohdsiddique <mohdsiddiquebagwan@gmail.com>
2023-05-15 14:30:20 +02:00
Harshal Sheth
92af7152c0
feat(ingest): support importing local modules (#8026) 2023-05-13 13:55:12 +05:30
cccs-Dustin
87d32d7377
feat(ingest/superset): add stateful ingestion (#8013) 2023-05-11 21:56:05 -07:00
Harshal Sheth
9a2e990bed
fix(sdk): throw errors on empty gms server urls (#8017) 2023-05-11 21:42:22 -07:00
Harshal Sheth
82afdb2c78
feat(cli): move registry delete to separate subcommand (#7968) 2023-05-11 12:55:46 -07:00
Andrew Sikowitz
afcf462cb1
feat(ingest/unity): Add profiling support (#7976)
- Also adds a new databricks sdk
2023-05-11 10:00:50 -07:00
Mayuri Nehate
294f65fdd7
fix(ingest/snowflake): fix lineage query aggregation for optimised lineage path (#8011)
Co-authored-by: Andrew Sikowitz <andrew.sikowitz@acryl.io>
2023-05-11 09:58:34 -07:00
Mayuri Nehate
eb99012c86
feat(ingest/classification): add classification report (#7925)
Co-authored-by: Tamas Nemeth <treff7es@gmail.com>
2023-05-10 13:01:33 -07:00
Tamas Nemeth
dec54bf098
feat(ingest/s3): Inferring schema from the alphabetically last folder (#8005) 2023-05-10 21:55:05 +02:00
Andrew Sikowitz
a68833769e
refactor(ingest/unity): Use databricks-sdk over databricks-cli for usage query (#7981) 2023-05-09 13:30:11 -07:00
Andrew Sikowitz
44406f7adf
fix(ingest/postgres): Allow specification of initial engine database; set default database to postgres (#7915)
Co-authored-by: Mayuri Nehate <33225191+mayurinehate@users.noreply.github.com>
2023-05-09 11:11:43 -07:00
Mayuri Nehate
c845c75a2d
feat(ingest/snowflake): add config option to specify deny patterns for upstreams (#7962)
Co-authored-by: Andrew Sikowitz <andrew.sikowitz@acryl.io>
2023-05-08 14:13:57 -07:00
Mayuri Nehate
13b1d66170
fix(ingest/bigquery): remove incorrectly used table_pattern filter (#7810)
Co-authored-by: Andrew Sikowitz <andrew.sikowitz@acryl.io>
2023-05-08 10:33:42 -07:00
Mayuri Nehate
0131aeefb1
fix(ingest/unity): improve error message if no scheme in workspace_url (#7951)
Co-authored-by: John Joyce <john@acryl.io>
2023-05-08 10:13:53 -07:00
Mayuri Nehate
fe097f116e
fix(ingest): use with for opened connections (#7908) 2023-05-08 10:12:06 -07:00
Tamas Nemeth
0e69e5a810
fix(ingest/redshift): Enabling autocommit for Redshift connection (#7983) 2023-05-08 10:24:40 +02:00
Tamas Nemeth
75c03d7229
fix(ingestion/redshift) - Fixing schema query (#7975) 2023-05-06 11:20:01 +02:00
Harshal Sheth
721ab5da37
fix(ingest): use certs correctly in rest emitter (#7978) 2023-05-06 11:17:54 +02:00
Harshal Sheth
b074387185
fix(ingest/salesforce): fix lint (#7980) 2023-05-06 11:16:52 +02:00
David Sanchez
42999df06f
fix(ingest/tableau): Add a try catch to LineageRunner parser (#7965) 2023-05-05 12:54:09 -07:00