3975 Commits

Author SHA1 Message Date
Tamas Nemeth
2ee1a78f4e
fix(ingestion): Fix for same schema foreign key reference (#3769) 2021-12-20 07:11:55 -08:00
John Joyce
110efa68b9
docs(snowflake): Adding documentation about required Snowflake Privileges (#3770) 2021-12-19 12:01:53 -08:00
Aseem Bansal
2770eb6813
docs(ingestion): Add details of sensitive info handling (#3767) 2021-12-19 11:37:32 -08:00
Tamas Nemeth
599edd22ae
fix(ingest): profiling - disable expensive profilers by default (#3759) 2021-12-17 17:17:25 -08:00
Hyun Min Choi
e73a30dc81
feat(ingest): bigquery - add support for parsing exported audit logs (#3680) 2021-12-17 17:05:21 -08:00
Harshal Sheth
22cef5f897
refactor(test): replace CliRunner with run_datahub_cmd method (#3746) 2021-12-16 20:07:38 -08:00
Ravindra Lanka
bd69e736ba
feat(Stateful Ingestion-2/3): Client side changes for checkpointing a source job state. (#3763)
Co-authored-by: Shirshanka Das <shirshanka@apache.org>
2021-12-16 20:06:33 -08:00
jawadqu
b4eaa9e09e
feat(ingestion): Mode retry wait logic to avoid hitting Mode API rate limit (#3761) 2021-12-16 19:17:12 -08:00
Harshal Sheth
df2cb94ed8
feat(ingest): profiling add upper bound on combined query size (#3762) 2021-12-16 17:34:46 -08:00
Harshal Sheth
c6f3ddf077
build(ingest): restrict latest mypy version (#3756) 2021-12-16 08:48:15 -08:00
Tamas Nemeth
1c1561c497
feat(ingest): skipping emitting metadata for duplicate tables from ingestion (#3753) 2021-12-15 09:15:03 -08:00
Sergio Gómez Villamor
c59c63e90d
feat: enables dbt metadata files to be loaded from URIs (#3739) 2021-12-15 09:11:39 -08:00
Harshal Sheth
adf9d2ead7
test(ingest): fix pytest warning for class starting with Test (#3745) 2021-12-14 22:44:42 -08:00
varunbharill
70d068892e
fix(ingest): snowflake honor allow/deny pattern for lineage and usage. (#3748) 2021-12-14 20:42:29 -08:00
Gabe Lyons
2e2ed34250
feat(ingest): snowflake-usage add knob for direct objects accessed vs base objects accessed (#3744) 2021-12-14 18:07:55 -08:00
John Joyce
40963b0635
fix(ingest): remove data platform isalpha check as it complains about s3 (#3742) 2021-12-14 17:39:15 -08:00
varunbharill
5f80e7a4b2
fix(ingest): changing datahub-graph to use underlying session connection. (#3743) 2021-12-14 17:28:27 -08:00
Gabe Lyons
3fd3313544
Revert "feat(graph): Make Dgraph a proper Neo4j alternative (#3578)" (#3740) 2021-12-14 10:49:03 -08:00
Harshal Sheth
f24440eff3
fix(ingest): count profiled tables separately in report (#3731) 2021-12-13 23:06:49 -08:00
jawadqu
578590e795
feat(ingestion) : Add Metabase Source Connector (#3602)
Co-authored-by: Jawad Qureshi <jqureshi@petabloc.com>
2021-12-13 23:02:47 -08:00
Harshal Sheth
3b7fd24740
feat(ingest): cleanup deprecated datahub.integrations.airflow.* imports (#3732) 2021-12-13 21:34:09 -08:00
Enrico Minack
a6deaabfcf
feat(graph): Make Dgraph a proper Neo4j alternative (#3578) 2021-12-13 12:37:59 -08:00
Aseem Bansal
a20821dc4d
feat(cli): allow to nuke without deleting data in quickstart (#3655) 2021-12-13 12:04:32 -08:00
Serge Travin
83207b37af
fix(superset): handle dashboards without charts (#3713) (#3714) 2021-12-13 11:01:36 -08:00
varunbharill
e55cbd1a34
feat(ingest): adding utilities methods to DataHubGraph class. (#3729) 2021-12-13 10:53:02 -08:00
Luis Angel Vicente Sanchez
9e32776248
fix(ingest): add source.config.connection.schema_registry_config to SchemaRegistryClient creation (#3702)
Co-authored-by: Luis Vicente Sanchez <luis.vicentesanchez@aaqua.live>
2021-12-13 09:32:15 -08:00
Sergio Gómez Villamor
9aa370cef2
fix(ingestion): adds missing port to the connection bootstrap (#3706) 2021-12-13 09:28:04 -08:00
Tamas Nemeth
b9f67c5b65
feat(ingest): trim long sql queries in usage connector (#3725) 2021-12-13 09:16:24 -08:00
mayurinehate
ff3dd162ff
fix(ingest): update trino source get_table_comment to handle not found error (#3712) 2021-12-13 07:40:48 -08:00
Tamas Nemeth
76949ca62d
fix(ingest): get mysql geotypes properly (#3726) 2021-12-13 07:38:22 -08:00
Tamas Nemeth
b5c3015f83
fix(profiler): fix division by zero in pct_unique calculation (#3727) 2021-12-13 07:37:12 -08:00
Tamas Nemeth
0c8c29a6a2
docs(redshift): adding svv_table privilege requirement to redshift source (#3708) 2021-12-10 17:58:42 -08:00
Gabe Lyons
5d8c813684
fix(mode): support definitions in mode query (#3721)
Co-authored-by: Jawad Qureshi <jqureshi@petabloc.com>
2021-12-10 17:56:39 -08:00
Gabe Lyons
8394fc62b0
feat(mode): add mode analytics ingestion source (#3710) 2021-12-09 16:10:08 -08:00
mayurinehate
bd4ecbc7b9
fix(nifi): add env in nifi config, add unit tests, fix nifi doc (#3703) 2021-12-09 13:34:13 -08:00
Tamas Nemeth
eef26fe8ef
docs(redshift): Adding requirements for redshift permissions (#3707) 2021-12-09 13:32:15 -08:00
Tamas Nemeth
b0ebc6b579
fix(ingest): disable query parser failure reporting to datahub in redshift lineage by default (#3699) 2021-12-08 23:56:52 -08:00
Gabe Lyons
0fdd3352bd
feat(ingestion): Add lineage support for Redshift source (#3697)
Co-authored-by: treff7es <treff7es@gmail.com>
2021-12-08 23:41:18 -08:00
Gabe Lyons
46850324d2
fix(ingest): revert accidental change to example recipe file_to_datahub_rest.yml (#3698) 2021-12-08 16:56:08 -08:00
Kevin Hu
15ed3aecf0
refactor(ingest): cli deletion function (#3694) 2021-12-08 16:09:04 -08:00
Gabe Lyons
3cc4e76748
feat(ingest): bigquery - support snapshot and partition tables during ingest & lineage (#3695) 2021-12-08 16:07:21 -08:00
mayurinehate
1d7ec8dba8
feat(ingest): add nifi source (#3681) 2021-12-08 14:56:31 -08:00
ecooklin
1a5121a5ae
feat(ingest): adds glossary terms transformer (#3657) 2021-12-07 21:54:15 -08:00
Harshal Sheth
a9ce255abf
feat(profiler): add query combiner report statistics (#3678) 2021-12-07 21:38:40 -08:00
Gabe Lyons
1c17ba76d2
fix(snowflake): support geo types (#3686)
* Mapping Snowflake's GEOPGRAPHY type to Nulltype as a workaround as SqlAlchemy does not know about it 

Co-authored-by: treff7es <treff7es@gmail.com>
2021-12-07 19:36:10 -08:00
Harshal Sheth
6dffe9b247
refactor(profiling): clean up SQL query analysis (#3674) 2021-12-07 17:43:03 -08:00
Aseem Bansal
aba060a04c
docs(business glossary): fix specification of the file (#3679) 2021-12-07 17:14:27 -08:00
Gabe Lyons
98366cca1f
feat(delete): support deleting by search w/ tokens (#3684) 2021-12-07 14:31:52 -08:00
Aseem Bansal
b3ef5ee489
docs(scheduling): re-arrange docs related to scheduling, lineage, CLI (#3669) 2021-12-07 10:09:59 -08:00
Kevin Hu
d3081f4807
feat(ingestion): anonymous usage stats (#3668) 2021-12-07 08:57:12 -08:00