1646 Commits

Author SHA1 Message Date
John Joyce
97e9660037
feat: No Code Metadata Modeling (#2629)
Co-authored-by: Dexter Lee <dexter@acryl.io>
Co-authored-by: Gabe Lyons <itsgabelyons@gmail.com>
Co-authored-by: Shirshanka Das <shirshanka@apache.org>
2021-06-03 13:24:33 -07:00
Harshal Sheth
6b9d0d0129
fix(ingest): include urn as key for kafka emitter (#2634) 2021-06-03 11:04:40 -07:00
Thomas Larsson
b512920022
fix(ingestion): improve robustness of glue ingestion source (#2626)
fixes: #2625

Co-authored-by: thomas.larsson <thomas.larsson@klarna.com>
2021-06-01 11:02:52 -07:00
Harshal Sheth
958fe8ea83
feat(ingest): populate inputDatajobs field in airflow integration (#2606) 2021-05-25 22:47:00 -07:00
Fredrik Sannholm
1e0b67ce56
feat(ingestion): Fix looker test (#2601) 2021-05-25 11:15:47 -07:00
Harshal Sheth
1d4bcbe4fb
feat(ingest): add dataset tag transformer (#2580) 2021-05-18 14:43:43 -07:00
Harshal Sheth
f310ff9a4a
test(ingest): use different mysql test port (#2573) 2021-05-17 19:45:34 -07:00
Harshal Sheth
6d875b8241
test(ingest): ensure transformer registry works for aliases (#2572) 2021-05-17 15:08:49 -07:00
Gary Lucas
af4f3b9683
fix(dbt): set target platform and load schema (#2483) 2021-05-17 12:22:52 -07:00
Harshal Sheth
f590f11ff3
fix(ingest): check mypy types for test helpers (#2561) 2021-05-17 11:42:12 -07:00
Harshal Sheth
3dfe3d375b
feat(ingest): add options for Airflow lineage backend (#2557) 2021-05-13 20:02:47 -07:00
Kevin Hu
5ab1cbbbb2
feat(ingest): MongoDB schema inference (#2546) 2021-05-13 19:44:33 -07:00
Fredrik Sannholm
133577557c
feat(ingest): Looker view and dashboard ingestion (#2493) 2021-05-13 11:42:53 -07:00
Harshal Sheth
a671001824
refactor(ingest): move Airflow into datahub_provider module (#2521) 2021-05-12 15:01:11 -07:00
Harshal Sheth
a47400f18e
build(ingest): use gradle in commands + docs (#2531) 2021-05-11 19:03:20 -07:00
Harshal Sheth
2811d23e45
feat(ingest): add a transformer for adding ownership (#2532) 2021-05-11 17:46:39 -07:00
shakti-garg
8ed14a62e2
feat(business_glossary): add new entity business term and its relationship with dataset and its fields (#2228)
Co-authored-by: shubham.garg <shubham.garg@thoughtworks.com>
2021-05-10 13:20:23 -07:00
Harshal Sheth
cd588baccb
build(ingest): include package data in sdist (#2513) 2021-05-07 15:21:43 -07:00
Harshal Sheth
d0ca3191c9
build(ingest): add metadata-ingestion to gradle build (#2510) 2021-05-06 22:10:49 -07:00
Harshal Sheth
7f0656fd5e
fix(ingest): replace ImportError with ModuleNotFoundError (#2498)
Using the more specific exception will prevent us from accidentally
ignoring errors that should be handled.
2021-05-05 14:05:16 -07:00
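Note on #2498: the rationale in the commit body above can be illustrated with a minimal sketch. This is not the code from that commit; `load_optional` and both placeholder names are hypothetical, chosen only for illustration. It relies on the standard Python behavior that `ModuleNotFoundError` is a subclass of `ImportError` raised when a module cannot be located, while a plain `ImportError` is raised when the module is found but the import still fails.

```python
import importlib


def load_optional(module_name):
    """Return the module, or None when it cannot be located (hypothetical helper)."""
    try:
        return importlib.import_module(module_name)
    except ModuleNotFoundError:
        # Only "module cannot be found" is treated as an absent optional dependency.
        return None


# A module that is not installed is reported as absent instead of raising.
missing = load_optional("hypothetical_plugin_that_is_not_installed")

# Importing a name that does not exist in an installed module raises a plain
# ImportError. A handler for ModuleNotFoundError does not catch it, so the
# failure stays visible instead of being silently ignored.
try:
    from json import hypothetical_missing_name  # noqa: F401
    swallowed = True
except ModuleNotFoundError:
    swallowed = True
except ImportError:
    swallowed = False
```

A handler written as `except ImportError` would have caught both cases, which is the kind of accidental suppression the commit message describes.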
Harshal Sheth
9f4de4b20a
fix(ingest): remove datahub.metadata import shortcut (#2449) 2021-04-30 21:10:12 -07:00
Harshal Sheth
71933a9f31
test(ingest): rename TestSource -> FakeSource (#2481) 2021-04-30 20:54:07 -07:00
Harshal Sheth
e48a74b80a
test(ingest): add test names and IDs using pytest (#2476) 2021-04-29 23:18:55 -07:00
Harshal Sheth
50aee5c05a
fix(ingest): support Airflow 1.10.x style lineage in Airflow 2 (#2455) 2021-04-26 23:08:43 -07:00
Harshal Sheth
694c4ce8ec
test(ingest): fix mypy issue in schema util test (#2465) 2021-04-26 18:18:37 -07:00
Harshal Sheth
83fdc6417f
feat(ingest): capture default values in Avro schemas (#2463) 2021-04-26 17:07:29 -07:00
Harshal Sheth
d415234a8c
fix(ingest): fields with defaults should be optional (#2461) 2021-04-26 16:45:48 -07:00
Harshal Sheth
c64196e8cf
fix(ingest): setup pyproject.toml (#2446) 2021-04-23 23:50:28 -07:00
Harshal Sheth
a857d3b9d8
fix(ingest): various updates to datahub rest sink (#2445) 2021-04-23 23:48:44 -07:00
Harshal Sheth
663dfe9a7c
fix(ingest): add snowflake warehouse and role to config (#2444) 2021-04-23 23:46:31 -07:00
Gabe Lyons
851e00ba9f
feat(lineage): implement support for datasets, charts and dashboards downstream lineage fetching in a generic way (#2397)
Co-authored-by: Dexter Lee <dexter@acryl.io>
Co-authored-by: Brian <brianwebtek@gmail.com>
Co-authored-by: John Joyce <john@acryl.io>
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2021-04-23 00:18:39 -07:00
adriaanslechten
1295c44615
feat(ingest) LDAP groups ingestion (#2434) 2021-04-22 13:56:30 -07:00
Harshal Sheth
034c33a050
fix(ingest): use entrypoints lib instead of pkg_resources (#2438) 2021-04-22 00:13:47 -07:00
Harshal Sheth
ffe49f061a
fix(ingest): fix chart type enum serialization and add tests for rest emitter (#2429) 2021-04-21 11:34:24 -07:00
Harshal Sheth
79daec29b7
fix(ingest): ensure upstreams in airflow lineage emission are entities (#2427) 2021-04-20 20:44:38 -07:00
Thomas Larsson
7869a8f142
feature(ingestion): Adding the concept of transformers (#2411)
Fixes: #2410

Co-authored-by: thomas.larsson <thomas.larsson@klarna.com>
2021-04-18 11:15:05 -07:00
Harshal Sheth
91a2f69310
fix(ingest): properly handle fieldDiscriminator with restli (#2408) 2021-04-16 09:42:52 -07:00
Harshal Sheth
ffe03e6758
fix(ingest): streamline codegen init methods (#2400) 2021-04-14 19:25:57 -07:00
Harshal Sheth
2af4603e49
fix(ingest): enable mypy disallow_incomplete_defs and disallow_untyped_decorators (#2393) 2021-04-14 13:40:24 -07:00
Harshal Sheth
a11329d5b8
refactor(ingest): update test harness to use a compose file per test (#2392) 2021-04-13 17:30:24 -07:00
Harshal Sheth
41cd52f9e2
feat(ingest): add Airflow lineage backend (#2368) 2021-04-12 17:40:15 -07:00
Thomas Larsson
6610666496
fix(ingestion): dont crash on non-RecordSchema topics (#2372)
Fixes: #2371

Co-authored-by: thomas.larsson <thomas.larsson@klarna.com>
2021-04-09 17:36:01 -07:00
Thomas Larsson
e02a17aecf
fix(ingestion): Support mapping from avro "boolean" and "map" types t… (#2364)
Fixes: #2363

Co-authored-by: thomas.larsson <thomas.larsson@klarna.com>
2021-04-08 14:23:12 -07:00
Thomas Larsson
4215dcd53c
fix(ingestion): properly detect optional fields in avro schemas (#2343)
Co-authored-by: thomas.larsson <thomas.larsson@klarna.com>
2021-04-08 14:00:01 -07:00
Harshal Sheth
bfe345da42
fix(ingest): add test for avro serialization and deserialization (#2351) 2021-04-07 21:30:21 -07:00
Harshal Sheth
cafdd9664a
fix(cli): add --verbose flag for datahub check plugins (#2345) 2021-04-06 15:41:15 -07:00
Harshal Sheth
bd78b84bd3
feat(ingest): start airflow integration + metadata builders (#2331) 2021-04-05 19:11:28 -07:00
Harshal Sheth
c1f3eaed35
fix(ingest): add support for database and table patterns to glue source (#2339) 2021-04-05 17:14:02 -07:00
amy m
759288161c
feat(ingest): adding support for AWS Glue (#2319)
Co-authored-by: Harry Nash <harrywilliamnash@gmail.com>
2021-04-04 11:00:27 -07:00
Gary Lucas
9d00e8bb2e
fix(test): dbt-manifest files (#2330) 2021-04-02 10:59:20 -07:00