Harshal Sheth
fa22b8e17b
feat(ingest): add Airflow TaskFlow example ( #2958 )
2021-07-26 13:09:25 -07:00
Kevin Hu
662017ef17
fix(ingest): patch lookml types and refactor ingestion sources layout ( #2950 )
2021-07-26 13:06:52 -07:00
Harshal Sheth
6135eed757
test(ingestion): fix flaky package discovery test ( #2949 )
2021-07-25 21:15:22 -07:00
Harshal Sheth
7a1ce19d2b
fix(ingestion): resolve test bugs for 3.6 ( #2953 )
2021-07-23 17:07:13 -07:00
Kevin Hu
43e61d9628
feat(models): remove versions from metrics and hyperparams ( #2938 )
2021-07-22 22:18:45 -07:00
Kevin Hu
757abfc6f6
fix(ingest): fix browsepaths and ownership urns ( #2935 )
2021-07-22 15:26:10 -07:00
Kevin Hu
6dbc59940a
feat(ingest): refactor mlModel grouping and add browsepaths ( #2929 )
2021-07-22 13:33:15 -07:00
Harshal Sheth
7dc85a478e
feat(ingest): add make_data_platform_urn
method to builder ( #2926 )
2021-07-22 13:25:07 -07:00
Kevin Hu
736249f0c7
feat(ingest): extract SageMaker metrics, hyperparameters, and external URLs ( #2910 )
2021-07-21 21:30:07 -07:00
Harshal Sheth
5f0b4464f5
fix(ingest): pin snowflake sqlalchemy connector ( #2923 )
2021-07-20 19:28:40 -07:00
Harshal Sheth
416f2a95df
feat(ingest): add support for Oracle spatial types ( #2909 )
2021-07-20 19:13:49 -07:00
aseembansal-gogo
6e1b2cf4f7
feat(ingest): Add option to change name of database for postgres ( #2898 )
2021-07-20 07:01:42 -07:00
Kevin Hu
6abd5e191a
feat(ingest): lineage for SageMaker model endpoints and groups ( #2894 )
2021-07-19 11:30:43 -07:00
Harshal Sheth
45e931dbac
feat(ingest): add can_add_aspect
method for MCEs ( #2905 )
2021-07-17 20:02:07 -07:00
Kevin Hu
44ed2f3684
feat(ingest): extract lineage between SageMaker jobs and models ( #2868 )
2021-07-15 18:56:13 -07:00
Harshal Sheth
8e573fdb31
fix(ingest): fix druid misconfiguration bug ( #2882 )
2021-07-14 20:29:23 -07:00
Harshal Sheth
fe6bfc9685
fix(ingest): default to unlimited query log delay in bigquery-usage ( #2881 )
2021-07-14 20:05:31 -07:00
Harshal Sheth
be39037b10
build(ingest): reduce dependencies for dev install ( #2872 )
2021-07-14 20:02:48 -07:00
Harshal Sheth
220dfe728c
feat(ingest): support dynamic imports for transfomer methods ( #2858 )
2021-07-12 11:03:53 -07:00
Kevin Hu
a2106ca9e8
feat(ingest): SageMaker jobs and models ( #2830 )
2021-07-08 16:16:16 -07:00
Harshal Sheth
2f921d15e8
fix(ingest): avoid setting timestamps unless source system provides it ( #2843 )
2021-07-08 12:11:06 -07:00
Harshal Sheth
d66381451a
feat(ingest): refactor mce comparison and add pytest update golden files option ( #2812 )
2021-06-30 16:53:20 -07:00
Harshal Sheth
e51f86a9de
feat(ingest): support ingesting from multiple snowflake dbs ( #2793 )
2021-06-30 15:54:17 -07:00
Kevin Hu
4da76726d3
feat(ingest): SageMaker feature store ingestion ( #2758 )
2021-06-29 19:43:31 -07:00
Harshal Sheth
79f60d8b8a
refactor(ingest): remove deprecated methods and warn on deprecated import ( #2797 )
2021-06-29 11:43:43 -07:00
Harshal Sheth
19b2a42a00
feat: usage stats (part 2) ( #2762 )
...
Co-authored-by: Gabe Lyons <itsgabelyons@gmail.com>
2021-06-24 19:44:59 -07:00
Harshal Sheth
937f02c6bc
feat: usage stats (part 1) ( #2750 )
...
Co-authored-by: Gabe Lyons <itsgabelyons@gmail.com>
2021-06-24 17:11:00 -07:00
Kevin Hu
a89094da5b
feat(ingest): add support for Glue ETL jobs ( #2687 )
2021-06-22 11:33:22 -07:00
Harshal Sheth
5d93f249b4
feat(ingest): expose additional types to Python via codegen ( #2712 )
2021-06-17 10:04:28 -07:00
Brian
a5f9b8dfe9
feat(entities): add markdown description update/viewer feature in dataset, datajob, dataflow, chart and dashboard, update ui/ux ( #2707 )
2021-06-16 15:48:27 -07:00
Harshal Sheth
1b539220d5
feat(ingest): support Oracle service names ( #2676 )
2021-06-11 17:27:34 -07:00
zack3241
91eb3cc57e
Add get_identifier to hive source in metadata ingestion ( #2667 )
2021-06-09 15:12:17 -07:00
John Joyce
97e9660037
feat: No Code Metadata Modeling ( #2629 )
...
Co-authored-by: Dexter Lee <dexter@acryl.io>
Co-authored-by: Gabe Lyons <itsgabelyons@gmail.com>
Co-authored-by: Shirshanka Das <shirshanka@apache.org>
2021-06-03 13:24:33 -07:00
Harshal Sheth
6b9d0d0129
fix(ingest): include urn as key for kafka emitter ( #2634 )
2021-06-03 11:04:40 -07:00
Thomas Larsson
b512920022
fix(ingestion): improve robustness of glue ingestion source ( #2626 )
...
fixes : #2625
Co-authored-by: thomas.larsson <thomas.larsson@klarna.com>
2021-06-01 11:02:52 -07:00
Harshal Sheth
958fe8ea83
feat(ingest): populate inputDatajobs field in airflow integration ( #2606 )
2021-05-25 22:47:00 -07:00
Harshal Sheth
1d4bcbe4fb
feat(ingest): add dataset tag transformer ( #2580 )
2021-05-18 14:43:43 -07:00
Harshal Sheth
6d875b8241
test(ingest): ensure transformer registry works for aliases ( #2572 )
2021-05-17 15:08:49 -07:00
Harshal Sheth
3dfe3d375b
feat(ingest): add options for Airflow lineage backend ( #2557 )
2021-05-13 20:02:47 -07:00
Harshal Sheth
a671001824
refactor(ingest): move Airflow into datahub_provider
module ( #2521 )
2021-05-12 15:01:11 -07:00
Harshal Sheth
a47400f18e
build(ingest): use gradle in commands + docs ( #2531 )
2021-05-11 19:03:20 -07:00
Harshal Sheth
2811d23e45
feat(ingest): add a transformer for adding ownership ( #2532 )
2021-05-11 17:46:39 -07:00
shakti-garg
8ed14a62e2
feat(business_glossary): add new entity business term and its relationship with dataset and its fields ( #2228 )
...
Co-authored-by: shubham.garg <shubham.garg@thoughtworks.com>
2021-05-10 13:20:23 -07:00
Harshal Sheth
cd588baccb
build(ingest): include package data in sdist ( #2513 )
2021-05-07 15:21:43 -07:00
Harshal Sheth
d0ca3191c9
build(ingest): add metadata-ingestion to gradle build ( #2510 )
2021-05-06 22:10:49 -07:00
Harshal Sheth
7f0656fd5e
fix(ingest): replace ImportError with ModuleNotFoundError ( #2498 )
...
Using the more specific exception will prevent us from accidentally
ignoring errors that should be handled.
2021-05-05 14:05:16 -07:00
Harshal Sheth
9f4de4b20a
fix(ingest): remove datahub.metadata import shortcut ( #2449 )
2021-04-30 21:10:12 -07:00
Harshal Sheth
71933a9f31
test(ingest): rename TestSource -> FakeSource ( #2481 )
2021-04-30 20:54:07 -07:00
Harshal Sheth
e48a74b80a
test(ingest): add test names and IDs using pytest ( #2476 )
2021-04-29 23:18:55 -07:00
Harshal Sheth
50aee5c05a
fix(ingest): support Airflow 1.10.x style lineage in Airflow 2 ( #2455 )
2021-04-26 23:08:43 -07:00