John Joyce
8c9c696cdd
feat(ingest): Adding an Okta Integration to extract Users, Groups, Group Membership ( #3043 )
2021-08-11 18:49:16 -07:00
rslanka
8844240328
feat: Adding support for nested schemas in ingestion and visualization ( #3079 )
2021-08-11 15:47:18 -07:00
Kevin Hu
752626b6c7
fix(ingest): remove tags from bootstrap_mce since that is deprecated ( #3069 )
2021-08-10 13:32:22 -07:00
Kevin Hu
45247083e3
docs(ingest): link to docs from recipes ( #3058 )
2021-08-09 10:00:47 -07:00
Kevin Hu
32b8fc6108
feat(docs): refactor source and sink ingestion docs ( #3031 )
2021-08-08 13:40:51 -07:00
John Joyce
352a0abf8d
Introducing TimeSeries Aspects + Dataset Profile (Stats) Aspect ( #2983 )
...
Co-authored-by: Dexter Lee <dexter@acryl.io>
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
Co-authored-by: Ravindra Lanka <rlanka@acryl.io>
Co-authored-by: Shirshanka Das <shirshanka@apache.org>
2021-07-30 17:41:03 -07:00
Gabe Lyons
aa253f5b3b
feat(deletes): add run commands (list, show, rollback) to datahub ingest ( #2960 )
2021-07-29 20:04:40 -07:00
Gabe Lyons
d9d03c9bd9
feat(ml-model): adding ml models and ml model groups ( #2961 )
2021-07-28 20:39:05 -07:00
Harshal Sheth
7ab6355b1c
feat(ingest): stricter deserialization for MCE JSONs ( #2976 )
2021-07-28 14:50:21 -07:00
Kevin Hu
a1d1dd4269
feat(docs): tutorial for writing a custom transformer ( #2959 )
2021-07-28 14:38:13 -07:00
Kevin Hu
59f5d8f8c8
feat(ingest): add logo and platform entry for Glue ( #2943 )
2021-07-22 22:20:00 -07:00
Harshal Sheth
3e328d9511
fix(ingest): add quotes to secured kafka yaml config example ( #2911 )
2021-07-22 13:20:00 -07:00
Kevin Hu
859fab8bd3
docs(ingest): update sample recipe and test input for dbt ( #2917 )
2021-07-20 13:28:37 -07:00
Brian
7e2c4d6e44
feat(mlFeatureTable): add graphql, ui/ux for mlFeatureTable, mlFeature, mlPrimaryKey entities ( #2887 )
2021-07-20 11:42:21 -07:00
Kevin Hu
c4e2b9afa2
feat(ingest): add browse paths + dataplatform for Feast features ( #2849 )
2021-07-12 11:05:18 -07:00
Kevin Hu
a2106ca9e8
feat(ingest): SageMaker jobs and models ( #2830 )
2021-07-08 16:16:16 -07:00
Harshal Sheth
288d17f07e
docs(ingest): update links to Kafka docs ( #2834 )
2021-07-06 15:33:52 -07:00
Kevin Hu
4da76726d3
feat(ingest): SageMaker feature store ingestion ( #2758 )
2021-06-29 19:43:31 -07:00
Kevin Hu
7083de9b4f
fix(ingest): use correct platform for MongoDB ingestion ( #2783 )
2021-06-29 10:16:58 -07:00
Kevin Hu
a89094da5b
feat(ingest): add support for Glue ETL jobs ( #2687 )
2021-06-22 11:33:22 -07:00
Kevin Hu
554e1637c5
fix(ingest): types for dbt ( #2716 )
2021-06-22 10:37:08 -07:00
Harshal Sheth
5d93f249b4
feat(ingest): expose additional types to Python via codegen ( #2712 )
2021-06-17 10:04:28 -07:00
Harshal Sheth
26dcece8ec
fix(ingest): use looker
data platform ( #2708 )
2021-06-16 16:58:13 -07:00
Gabe Lyons
523c3bf1d4
feat(aspects): support fetching of versioned aspects ( #2677 )
2021-06-16 10:03:21 -07:00
Kevin Hu
ebdaa0e359
feat(ingest): Feast ingestion integration ( #2605 )
...
* Add feast testing setup
* Init Feast test script
* Add feast to dependencies
* Update feast descriptors
* Sort integrations
* Working feast pytest
* Clean up feast docker-compose file
* Expand Feast tests
* Setup feast classes
* Add continuous and bytes data to feature types
* Update field type mapping
* Add PDLs
* Add MLFeatureSetUrn.java
* Comment out feast setup
* Add snapshot file and update inits
* Init Feast golden files generation
* Clean up Feast ingest
* Feast testing comments
* Yield Feature snapshots
* Fix Feature URN naming
* Update feast MCE
* Update Feature URN prefix
* Add MLEntity
* Update golden files with entities
* Specify feast sources
* Add feast source configs
* Working feast docker ingestion
* List entities and features before adding tables
* Add featureset names
* Remove unused
* Rename feast image
* Update README
* Add env to feast URNs
* Fix URN naming
* Remove redundant URN names
* Fix enum backcompatibility
* Move feast testing to docker
* Move URN generators to mce_builder
* Add source for features
* Switch TypeClass -> enum_type
* Rename source -> sourceDataset
* Add local Feast ingest image builds
* Rename Entity -> MLPrimaryKey
* Restore features and keys for each featureset
* Do not json encode source configs
* Remove old source properties from feature sets
* Regenerate golden file
* Fix race condition with Feast tests
* Exclude unknown source
* Update feature datatype enum
* Update README and fix typos
* Fix Entity typo
* Fix path to local docker image
* Specify feast config and version
* Fix feast env variables
* PR fixes
* Refactor feast ingest constants
* Make feature sources optional for back-compatibility
* Remove unused GCP files
* adding docker publish workflow
* Simplify name+namespace in PrimaryKeys
* adding docker publish workflow
* debug
* final attempt
* final final attempt
* final final final commit
* Switch to published ingestion image
* Update name and namespace in java files
* Rename FeatureSet -> FeatureTable
* Regenerate codegen
* Fix initial generation errors
* Update snapshot jsons
* Regenerated schemas
* Fix URN formats
* Revise builds
* Clean up feast URN builders
* Fix naming typos
* Fix Feature Set -> Feature Table
* Fix comments
* PR fixes
* All you need is Urn
* Regenerate snapshots and update validation
* Add UNKNOWN data type
* URNs for source types
* Add note on docker requirement
* Fix typo
* Reorder aspect unions
* Refactor feast ingest functions
* Update snapshot jsons
* Rebuild
Co-authored-by: Shirshanka Das <shirshanka@apache.org>
2021-06-09 15:07:04 -07:00
John Joyce
97e9660037
feat: No Code Metadata Modeling ( #2629 )
...
Co-authored-by: Dexter Lee <dexter@acryl.io>
Co-authored-by: Gabe Lyons <itsgabelyons@gmail.com>
Co-authored-by: Shirshanka Das <shirshanka@apache.org>
2021-06-03 13:24:33 -07:00
Harshal Sheth
dbe42e07f6
feat(ingest): update bigquery demo data ( #2607 )
2021-05-25 18:52:10 -07:00
Brian
27c4607a56
feat(react): add nativeDataType with tooltip over icon in schema ( #2594 )
2021-05-21 13:33:06 -07:00
Brian
aa8ba1b8e9
feat(dataflow): update dataflow to have datajobs in new tab ( #2579 )
2021-05-18 21:25:42 -07:00
taufiqibrahim
db78373427
feat(ingest): kafka connect metadata ingestion ( #2516 )
2021-05-18 14:45:38 -07:00
Gary Lucas
af4f3b9683
fix(dbt): set target platform and load schema ( #2483 )
2021-05-17 12:22:52 -07:00
Harshal Sheth
8d910b9e39
docs(ingest): improve kafka schema registry config docs ( #2544 )
2021-05-12 17:54:33 -07:00
Harshal Sheth
a671001824
refactor(ingest): move Airflow into datahub_provider
module ( #2521 )
2021-05-12 15:01:11 -07:00
Brian
d7d8870008
feat(usergroup): implement corpgroup in graphql, refactor avatars and ownership in react ( #2519 )
2021-05-11 17:55:45 -07:00
shakti-garg
8ed14a62e2
feat(business_glossary): add new entity business term and its relationship with dataset and its fields ( #2228 )
...
Co-authored-by: shubham.garg <shubham.garg@thoughtworks.com>
2021-05-10 13:20:23 -07:00
Harshal Sheth
50ea58c32d
docs: improve airflow explanations and examples ( #2509 )
2021-05-06 19:12:19 -07:00
Harshal Sheth
7f0656fd5e
fix(ingest): replace ImportError with ModuleNotFoundError ( #2498 )
...
Using the more specific exception will prevent us from accidentally
ignoring errors that should be handled.
2021-05-05 14:05:16 -07:00
Rickard Cardell
48ea19a5f7
feat(react): add custom properties tab on dashboard profile page ( #2439 ) ( #2469 )
2021-04-27 11:26:09 -07:00
Brian
5894365d77
fix(bootstrap): update bootstrap data with more realistic nested schemas ( #2470 )
2021-04-27 11:19:40 -07:00
Gabe Lyons
851e00ba9f
feat(lineage): implement support for datasets, charts and dashboards downstream lineage fetching in a generic way ( #2397 )
...
Co-authored-by: Dexter Lee <dexter@acryl.io>
Co-authored-by: Brian <brianwebtek@gmail.com>
Co-authored-by: John Joyce <john@acryl.io>
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
2021-04-23 00:18:39 -07:00
Gabe Lyons
c7b49de67b
feat(ingest): adding superset ingestion source ( #2425 )
2021-04-22 00:11:54 -07:00
Brian
95b96572c5
feat(react): schema visualization add support for nested structs ( #2423 )
2021-04-20 14:00:10 -07:00
Harshal Sheth
27a672e697
feat(ingest): add Kafka-based emitter example ( #2412 )
2021-04-16 17:54:50 -07:00
Harshal Sheth
91a2f69310
fix(ingest): properly handle fieldDiscriminator with restli ( #2408 )
2021-04-16 09:42:52 -07:00
Gabe Lyons
1e11851a90
feat(sample): adding sample mces for dataflows and datajobs ( #2396 )
2021-04-14 19:27:01 -07:00
Harshal Sheth
2af4603e49
fix(ingest): enable mypy disallow_incomplete_defs
and disallow_untyped_decorators
( #2393 )
2021-04-14 13:40:24 -07:00
Harshal Sheth
41cd52f9e2
feat(ingest): add Airflow lineage backend ( #2368 )
2021-04-12 17:40:15 -07:00
Harshal Sheth
bd78b84bd3
feat(ingest): start airflow integration + metadata builders ( #2331 )
2021-04-05 19:11:28 -07:00
amy m
759288161c
feat(ingest): adding support for AWS Glue ( #2319 )
...
Co-authored-by: Harry Nash <harrywilliamnash@gmail.com>
2021-04-04 11:00:27 -07:00
Gabe Lyons
8ab2630fb6
feat(lineage): expandable lineage visualization for dataset <> dataset lineage ( #2333 )
2021-04-03 11:13:25 -07:00