156 Commits

Author SHA1 Message Date
NiharDoshi99
96e77af8a4
Fix: #1308 Docs and Atlas Connector Improvement (#9072)
* Fix: docs changes

* Fix: changes as per comment

* Fix: atlas connector

* Fix: as per comments

* Fix: as per comments
2022-12-08 12:14:15 +05:30
Teddy
ac77f33b08
Fixes #7447 -- Add freshness metrics to profiler (#9159)
* refactor(profiler): integrated getter func.

Removed metric getter function from their own file.
Added metric getter to their own interface classs.
created dispatch by value methdo to dispatch metric getter func.

* feature(profiler): added systemProfiler schema

* feat(profiler): workflow fresh. & snflk impl.

* feat(profiler): freshness endpoint for put and get

* feat(profiler): added system met. for redshift

* feat(profiler): freshness met. for bigquery

* fix(profiler): keyword not found in func

* feat(profiler): Added sample data for freshness

* fix(profiler): fetch previous day for BQ

* fix(profiler): sonar + data fetching logic

* fix: typo in SystemMetric Class

* fix: linting

* fix: extracted out EntityList class into models.py
2022-12-07 14:33:30 +01:00
Suresh Srinivas
16d97d6f96
Fix typos in JSON schemas and follow standard naming conventions (#9183) 2022-12-06 20:50:34 -08:00
Teddy
f86186d5b0
Fixed time field type for profiler (#9177) 2022-12-06 18:30:48 +01:00
Aniket Katkar
1fd1862ce6
Fix(UI)#8970: Fixed issue with user created Metadata Services not able to be deleted (#9140)
* Fixed issue with user created Metadata Services not able to be deleted

* Fixed default values for Amundsen connection configs

* Added missing break statement in switch block
2022-12-05 20:06:51 +05:30
NiharDoshi99
e0a3f3481b
Fix: aggregate type (#9138) 2022-12-05 14:50:37 +05:30
Milan Bariya
2f9f169de9
Fix: Safer default Mark Deleted Tables (#9065)
* Fix: Safer default Mark Deleted Tables

* Fix: Change Based On Comments

* Fix: Change Based On Comments

* Add: Migration Files

Co-authored-by: Sachin Chaurasiya <sachinchaurasiyachotey87@gmail.com>
2022-12-01 16:26:43 +05:30
Sriharsha Chintalapani
80a4399c1a
add required fields for amundsen connection (#9039)
* add required fields for amundsen connection

* Remove unused modelClass field

Co-authored-by: ulixius9 <mayursingal9@gmail.com>
2022-11-30 13:09:57 -08:00
NiharDoshi99
34aacb20d6
Fix: atlas connector (#9034) 2022-11-29 23:55:05 +05:30
Pere Miquel Brull
dda8f8054e
Fix #8571 - Log IngestionPipeline status change to CloudWatch (#9013)
* Prep monitoring skeleton

* Fix naming

* Push Ingestion Pipeline status to cloudwatch

* Review comments and tests

* import

* Remove singleton object

* Format
2022-11-28 17:45:18 +01:00
Ashish Gupta
36f27e947d
UI : Mask the JWT token in Metadata service (#8842)
* Mask the Jwt token in Metadata service

* minor fix

* fix the icon alignment in Add Ingestion button

* disable the test connection for metadata service OpenMetadata type

* change the css name

* fix unit test issue

* Fix Auth Provider

* Fix add ingestion dropdown icon alignment

* Fix formatting

* Do not encryt JWT auth mechanism with secrets manager

Co-authored-by: Chirag Madlani <12962843+chirag-madlani@users.noreply.github.com>
Co-authored-by: ulixius9 <mayursingal9@gmail.com>
Co-authored-by: Sachin Chaurasiya <sachinchaurasiyachotey87@gmail.com>
Co-authored-by: mohitdeuex <mohit.y@deuexsolutions.com>
Co-authored-by: Nahuel Verdugo Revigliono <nahuel@getcollate.io>
2022-11-22 14:17:40 +00:00
Pere Miquel Brull
f3ac6f53e6
Rename datatype (#8947) 2022-11-22 10:47:17 +00:00
Pere Miquel Brull
83ef3315c3
Fix query log parser (#8920) 2022-11-21 12:55:55 +00:00
Milan Bariya
27997b2f77
Fix: Revert Athena tests and types change (#8909) 2022-11-21 12:35:23 +01:00
Mohit Yadav
8e57787ad4
Fix Url For Email Verification etc (#8904)
* Fix Url For Email Verification etc

* yaml fix
2022-11-21 09:01:05 +00:00
Nahuel
b14a3bc6ca
Review sensitive data not marked as password in the JSON schemas (#8818) 2022-11-18 11:35:08 +01:00
Pere Miquel Brull
a23196d2fa
Centralise Component Config definition (#8863)
* Centralise Component Config def

* Move def to basic
2022-11-18 10:12:46 +01:00
Nahuel
2a77dd0ceb
Add secrets manager documentation back (#8771)
* Add secrets manager documentation back

* Change OM version
2022-11-17 15:38:06 +01:00
NiharDoshi99
03d4011a17
Fix: Changes in bigquey for project-id (#8708) 2022-11-17 14:26:37 +05:30
Miguel Sempere
8c67e0f627
Include links to consumer and registry configurations (#8107) 2022-11-16 19:57:02 -08:00
NiharDoshi99
d783869ed6
Fix: Added metadata service (#8759)
* Fix: Added metadata service

* Fix as per comment

* Fix changes for py-test

* Fix changes for py-test

* Fix py-checkstyle
2022-11-16 06:41:54 +01:00
Ayush Shah
5be0f8ee76
Dl Profiler (#8694)
* DQ commit

* Add DL Profiler

* Fix Ingestion and Profliing pylint checks

* Fix Tests

* PyFormat files

* Fix Tests

* Resolve Comments

* Fix Tests and Format Files

* Resolve Comments

* Fix Pylint and Code smells

* Resolve Comments

* Fix S3 parquet

* Fix Metrics Code Smell
2022-11-15 16:01:10 +01:00
Nahuel
f57fe1d507
Fix#8577: Ingestion Pipelines sensitive passwords fields are stored in secrets store (#8725)
* Services sensitive passwords fields are stored in secrets store

* Update services on application startup in case of changes in the JSON Schema

* Minor changes after manual test

* Bots sensitive passwords fields are stored in secrets store

* Ingestion pipeline sensitive passwords fields are stored in secrets store

* Fix Java style

* Add update for ingestion pipelines on start up

* Add pylint exception

* Fix broken test

* Update UI references
2022-11-15 06:34:49 +01:00
Suresh Srinivas
6a0e326f59
Fixes #7524 Backend: Allow users to rename the Glossary name and Glossary term names (#8734) 2022-11-14 14:09:00 -08:00
Pere Miquel Brull
f33003485e
ES reindex Airflow workflow (#8723)
* Added support for 'Metadata Service' connectors

* Fixed failing unit tests

* Removed delete button for 'Metadata Service' connector of type OpenMetadataServer

* Code optimization for SelectServiceType component

* Fixed errors arrived due to backend changes related to OpenMetadata Services

* init reindex

* Added additional step for adding Metadata to ES configs while creating ingestion pipeline for Metadata Service

* Add ES reindex

* Add ES reindex

* Format

* Format

* Rename service

* Pick service name from IngestionPipeline

* fix ui conflicts

Co-authored-by: Aniket Katkar <aniketkatkar97@gmail.com>
Co-authored-by: Chirag Madlani <12962843+chirag-madlani@users.noreply.github.com>
2022-11-14 18:59:56 +01:00
Sriharsha Chintalapani
895a123f81
Add restore method for metadata resource (#8695) 2022-11-13 23:18:28 -08:00
Sriharsha Chintalapani
9e34cb8440
Fix #8416: Restore Entities after soft delete with all it's old data (#8635) 2022-11-13 10:16:01 -08:00
Mohit Yadav
a20945e13f
Added pipelineTypes (#8692)
* Added SystemServices for creating DataInsight and ElasticSearch Reindexing Pipeling with Sink config

* Add Amundsen config

* Changed SystemService to MetadataService

* Change to MetadataConnection

* Fixed failing test

* Fixed Failing test

* Added Metadata Service Test

* Bootstrapped Openmetadat service

* checkstyle fix

* circular import fix

* circular import fix

* Fix FileNames Issue causing pytest failures and mismatch in connection name due to Java Class

* add dataInsight Pipeline

* Add DataInsight + elasticSearchReindexing Pipeline Types
2022-11-13 21:41:41 +05:30
Pere Miquel Brull
34ba9d95c5
Ingestion Pipeline deployed, Athena tests and pydantic extras (#8682)
* Always run python tests

* Fix athena tests and types

* Update deployed prop in IngestionPipeline

* Fix #8554

* Format

* Use true as default deployed migration

* Remove repeated req

* Pydantic wiggle room
2022-11-13 11:59:43 +01:00
Suresh Srinivas
6306c9ac8d
Fixes #8688 Enforce mutuallyExclusive tag labels (#8689) 2022-11-12 18:44:17 -08:00
Mohit Yadav
4dc24b904f
python fix circular import (#8670)
python fix circular import (#8670)
2022-11-11 17:03:43 +01:00
NiharDoshi99
6da01b13a9
Changed format for s3 staging (#8632) 2022-11-11 12:15:37 +01:00
Nahuel
91d0460b27
Fix#8577: Refactor part of the secrets manager implementation (#8617)
* Removed part of the secrets manager implementation in the server side

* Removed part of the secrets manager implementation in the openmetadata python library side

* Remove deprecated test

* Address pylint checks

* Address new pylint checks

* Address PR comments

* Fix import on airflows apis

* Clear singleton instances for running Ometa secrets manager test
2022-11-11 09:59:15 +01:00
Mohit Yadav
2dbd43a589
Added MetadataServices for creating DataInsight and ElasticSearch Reindex (#8565)
* Added SystemServices for creating DataInsight and ElasticSearch Reindexing Pipeling with Sink config

* Add Amundsen config

* Changed SystemService to MetadataService

* Change to MetadataConnection

* Fixed failing test

* Fixed Failing test

* Added Metadata Service Test

* Bootstrapped Openmetadat service

* checkstyle fix
2022-11-11 13:16:54 +05:30
Teddy
9d594efcb5
Fixes #8473 - Implements KPI Workflow (#8657)
* Added KPI workflow logic

* Update ingestion/src/metadata/data_insight/runner/run_result_registry.py

* ran py format

Co-authored-by: Pere Miquel Brull <peremiquelbrull@gmail.com>
2022-11-11 06:35:33 +01:00
Suresh Srinivas
86c3ae30f1
Fixes #8596 Introduce mutually exclusive tags and glossary terms (#8597) 2022-11-10 16:47:21 -08:00
Teddy
199b342288
Fixes #8135 - Implement partitioning config for profiler (#8623)
* Added logic to handle partitioning config in profiler

* extracted get_partition_details out of workflow classes
2022-11-10 10:54:31 +01:00
Mohit Yadav
b4e5f6ec13
Emailing Task and Test Notification (#8626)
* Emailing Task Notification [WIP]

* Emailing Test Result to Owners of table in case it is enabled
2022-11-10 14:22:44 +05:30
Sachin Chaurasiya
4407c75e28
Fix typo in KPI json schema (#8629) 2022-11-10 10:57:40 +05:30
Mohit Yadav
5bd4e56b3f
Feat kpis (#8584)
* Added Kpi Resource

* Fix typo

* Fixed failing test

* Review Suggestions
2022-11-09 13:06:13 +05:30
Teddy
dcd0bbb566
Fixes 8470 -- Implements ES aggregation for web analytic data (#8566)
* Moved webanalytics type in its own folder

* Added data insight chart api endpoint

* Jave formatting

* Added resource descriptor

* Added metadata entity endpoint

* Added aggregation endpoint for dataInsight

* Fix tag name

* Added logic to ingestion pipeline resource to add ES config info if pipeline type is dataInsight

* added domo to test subpackage

* cleaned up branch by removing commit from issue-8353 that were not merged in main

* Added web analytics data refinement

* Added get_status function

* Added from __futur__ for typing

* Added dailyActiveUsers aggregation

* Added page views entities aggregation and active users aggregation
2022-11-08 17:25:31 +01:00
michizhou
da06d183db
feat: add QuickSight connector (#8442) 2022-11-08 19:54:49 +05:30
Mayur Singal
710e3d785f
Fix #8495: Update pipeline sources for pipeline as edge lineage (#8556) 2022-11-08 14:13:49 +05:30
Teddy
b44972ef60
Fixes #8470 - Implements refinement functions for web analytics events (#8528)
* Moved webanalytics type in its own folder

* Added data insight chart api endpoint

* Jave formatting

* Added resource descriptor

* Added metadata entity endpoint

* Added aggregation endpoint for dataInsight

* Fix tag name

* Added logic to ingestion pipeline resource to add ES config info if pipeline type is dataInsight

* added domo to test subpackage

* cleaned up branch by removing commit from issue-8353 that were not merged in main

* Added web analytics data refinement

* Added get_status function

* Added from __futur__ for typing

* Fixed typos brought up during reviews
2022-11-07 17:08:20 +01:00
Sachin Chaurasiya
833883e24a
Fix data insight Json Schema filename and content (#8534)
* Fix Json Schema filename

* Remove formatting

* FIx required property and change data type from array to object

* added items to array type

Co-authored-by: Teddy Crepineau <teddy.crepineau@gmail.com>
2022-11-04 14:07:36 +05:30
NiharDoshi99
1c1f8660f2
Fixed: dagster to connect to cloud (#8387)
* Fixed: dagster to connect to cloud

* Fixed: changed test_cases

* Fixed: lineage between the assets and jobs

* Changes as per comments

* Fix python checkstyle

* Fix as per comments

* Fix added migration scripts

* minor fixes
2022-11-03 18:19:50 +01:00
Tushar Mittal
6f2c93089c
feat: add SageMaker connector (#8435)
* feat: add sagemaker connector

Signed-off-by: Tushar Mittal <chiragmittal.mittal@gmail.com>

* fix: fix linting errors and update imports

Signed-off-by: Tushar Mittal <chiragmittal.mittal@gmail.com>

* test: add unit tests for sagemake source

Signed-off-by: Tushar Mittal <chiragmittal.mittal@gmail.com>

Signed-off-by: Tushar Mittal <chiragmittal.mittal@gmail.com>
2022-11-03 18:19:20 +01:00
Shailesh Parmar
0f0489a7d4
Feat: Added pipeline as edge from ui (#8503)
* Feat: Added pipeline as edge from ui

* fixed failing unit test

* removed pipeline

* added unit test for util function
2022-11-03 17:46:49 +05:30
Teddy
f4cc2b57a7
Fixes #8353 - Implement Aggregation Endpoint for dataInsight (#8464)
* Moved webanalytics type in its own folder

* Added data insight chart api endpoint

* Jave formatting

* Added resource descriptor

* Added metadata entity endpoint

* Added aggregation endpoint for dataInsight

* Fix tag name

* Added logic to ingestion pipeline resource to add ES config info if pipeline type is dataInsight

* Fix pageViewEvent import issue

* Changed dataType to PERCENTAGE and NUMBER

* Cleaned up PR by removing metadata resource and ingestionPipeline sink logic

* Update openmetadata-service/src/main/resources/json/data/dataInsight/totalEntitiesByTier.json

Co-authored-by: Sachin Chaurasiya <sachinchaurasiyachotey87@gmail.com>

* Update openmetadata-service/src/main/resources/json/data/dataInsight/totalEntitiesByType.json

Co-authored-by: Sachin Chaurasiya <sachinchaurasiyachotey87@gmail.com>

Co-authored-by: Sachin Chaurasiya <sachinchaurasiyachotey87@gmail.com>
2022-11-03 14:51:14 +05:30
Mayur Singal
1386b43607
Fix #6141: Ingestion Pipeline Status Updates (#8216)
* Fix #6141: Ingestion Pipeline Status Updates

* List Pipeline Status API & Improvements

* Rename State field to PipelineState in UI

* Convert Pipeline Status array to single object

* fix braking UI

* Rebase Fixes

* Profiler, TestStuite & DataInsigts Pipeline

* py_format

* fix logs page not loading
add pipelineStatus endpoint

* fix recent run changes

* Fix Tests

* address review comments for ui

* fix failing checks

* fix unit tests

* fix cypress test

* remove loader test as not using it

* wait for API when we click on the tab

* fix cypress waitFor api

* fix failed cypress tests

Co-authored-by: Chirag Madlani <12962843+chirag-madlani@users.noreply.github.com>
2022-11-03 14:37:26 +05:30