2535 Commits

Author SHA1 Message Date
Imri Paran
95982b9395
[GEN-356] Use ServiceSpec for loading sources based on connectors (#18322)
* ref(profiler): use di for system profile

- use source classes that can be overridden in system profiles
- use a manifest class instead of factory to specify which class to resolve for connectors
- example usage can be seen in redshift and snowflake

* - added manifests for all custom profilers
- used super() dependency injection in order for system metrics source
- formatting

* - implement spec for all source types
- added docs for the new specification
- added some pylint ignores in the importer module

* remove TYPE_CHECKING in core.py

* - deleted valuedispatch function
- deleted get_system_metrics_by_dialect
- implemented BigQueryProfiler with a system metrics source
- moved import_source_class to BaseSpec

* - removed tests related to the profiler factory

* - reverted start_time
- removed DML_STAT_TO_DML_STATEMENT_MAPPING
- removed unused logger

* - reverted start_time
- removed DML_STAT_TO_DML_STATEMENT_MAPPING
- removed unused logger

* fixed tests

* format

* bigquery system profile e2e tests

* fixed module docstring

* - removed import_side_effects from redshift. we still use it in postgres for the orm conversion maps.
- removed leftover methods

* - tests for BaseSpec
- moved get_class_path to importer

* - moved constructors around to get rid of useless kwargs

* - changed test_system_metric

* - added linage and usage to service_spec
- fixed postgres native lineage test

* add comments on collaborative constructors
2024-10-24 07:47:50 +02:00
Ayush Shah
51347a981a
fixes: Mode test connection returns data in dict instead of json (#18386) 2024-10-24 11:11:39 +05:30
Vijay Lakshmanan
4f2ef6fe5c
Fixes #16263: Fixed Mode dashboard ingestion API call (#18355) 2024-10-23 12:03:08 +05:30
Pere Miquel Brull
5e80ad9fc3
MINOR - Only timeout on main threads (#18341) 2024-10-21 15:18:33 +02:00
Teddy
dcf71aa0ea
fix: lazy load classes from factory method (#18321) 2024-10-21 11:29:03 +02:00
Mayur Singal
a4d62f6d85
MINOR: Add location path to table entity (#18307) 2024-10-21 10:31:27 +05:30
Pere Miquel Brull
c2929e67e6
MINOR - Return TestConnectionResult from test_connection_fn (#18320)
* MINOR - Return TestConnectionResult from test_connection fn

* MINOR - Return TestConnectionResult from test_connection fn
2024-10-18 09:54:07 +02:00
Pere Miquel Brull
5074f6588f
MINOR - Validate app runner init (#18316) 2024-10-18 09:40:06 +02:00
Katarzyna Kałek
c9995eecb6
FIX #18309: fixed task deserialization in Airflow metadata ingestion (#18310)
* fixed task deserialization in Airflow metadata ingestion

* fixed formatting

---------

Co-authored-by: Katarzyna Kałek <kkalek@olx.pl>
2024-10-17 14:51:55 -07:00
Pere Miquel Brull
7012e73d75
GEN-1166 - Improve Ingestion Workflow Error Summary (#18280)
* GEN-1166 - Improve Ingestion Workflow Error Summary

* fix test

* docs

* comments
2024-10-16 18:15:50 +02:00
Pere Miquel Brull
89b6c1c1cd
MINOR - Pass timeout to test connection and return TestConnectionStep (#18236)
* update connections

* MINOR - Pass timeout in test connection and return TestConnectionStep

* format

* comments

* comments
2024-10-16 18:15:28 +02:00
harshsoni2024
4f89dc582b
salesforce table description from label if not through query (#18286) 2024-10-16 12:56:44 +05:30
harshsoni2024
51448452d0
MINOR: Fix pinotdb col. datatype error (#18268) 2024-10-16 11:35:27 +05:30
Mayur Singal
592d7396bc
MINOR: Fix Couchbase columns not fetched (#18284) 2024-10-16 09:53:57 +05:30
Ayush Shah
40bd3bd3fa
Fixes #18186: Quicksight Ingestion Error handled (#18218) 2024-10-16 09:52:07 +05:30
Imri Paran
be82086e25
MINOR: add column case sensitivity parameter (#18115)
* fix(data-quality): table diff

- added handling for case-insensitive columns
- added handling for different numeric types (int/float/Decimal)
- added handling of boolean test case parameters

* add migrations for table diff

* add migrations for table diff

* removed cross type diff for now. it appears to be flaky

* fixed migrations

* use casefold() instead of lower()

* - implemented utils.get_test_case_param_value
- fixed params for case sensitive column

* handle bool test case parameters

* format

* testing

* format

* list -> List

* list -> List

* - change caseSensitiveColumns default to fase
- added migration to stay backward compatible

* - removed migration files
- updated logging message for table diff migration

* changed bool test case parameters default to always be false

* format

* docs: data diff

- added the caseSensitiveColumns parameter

requires: https://github.com/open-metadata/OpenMetadata/pull/18115

* fixed test_get_bool_test_case_param
2024-10-15 16:29:43 +02:00
Onkar Ravgan
e6705f25b3
fixed dbt tag name (#18273) 2024-10-15 16:43:03 +05:30
harshsoni2024
eb49d7a5bc
fix query for mysql con. (#18272) 2024-10-15 14:03:49 +05:30
Mayur Singal
8322c0f684
Fix #17963: Fix PinotDB Ingestion (#18266)
* Fix #17963: Fix PinotDB Ingestion

* fix conn args
2024-10-15 08:36:40 +05:30
Onkar Ravgan
2ee015e426
Add array supp for json schema parser (#18255) 2024-10-15 07:30:16 +05:30
Suman Maharana
dd08bc9ffd
GEN-895: Added Glue Pipeline Lineage (#18063) 2024-10-14 13:08:17 +05:30
Suman Maharana
142a506120
Fixed dbt Manifest and Run results parsing (#18234) 2024-10-14 13:07:22 +05:30
Imri Paran
bff50664b9
fix(data-quality): table diff url (#18221)
fixed url for table diff when using service connection
2024-10-11 01:07:29 +05:30
Imri Paran
68e71cb3dc
GEN-970: Refactor redshift system metrics to support freshness test (#17981)
* ref(profiler): redshift system metrics

- moved redshift system metrics to the redshift source module
- use Timestamp in data quality
- added plugin feature to test utils

* use timezone.utc

* format

* reverted unintended snowflake changes

* fixed import test_system_metrics.py

* revert

* fixed import in tests
2024-10-10 08:32:07 +02:00
devanshGarg-lumiq
fcd34afab5
Fix Redshift profiler for external tables in Redshift Spectrum (#18182) 2024-10-09 12:58:57 +02:00
Mayur Singal
e21843b752
Fix #18177: Fix Hive Metastore connection issue (#18178) 2024-10-09 11:34:41 +02:00
Sachin Chaurasiya
457f3d919a
GEN-1322: API Entity - Remove Beta (#17967)
* GEN-1322: API Entity - Remove Beta

* minor: add doc for the metadata pipeline

* api service refactor

* api service refactor backend changes

* add apiconnection in test service connection

* pytest fix

* fix java file formatting

* Fix casing of REST in ApiServiceRest.spec.ts

* Refactor REST to Rest in API classes

* minor change

* minor change

* minor change

* fix cashing for API to Api

* add playwright test for api service ingestion

* fix: playwright test

---------

Co-authored-by: harshsoni2024 <harshsoni2024@gmail.com>
2024-10-08 14:39:55 +05:30
harshsoni2024
421ad601e9
GEN-1552: Postgres stored procedures support (#18083) 2024-10-07 14:28:03 +05:30
Ethan
49fceb4674
Fixes #18104 : change parse_obj and assertEquals which was deprecated (#18105)
* change deprecationwarning

* fix format python

* fix replace module

* change : java function name
2024-10-07 09:02:41 +02:00
Imri Paran
71720ebc51
fix(table-diff): support cross database (#18085)
fixed table diff url to include database in all cases
2024-10-04 15:31:17 +02:00
harshsoni2024
1cd97fdac8
get primitive datatype in case of array, struct (#18102) 2024-10-04 18:49:50 +05:30
Suman Maharana
d9b15ef6cb
Minor: Resolve protected namespace warning (#18109) 2024-10-04 13:01:41 +05:30
Mayur Singal
f69db851b8
Fix #18070: Add support for glue database descriptions (#18094) 2024-10-03 21:53:39 -07:00
Mayur Singal
088282b0c1
MINOR: MSTR URI Fix (#18095) 2024-10-03 17:10:23 +02:00
Onkar Ravgan
23c6f1a6c1
AlationSink conn improvements (#18091) 2024-10-03 16:20:35 +05:30
Suman Maharana
bc6f4824ea
Added DBT tests with versionless and fixed v7 parsing (#18028) 2024-09-27 19:53:27 +05:30
IceS2
c9d37f6afc
Fix missing cast to str (#18016) 2024-09-27 16:03:45 +02:00
harshsoni2024
9442cac1a5
allow powerbi datamodel children in col.lineage (#18030) 2024-09-27 19:28:59 +05:30
Teddy
8dc6b7d282
MINOR - Fix General Profiler Bugs (#17995)
* fix import issue

* fix: better handle None values in profiler processing

* fix: profiler errors

* chore: fix comment

* style: fix python linting failure

* fix: null byte error with the database client

---------

Co-authored-by: Chirag Madlani <12962843+chirag-madlani@users.noreply.github.com>
2024-09-27 14:36:30 +02:00
sam-mccarty-mavenclinic
0dd3e97170
Fix 17911: Looker parsing improvements for liquid templating and view/model aliasing (#17912)
* Looker parsing improvements for liquid templating and view/model aliasing

* add python-liquid dependency to looker plugin requirements

* move to static method with 'openmetadata' context and add rendering tests

* remove backtick stripping

---------

Co-authored-by: Imri Paran <imri.paran@gmail.com>
2024-09-27 13:55:15 +02:00
Pere Miquel Brull
d26449576a
GEN-1234 - Clean up suggestions when a user is deleted (#17988)
* GEN-1234 - Clean up suggestions when a user is deleted

* add method

* add method

* fix postgres query
2024-09-26 16:22:36 +02:00
Imri Paran
25284e0232
MINOR: fix snowflake system metrics (#17989)
* fix snowflake system metrics

* format

* add link to logs and commit
fixed the dq cli test

* reverted bad formatting

* fixed models.py

* removed version pinning for data diff in tests
2024-09-26 11:55:17 +00:00
Suman Maharana
37b6dc8290
Add Sigma Dashboard Connector (#17855)
* Add Sigma Dashboard Connector

* changed to id instead of name in dashboard entity

* Address Comments

* addressed comments

* Added Docs

* yaml file changes

* fix ui changes
2024-09-26 16:29:35 +05:30
Suman Maharana
d2bce8adf4
Minor: fix lineage & queries in dbt (#17999) 2024-09-26 10:50:24 +05:30
Suman Maharana
b3bfa6433e
Fix Manifest is not parsed correctly on dbt versionless (#17975) 2024-09-24 23:39:17 +05:30
Aldwyn Cabarrubias
b9f881f2fa
Change time format for the created_at of the DBT cloud pipeline status (#17916)
Co-authored-by: Suman Maharana <sumanmaharana786@gmail.com>
2024-09-23 14:50:48 +05:30
Pere Miquel Brull
4cccaae446
GEN-996 - Allow PII Processor without storing Sample Data (#17927)
* GEN-996 - Allow PII Processor without storing Sample Data

* fix import

* fix import
2024-09-20 16:05:29 +02:00
Ayush Shah
8087f48e15
Fixes #17869: Add validations in place for salesforce connection (#17870) 2024-09-20 16:33:38 +05:30
Pere Miquel Brull
1e56c76c0e
FIX #17896 - Python lineage SDK to work with Uuid & FQN models (#17928) 2024-09-20 10:37:41 +02:00
Imri Paran
21af02d8d5
Fix 17903: fix(data-quality): snowflake data diff (#17907)
* fix(data-quality): snowflake data diff

- fixed schema in snowflake URL for data diff
- added e2e for snowflake data quality

* reverted unintended change
2024-09-19 17:55:48 +02:00