3385 Commits

Author SHA1 Message Date
Pere Miquel Brull
9dc56c3bb0
TEST - Add bots search RBAC validation (#19159)
* TEST - Add bots search RBAC validation

* format

* format
2025-01-02 15:32:03 +01:00
Akash Verma
39dcb5baef
Feature : Cockroach db connector (#18961) 2025-01-02 13:07:55 +05:30
IceS2
3d83f41b82
Remove the sample_data from assert in profiler since it was moved to the classification workflow (#19153) 2024-12-20 10:48:56 +01:00
Pere Miquel Brull
d084243d0a
MINOR - Add Gov entities for entity ref & Automator Actions (#19090)
* MINOR - Add Gov entities for entity ref

* MINOR - Add Gov entities for entity ref

* add automator configs

* add automator configs
2024-12-20 07:14:52 +01:00
Pere Miquel Brull
7c2850da7e
MINOR - Improve MissingExpectedEntityAckException msg (#19033) 2024-12-20 07:14:36 +01:00
IceS2
efbdf21098
Fix e2e tests (#19130)
* Fix E2E

* Fix E2E

* Fix MySQL count

* Tentative Fix

* Tentative Fix

* Revert PowerBI changes

* Revert PowerBI changes
2024-12-19 10:39:30 +01:00
Suman Maharana
6dc7e5c26d
Fix dbt Table not found Issue (#19117) 2024-12-18 18:36:43 +05:30
Mayur Singal
a49aab7111
MINOR: User search should only look in name & displayName (#19121)
* MINOR: User search should only look in name & displayname

* py_format

* pyformat

---------

Co-authored-by: Suman Maharana <sumanmaharana786@gmail.com>
2024-12-18 16:44:54 +05:30
harshsoni2024
91628b0d92
powerbi refactor, avoid prepare bulk data (#19123) 2024-12-18 16:39:06 +05:30
IceS2
c664ab93f6
Fix checkstyle (#19099) 2024-12-18 10:10:32 +01:00
Teddy
fec328bb98
fix: handle existing dbt test results (#19102) 2024-12-18 14:17:58 +05:30
Akash Verma
69557e8716
fixes: #15742 Oracle stored package feature (#18852) 2024-12-16 19:35:20 +05:30
Abdallah Serghine
ac967dfe50
ISSUE-16094: fix s3 storage parquet structureFormat ingestion (#18660)
This aims at fixing the s3 ingestion for parquet files, current behaviour is that
the pipeline will break if it encounters a file that is not valid parquet in the
the container, this is not great as containers might container non parquet files
on purpose like for example _SUCCESS files created by spark.

For that do not fail the whole pipeline when a single container fails, instead
count it as a failure and move on with the remainder of the containers, this is
already an improvement by ideally the ingestion should try a couple more files
under the given prefix before given up, additionally we can allow users to specify
file patterns to be ignored.

Co-authored-by: Abdallah Serghine <abdallah.serghine@olx.pl>
Co-authored-by: Pere Miquel Brull <peremiquelbrull@gmail.com>
2024-12-14 11:40:23 +01:00
Teddy
79b2888bb5
fix: azuresql sampler logic (#19034) 2024-12-13 07:35:04 +01:00
Keshav Mohta
f4ff43c24c
Fixes #17888 - Implemented Cross Database Lineage (#18831) 2024-12-12 15:13:36 +05:30
Keshav Mohta
cde3a7dd1e
Feature: Cassandra Connector (#18943) 2024-12-12 15:12:55 +05:30
Pere Miquel Brull
17c5ba8698
MINOR - Add APIs to Ref Map (#19019)
* MINOR - Add APIs to Ref Map

* MINOR - Add APIs to Ref Map

* MINOR - Add APIs to Ref Map

* format
2024-12-12 10:40:03 +01:00
Suman Maharana
85c7f6d22e
Minor: Fix kafkaconnect test connection (#18965) 2024-12-09 16:52:35 +05:30
Imri Paran
16875853a0
test(data-dff): fix flaky test (#18898)
use 99.5 CI for data diff sampling
2024-12-06 18:55:27 +05:30
Imri Paran
e30571cf4e
[GEN-2187] fix(data-diff): added MD5 handling for bigquery (#18904)
* fix(data-diff): added nd5 handling for bigquery

- added MD5 handling for bigquery
- use URL instead of Engine because it requires less steps and less prone to failure

* added e2e test for data diff with sampling in bigquery
2024-12-06 14:21:33 +01:00
Teddy
610322ffed
MINOR - MSSQL timestamp type profiler fix (#18935)
* fix: mssql timestamp processing

* fix: min/max test type on datetime column

* style: fix python format
2024-12-06 08:03:42 +01:00
LeoLuo123
b5750cbee1
Fixes 18300: Get last completed job run (#18891)
* Get last completed job run

* formatting

---------

Co-authored-by: 😺Leo Luo <leo.luo@mavenclinic.com>
Co-authored-by: Suman Maharana <sumanmaharana786@gmail.com>
2024-12-05 14:53:26 +01:00
Teddy
03bd8e9dc4
FEAT: added TABLESAMPLE for MSSQL (#18926)
* feat: added TABLESAMPLE for sqlserver

* fix: class name

* test: added test to generated sample query
2024-12-05 14:17:39 +01:00
Suman Maharana
113fae6be0
Fix: dbt manifest parsing issue (#18930) 2024-12-05 13:06:25 +05:30
harshsoni2024
c941569a9e
filter empty columns, better msg (#18932) 2024-12-05 13:05:58 +05:30
Ayush Shah
8664c8df75
Fixes GEN-2199: Allow Fivetran filtering of pipelines using name instead of id (#18929) 2024-12-05 10:55:11 +05:30
Mayur Singal
8ac80e6807
MINOR: Tableau Capture SQL in Lineage (#18925) 2024-12-05 09:55:21 +05:30
Mayur Singal
6326ae3cb2
MINOR: DB2 Support License (#18928) 2024-12-05 09:54:58 +05:30
Ayush Shah
81b68682aa
Fixes GEN-2198: Snowflake unencrypted key support, connection error on storing bytes (#18927) 2024-12-05 07:55:44 +05:30
Mayur Singal
6d21dd12a4
MINOR: Snowflake UDF Lineage Support - main (#18886) 2024-12-05 00:19:40 +05:30
Mayur Singal
e44ca06173
MINOR: Add Try/Except to store procedure yielding (#18883) 2024-12-04 21:36:53 +05:30
Teddy
a615f49ab2
MINOR: fix table/schema name (#18899)
* fix: fix table/schema name

* fix: minor profiler and dbt issues

* style: ran python linting

---------

Co-authored-by: Imri Paran <imri.paran@gmail.com>
2024-12-04 10:12:38 +01:00
Pere Miquel Brull
613fd331e0
MINOR - Clean up configs & add auto classification docs (#18907)
* MINOR - Clean up configs & add auto classification docs

* deprecation notice
2024-12-04 09:32:25 +01:00
Suman Maharana
11b0a9c376
Fixes: mircostrategy test connection (#18889) 2024-12-03 17:13:15 +01:00
Suman Maharana
e582cb4d9c
Minor: Fix dbt source file issue (#18903) 2024-12-03 21:42:11 +05:30
Imri Paran
7ae886e8f9
chore(importer): better error message for missing plugins (#18878)
Before:

"Generic ImportError message"

After:

Error initializing metadata: You might be missing the plugin [bigquery]. Try:
pip install "openmetadata-ingestion[bigquery]==1.6.0"
2024-12-03 14:41:47 +00:00
Mayur Singal
7c6f5e67dd
MINOR: Snowflake Support Alter Table Swap Lineage (#18887) 2024-12-03 16:23:58 +05:30
Imri Paran
416ba2c9cb
chore(data-quality): improve messaging for COUNT strategy failure (#18884) 2024-12-03 09:44:20 +00:00
harshsoni2024
1140578b05
GEN-2157: Snowflake increased cost issue (#18825)
* exit from active connection while query lineage process

* save rows resuslt/fetchall imple.

* use serverside cursor
2024-12-03 15:02:00 +05:30
Pere Miquel Brull
7aacfe032c
MINOR - FQN encoding in ometa_api, TestSuite pipeline creation & serialization of test case results (#18877)
* DOCS - Update ES config

* MINOR - Add missing FQN encoding & force types

* MINOR - Add missing FQN encoding & force types

* format

* fix tests
2024-12-02 17:17:21 +01:00
Mayur Singal
9b9509f4b9
MINOR: Mysql Lineage Support Main (#18780)
* MINOR: Mysql Lineage Support Main

* fix test

* fix test

---------

Co-authored-by: Teddy <teddy.crepineau@gmail.com>
2024-11-29 20:48:42 +05:30
Teddy
ac2f6d7132
MINOR - Fix sqa table reference (#18839)
* fix: sqa table reference

* style: ran python linting

* fix: added raw dataset to query runner

* fix: get table and schema name from orm object

* fix: get table level config for table tests
2024-11-28 18:49:11 +01:00
mgorsk1
da176767a8
feat: add dbt freshness check test (#18730)
* add dbt freshness check

* docs

* run linting

* add test case param definition

* fix test case param definition

* add config for dbt http, fix linting

* refactor (only create freshness test definition when user executed one)

* fix dbt files class

* fix dbt files class 2

* fix dbt objects class

* fix linting

* fix pylint

* fix linting once and for all

---------

Co-authored-by: Teddy <teddy.crepineau@gmail.com>
2024-11-28 18:30:11 +01:00
harshsoni2024
cb33f274fc
Connector: rename microstrategy connector (#18604) 2024-11-28 18:50:42 +05:30
Pere Miquel Brull
f99ed517bc
FIX - TestSuite source to get the right interface based on the service type (#18838) 2024-11-28 14:08:11 +01:00
Suman Maharana
9a21e77e15
Added dbt cloud multi projects and jobs filter (#18801)
* Added dbt cloud multi project and jobs filter

* added tests

* change to array type

* updated yaml config

* added migrations
2024-11-28 16:10:34 +05:30
Pere Miquel Brull
460d20a856
MINOR - Fix clean_uri and add before pagination (#18826)
* print

* MINOR - Fix clean_uri and add before pagination

* MINOR - Fix clean_uri and add before pagination
2024-11-28 09:35:41 +01:00
Akash Verma
609620e54f
fixes : #12891: "Added more test connection cases for Airflow ingestion" (#18769) 2024-11-28 13:02:04 +05:30
Imri Paran
cd74d8f55a
MINOR: ref(data-quality): modularized test case validator import (#18716)
* ref(data-quality): modularized test case validator import

- removed test_suite_factory
- implemented TestCaseImporter
- removed SQAValidatorBuilder and PandasValidatorBuilder in favor of a SourceType enum
- removed the orm table creation from test suite source

* format

* IValidatorBuilder -> ValidatorBuilder

* use the table from the sampler in the test suite interface

* linting

* fixed the profiler with similar solution

* removed unused inheritance

* removed unneeded super().__init__()

* removed all instances of orm_table

* fixed tests

* add reportExplicitAny=false

* fixed tests
2024-11-27 16:25:12 +01:00
Mayur Singal
9985286cdb
MINOR: Fix snowflake test connection on - in db name (#18817) 2024-11-27 18:38:46 +05:30