2979 Commits

Author SHA1 Message Date
IceS2
94cf3e0fd6
MINOR: Extend profile workflow config to allow engine configuration (#21840)
* Update Profile Workflow to allow engine configuration

* Add ui generated schemas

* Add Repository Override mechanism based on annotations

* Implement logic to use the ProcessingEngine configuration

* Update SparkEngine to use remote and not master
2025-06-26 19:11:26 +05:30
Mayur Singal
803abb9373
Minor: Fix Tableau Lineage in Multi Schema Model (#21965) 2025-06-25 23:43:06 +05:30
Mehul Shroff
35215762cb
Update metadata_service_helper.py (#21948) 2025-06-25 17:07:40 +05:30
IceS2
392f081255
Update PySpark and Delta-Spark Versions to use PySpark 3.5.6 (#21919) 2025-06-25 11:45:01 +02:00
Suman Maharana
2aa2282e03
Added project to datamodel (#21926) 2025-06-25 02:26:22 +05:30
Ayush Shah
11ac56356b
MINOR: Modify Sample data (#21599) 2025-06-24 17:16:13 +05:30
harshsoni2024
f490406968
MINOR: pbi improve logging (#21868) 2025-06-20 16:32:56 +05:30
Keshav Mohta
73ea60b898
Refactor: Unity Catalog (#21801) 2025-06-20 16:04:34 +05:30
IceS2
5bac5f2509
MINOR: Fix Airflow API Test Connection (#21818)
* Fix Airflow API Test Connection

* Fix query_parser_source test_connection

* Already update all test_connection I could find

* Fix circular dependency

* Fix invalid variable

* Fix wrong import

---------

Co-authored-by: Aniket Katkar <aniketkatkar97@gmail.com>
2025-06-19 17:58:07 +05:30
Himanshu Khairajani
79c3d55128
Fix #21679: Added metadata ingest-dbt CLI Command for Direct DBT Artifacts Ingestion (#21680)
* metadata dbt

* fix:
 - default path to current directory
 - addional warning and exception handling for missing metadata config vars

* test: add unit tests for DBT Ingestion CLI

* refactor

* PR review:
 - using Pydantic to parse and validate the openmetadata config in dbt's .yml
 - extended test-cases
 - giving user more configuration options for ingestion

* py refactoring

* add: dbt-auto ingest docs

* Improvements:
 - using environement variables for loading sensitve variables
 - added docs for auto dbt-ingestion for dbt-core
 - more test cases

* fix:
 - test case for reading JWT token inside the the method

* refactor: py code formatting

* refactor: py formatting

* ingest-dbt docs updated

* refined test cases

* Chore:
 - sonar vulnerability issue review
 - using existing URL class for host validation

---------

Co-authored-by: Mayur Singal <39544459+ulixius9@users.noreply.github.com>
2025-06-19 17:57:10 +05:30
Suman Maharana
7be62f3ed9
Add: Tableau Hierarchy project filter (#21811) 2025-06-19 11:18:52 +05:30
IceS2
040a33117c
MINOR: Fix Profiler Infinite Loop (#21843) 2025-06-19 10:33:45 +05:30
harshsoni2024
d38ee0ed52
feat-21712: PowerBI internal entities & cross workspace lineage (#21837) 2025-06-18 20:46:17 +05:30
Keshav Mohta
7c0eeef049
Fixes #19692: Implemented Nifi Pipeline Lineage (#21802)
* feat: implemented nifi pipeline lineage

* test: implemented tests for nifi pipeline lineage

* fix: yield_pipeline_bulk_lineage_details output type hinting

* fix: component check in connections

---------

Co-authored-by: Mayur Singal <39544459+ulixius9@users.noreply.github.com>
2025-06-18 07:31:04 +00:00
harshsoni2024
a09a696358
MINOR: Tableau proxy url for sourceurl (#21799) 2025-06-18 10:52:08 +05:30
Teddy
e4dffd281c
fix: preserve BQ struct field casing (#21716) 2025-06-17 23:58:35 +02:00
IceS2
cf288aa5de
Remove useless comment (#21819) 2025-06-17 14:27:41 -07:00
IceS2
e79c54e6a5
MINOR: Add injection to profiler (#21738)
* Initial implementation for our Connection Class

* Implement the Initial Connection class

* Add Unit Tests

* Implement Dependency Injection for the Ingestion Framework

* Fix Test

* Fix Profile Test Connection

* Add Injection to Metrics in Profiler

* Add Injection to the Profiler

* Fix UnitTests

* Fix Pytests

* Fix Tests

* Fix types
2025-06-17 19:01:00 +02:00
harshsoni2024
0f79d8ea1d
MINOR: pytest opt out flaky test (#21800)
* remove mlflow test until fixed

* alationsink test count fixed

* pylint fix gx
2025-06-17 14:23:28 +05:30
IceS2
49df5fc9de
MINOR: Implement dependency injection on ingestion (#21719)
* Initial implementation for our Connection Class

* Implement the Initial Connection class

* Add Unit Tests

* Implement Dependency Injection for the Ingestion Framework

* Fix Test

* Fix Profile Test Connection

* Fix test, making the injection test run last

* Update connections.py

* Changed NewType to an AbstractClass to avoid linting issues

* remove comment

* Fix bug in service spec

* Update PyTest version to avoid importlib.reader wrong import
2025-06-16 08:03:38 +02:00
Mayur Singal
64626dd4fd
MINOR: Implement Lineage Filter for UC (#21761)
Co-authored-by: Sriharsha Chintalapani <harshach@users.noreply.github.com>
2025-06-13 22:14:56 +05:30
Pere Menal-Ferrer
44e09e41a2
Revert "FIX #1464 (#21520)" (#21726)
This reverts commit 1e86f9870fd663122b9bbb64f3cf17cf32619c7f.
2025-06-13 17:27:32 +02:00
IceS2
891ff4184d
MINOR: Initial implementation for our Connection Class (#21581)
* Initial implementation for our Connection Class

* Implement the Initial Connection class

* Add Unit Tests

* Fix Test

* Fix Profile Test Connection

* Remove unit test

* Remove comment

* Fix tests and missing changes
2025-06-13 14:52:29 +02:00
Mayur Singal
93b5cec8f9
Fix #21099: fix Superset ingestion bad query (#21650) 2025-06-13 08:32:15 +05:30
Mayur Singal
f4e9d69930
Fix #21109: Unable to connect to Opensearch using AWS Credentials (#21441) 2025-06-13 08:30:44 +05:30
Keshav Mohta
cd24c0a69a
Feature: Microstrategy Lineage (#21678) 2025-06-13 08:28:29 +05:30
Mohit Tilala
2803e62f0b
Add missing Data space type in qlikcloud (#21698)
* Add missing `Data` space type in qlikcloud

* Fix broken json files
2025-06-12 14:49:10 -07:00
Mayur Singal
d20d278c4b
Minor: Improve UC owner ingestion (#21741)
* Minor: Improve UC owner ingestion

* lint
2025-06-12 14:48:29 -07:00
harshsoni2024
6a6180b2e3
powerbi change owner condition (#21724) 2025-06-12 16:11:43 +05:30
Suman Maharana
18f9f2cdb6
Fix: Tableau project id should always be a string (#21700) 2025-06-12 11:21:53 +05:30
Teddy
a680e2c802
fix: added profiler config when executing bundle suite (#21714) 2025-06-11 17:03:22 +02:00
Teddy
c09a8b27ae
ISSUE #16676 - Add Tag to CreateTestCase (#21366)
* refactor: removed testSuite field from CreateTestCase

BREAKING CHANGE: when creating a test case, testsuite is now derived from entityLink (fetch or created)

* feat: allow setting tags when creating a test case

* style: ran linters

* fix: compiling error

* fix: failing test case

* fix: failing tests

* removed testSuite from required filed

* fixed ui side

* style: ran java linting

* deprecation: remove testSuite param from ingestion

* fix: remove test suite filed

* fix: remove test_suite field

---------

Co-authored-by: Shailesh Parmar <shailesh.parmar.webdev@gmail.com>
2025-06-11 09:59:08 +02:00
Suman Maharana
0df058a53d
Fix: dbtcloud CL errors (#21685) 2025-06-10 21:45:07 +05:30
Mayur Singal
06ae2df2c3
MINOR: Fix bigquery import issue (#21444)
* MINOR: Fix bigquery import issue

* fix checkstyle
2025-06-09 16:08:16 -07:00
Mohit Tilala
8bda216a72
Fixes #21472: Add mention of why snowflake owners are not supported (#21519)
* Add mention of why snowflake owners are not supported

* Remove owners from docs as not supported
2025-06-09 16:05:54 -07:00
Pere Menal-Ferrer
1e86f9870f
FIX #1464 (#21520)
* Add PIICategoryTags and some utilities on top of them.

* Fix static-check

* Add test for fqn representation

* Add NEREntityGeneralTags.json from Collate

* Add test to check PIICategoryTags agree with the ones used by OM server

* Add LabelExtractor

* Fix style

* Add ignore superflous-parens for pylint

* Ass comment as per PR review

* Fix not-updated PII-IT

* Remove duplicated IT test for PII

---------

Co-authored-by: Pere Menal <pere.menal@getcollate.io>
Co-authored-by: Sriharsha Chintalapani <harshach@users.noreply.github.com>
2025-06-09 16:05:35 -07:00
Mayur Singal
53817c2182
Minor: Make trino query table configurable (#21665) 2025-06-09 16:03:57 -07:00
harshsoni2024
4a3b6f4934
issue-21370: db2 custom driver installation (#21638)
* db2 custom driver installation

* pylint changes

* typo fix
2025-06-09 19:52:35 +05:30
Ayush Shah
05e6a56b41
Add Databricks Sampler, Refactor Unity Catalog Sampler (#21612) 2025-06-09 14:18:35 +05:30
Keshav Mohta
b7a7023890
Fix #20665: BigQuery - Adding billing project (#21231) 2025-06-09 13:09:40 +05:30
Mohit Tilala
9e36cfe012
Remove existing entity source hash presence check (#21621) 2025-06-08 12:37:51 +05:30
Teddy
5078a2fbb9
DEPRECATION: Remove testCaseResults endpoint from testCaseResource (#21527)
* deprecation: remove testCaseResults endpoint from testCaseResource

* fix: path in test e2e test

* fix: endpoint name to testCaseResults

* style: fix java linting
2025-06-07 21:02:54 +02:00
Suman Maharana
161b4a8b2a
Chore: Tableau Improvements (#21620)
* Chore: Tableau Improvements

* Added apiVersion

* linting

* Addressed Comments
2025-06-07 21:38:48 +05:30
Suman Maharana
fd88a6d449
Add: dbt tags Filter (#21628) 2025-06-07 12:25:23 +05:30
Suman Maharana
2c657d6034
Fix: Looker cll parsing issue (#21630)
* Fix: Looker cll parsing issue

* Added checks
2025-06-07 12:21:32 +05:30
Mohit Tilala
ea63db993a
Add lineage dialect for Exasol, Trino and Vertica (#21604) 2025-06-06 11:48:52 +05:30
IceS2
8540884ab1
MINOR: Add method to filter ingestion pipeline based on metadata (#21449)
* Add logic to handle WorkflowContext on Ingestion

* Revert base.py changes

* Removed comment

* Fix basedpyright complaints

* Make ContextManager automatically add its context to the PipelineStatus

* Small changes

* Only dump non-null keys

* Add Method to Filter Ingestion Pipeline based on Metadata

* Reduce the scope to filter only specifically on metadata->workflow->serviceName
2025-06-04 16:13:39 +02:00
Mohit Tilala
44c90557b7
Fix missing __pydantic_fields__ exceptions (#21521) 2025-06-04 16:44:31 +05:30
IceS2
5b20b84546
MINOR: Add logic to handle WorkflowContext on Ingestion (#21425)
* Add logic to handle WorkflowContext on Ingestion

* Revert base.py changes

* Removed comment

* Fix basedpyright complaints

* Make ContextManager automatically add its context to the PipelineStatus

* Small changes
2025-06-03 17:35:08 +02:00
Suman Maharana
720c6d3f9f
Add: Looker explore to view Column Lineage (#21504)
* Add: explore to view Column Lineage

* Add tags ingestion and fix cll warnings

* lint

* Addressed comments

* fixed tests
2025-06-03 20:23:43 +05:30