3572 Commits

Author SHA1 Message Date
Keshav Mohta
7c0eeef049
Fixes #19692: Implemented Nifi Pipeline Lineage (#21802)
* feat: implemented nifi pipeline lineage

* test: implemented tests for nifi pipeline lineage

* fix: yield_pipeline_bulk_lineage_details output type hinting

* fix: component check in connections

---------

Co-authored-by: Mayur Singal <39544459+ulixius9@users.noreply.github.com>
2025-06-18 07:31:04 +00:00
harshsoni2024
a09a696358
MINOR: Tableau proxy url for sourceurl (#21799) 2025-06-18 10:52:08 +05:30
Teddy
e4dffd281c
fix: preserve BQ struct field casing (#21716) 2025-06-17 23:58:35 +02:00
IceS2
cf288aa5de
Remove useless comment (#21819) 2025-06-17 14:27:41 -07:00
Sriharsha Chintalapani
c90138501f
Fix #21822: OpenSearch by default limits the number of characters it will analyze for highlighting to 1,000,000 characters. If your description field is very large (e.g. Markdown docs, embedded HTML, or verbose documentation), this limit gets exceeded. (#21821)
* Add sample data

* Fix index mappings to optimize the highlighter for OpenSearch
2025-06-17 14:22:11 -07:00
Mayur Singal
34c43eaea0
MINOR: Fix pytests (#21807) 2025-06-17 23:44:29 +05:30
IceS2
e79c54e6a5
MINOR: Add injection to profiler (#21738)
* Initial implementation for our Connection Class

* Implement the Initial Connection class

* Add Unit Tests

* Implement Dependency Injection for the Ingestion Framework

* Fix Test

* Fix Profile Test Connection

* Add Injection to Metrics in Profiler

* Add Injection to the Profiler

* Fix UnitTests

* Fix Pytests

* Fix Tests

* Fix types
2025-06-17 19:01:00 +02:00
harshsoni2024
0f79d8ea1d
MINOR: pytest opt out flaky test (#21800)
* remove mlflow test until fixed

* alationsink test count fixed

* pylint fix gx
2025-06-17 14:23:28 +05:30
IceS2
49df5fc9de
MINOR: Implement dependency injection on ingestion (#21719)
* Initial implementation for our Connection Class

* Implement the Initial Connection class

* Add Unit Tests

* Implement Dependency Injection for the Ingestion Framework

* Fix Test

* Fix Profile Test Connection

* Fix test, making the injection test run last

* Update connections.py

* Changed NewType to an AbstractClass to avoid linting issues

* remove comment

* Fix bug in service spec

* Update PyTest version to avoid importlib.reader wrong import
2025-06-16 08:03:38 +02:00
Sriharsha Chintalapani
074329418f
Fix #17244: Pagination for columns in UI (#21508) 2025-06-15 21:30:31 +05:30
Mayur Singal
64626dd4fd
MINOR: Implement Lineage Filter for UC (#21761)
Co-authored-by: Sriharsha Chintalapani <harshach@users.noreply.github.com>
2025-06-13 22:14:56 +05:30
Pere Menal-Ferrer
44e09e41a2
Revert "FIX #1464 (#21520)" (#21726)
This reverts commit 1e86f9870fd663122b9bbb64f3cf17cf32619c7f.
2025-06-13 17:27:32 +02:00
IceS2
891ff4184d
MINOR: Initial implementation for our Connection Class (#21581)
* Initial implementation for our Connection Class

* Implement the Initial Connection class

* Add Unit Tests

* Fix Test

* Fix Profile Test Connection

* Remove unit test

* Remove comment

* Fix tests and missing changes
2025-06-13 14:52:29 +02:00
IceS2
f44d81ddf2
Update test_deltalake.py (#21735)
* Update test_deltalake.py

* Ignore it while collecting the tests
2025-06-13 08:27:38 +02:00
Mayur Singal
93b5cec8f9
Fix #21099: fix Superset ingestion bad query (#21650) 2025-06-13 08:32:15 +05:30
Mayur Singal
f4e9d69930
Fix #21109: Unable to connect to Opensearch using AWS Credentials (#21441) 2025-06-13 08:30:44 +05:30
Keshav Mohta
cd24c0a69a
Feature: Microstrategy Lineage (#21678) 2025-06-13 08:28:29 +05:30
Mohit Tilala
2803e62f0b
Add missing Data space type in qlikcloud (#21698)
* Add missing `Data` space type in qlikcloud

* Fix broken json files
2025-06-12 14:49:10 -07:00
Mayur Singal
d20d278c4b
Minor: Improve UC owner ingestion (#21741)
* Minor: Improve UC owner ingestion

* lint
2025-06-12 14:48:29 -07:00
harshsoni2024
6a6180b2e3
powerbi change owner condition (#21724) 2025-06-12 16:11:43 +05:30
Suman Maharana
18f9f2cdb6
Fix: Tableau project id should always be a string (#21700) 2025-06-12 11:21:53 +05:30
Teddy
a680e2c802
fix: added profiler config when executing bundle suite (#21714) 2025-06-11 17:03:22 +02:00
Teddy
c09a8b27ae
ISSUE #16676 - Add Tag to CreateTestCase (#21366)
* refactor: removed testSuite field from CreateTestCase

BREAKING CHANGE: when creating a test case, testsuite is now derived from entityLink (fetch or created)

* feat: allow setting tags when creating a test case

* style: ran linters

* fix: compiling error

* fix: failing test case

* fix: failing tests

* removed testSuite from required field

* fixed ui side

* style: ran java linting

* deprecation: remove testSuite param from ingestion

* fix: remove test suite field

* fix: remove test_suite field

---------

Co-authored-by: Shailesh Parmar <shailesh.parmar.webdev@gmail.com>
2025-06-11 09:59:08 +02:00
Suman Maharana
0df058a53d
Fix: dbtcloud CL errors (#21685) 2025-06-10 21:45:07 +05:30
Mayur Singal
06ae2df2c3
MINOR: Fix bigquery import issue (#21444)
* MINOR: Fix bigquery import issue

* fix checkstyle
2025-06-09 16:08:16 -07:00
Mohit Tilala
8bda216a72
Fixes #21472: Add mention of why snowflake owners are not supported (#21519)
* Add mention of why snowflake owners are not supported

* Remove owners from docs as not supported
2025-06-09 16:05:54 -07:00
Pere Menal-Ferrer
1e86f9870f
FIX #1464 (#21520)
* Add PIICategoryTags and some utilities on top of them.

* Fix static-check

* Add test for fqn representation

* Add NEREntityGeneralTags.json from Collate

* Add test to check PIICategoryTags agree with the ones used by OM server

* Add LabelExtractor

* Fix style

* Add ignore superfluous-parens for pylint

* Add comment as per PR review

* Fix not-updated PII-IT

* Remove duplicated IT test for PII

---------

Co-authored-by: Pere Menal <pere.menal@getcollate.io>
Co-authored-by: Sriharsha Chintalapani <harshach@users.noreply.github.com>
2025-06-09 16:05:35 -07:00
Mayur Singal
53817c2182
Minor: Make trino query table configurable (#21665) 2025-06-09 16:03:57 -07:00
harshsoni2024
4a3b6f4934
issue-21370: db2 custom driver installation (#21638)
* db2 custom driver installation

* pylint changes

* typo fix
2025-06-09 19:52:35 +05:30
Ayush Shah
05e6a56b41
Add Databricks Sampler, Refactor Unity Catalog Sampler (#21612) 2025-06-09 14:18:35 +05:30
Keshav Mohta
b7a7023890
Fix #20665: BigQuery - Adding billing project (#21231) 2025-06-09 13:09:40 +05:30
Mohit Tilala
9e36cfe012
Remove existing entity source hash presence check (#21621) 2025-06-08 12:37:51 +05:30
Teddy
5078a2fbb9
DEPRECATION: Remove testCaseResults endpoint from testCaseResource (#21527)
* deprecation: remove testCaseResults endpoint from testCaseResource

* fix: path in e2e test

* fix: endpoint name to testCaseResults

* style: fix java linting
2025-06-07 21:02:54 +02:00
Suman Maharana
161b4a8b2a
Chore: Tableau Improvements (#21620)
* Chore: Tableau Improvements

* Added apiVersion

* linting

* Addressed Comments
2025-06-07 21:38:48 +05:30
Suman Maharana
fd88a6d449
Add: dbt tags Filter (#21628) 2025-06-07 12:25:23 +05:30
Suman Maharana
2c657d6034
Fix: Looker cll parsing issue (#21630)
* Fix: Looker cll parsing issue

* Added checks
2025-06-07 12:21:32 +05:30
Mohit Tilala
ea63db993a
Add lineage dialect for Exasol, Trino and Vertica (#21604) 2025-06-06 11:48:52 +05:30
Elay Gelbart
dec346a84b
Fixes ISSUE 20899: upgrade google-cloud-secret-manager python requirement version (#20900)
* upgrade openmetadata-ingestion dependency google-cloud-secret-manager version to 2.23.3

* upgrade openmetadata-ingestion dependency google-cloud-secret-manager version to 2.23.3 with ~

* Bump up `mlflow` and `databricks-sdk` for protobuf 5.x.x, pin down google-cloud-secret-manager to 2.22.1 for airflow deps sync

* Pin down databricks-sdk to 0.20.0

---------

Co-authored-by: Mohit Tilala <tilalamohit123@gmail.com>
Co-authored-by: Mohit Tilala <63147650+mohittilala@users.noreply.github.com>
Co-authored-by: Chirag Madlani <12962843+chirag-madlani@users.noreply.github.com>
2025-06-06 03:14:25 +05:30
IceS2
8540884ab1
MINOR: Add method to filter ingestion pipeline based on metadata (#21449)
* Add logic to handle WorkflowContext on Ingestion

* Revert base.py changes

* Removed comment

* Fix basedpyright complaints

* Make ContextManager automatically add its context to the PipelineStatus

* Small changes

* Only dump non-null keys

* Add Method to Filter Ingestion Pipeline based on Metadata

* Reduce the scope to filter only specifically on metadata->workflow->serviceName
2025-06-04 16:13:39 +02:00
Mohit Tilala
44c90557b7
Fix missing __pydantic_fields__ exceptions (#21521) 2025-06-04 16:44:31 +05:30
IceS2
5b20b84546
MINOR: Add logic to handle WorkflowContext on Ingestion (#21425)
* Add logic to handle WorkflowContext on Ingestion

* Revert base.py changes

* Removed comment

* Fix basedpyright complaints

* Make ContextManager automatically add its context to the PipelineStatus

* Small changes
2025-06-03 17:35:08 +02:00
Suman Maharana
720c6d3f9f
Add: Looker explore to view Column Lineage (#21504)
* Add: explore to view Column Lineage

* Add tags ingestion and fix cll warnings

* lint

* Addressed comments

* fixed tests
2025-06-03 20:23:43 +05:30
Suman Maharana
c00ed22866
Fix: Tableau Validation Errors (#21530) 2025-06-03 11:03:45 +05:30
Teddy
3c5fbffeaa
feat: add regex support for dbx (#21514) 2025-06-02 17:55:48 +02:00
Teddy
859f24aba7
MINOR: row sampling error (#21488)
* fix: row sampling error

* fix: return sample query
2025-06-02 09:02:17 +02:00
harshsoni2024
841cc5753d
issue-21439: dashboard lineage override (#21440) 2025-06-02 11:36:44 +05:30
Suman Maharana
7e3c732919
Fix: Databricks Schema Description (#21367) 2025-06-02 11:34:07 +05:30
Suman Maharana
1c4500b119
Fix: looker CLL errors (#21493)
* Fix: looker CLL errors

* Addressed comments- added exception handling

* addressed comments

* linting
2025-05-31 17:29:23 +05:30
Pere Menal-Ferrer
6683c632f4
FIX #21464 (#21463)
* Reproduce failing behaviour with non-date-time data

* Add a presidio patch for DateTimes

* Fix type-check error

---------

Co-authored-by: Pere Menal <pere.menal@getcollate.io>
2025-05-30 08:18:50 +02:00
Suman Maharana
21f3c4be3c
Add: Looker column level lineage (#21458)
* Add: Looker column level lineage

* Fix broken lineage

* add exception handling

---------

Co-authored-by: ulixius9 <mayursingal9@gmail.com>
2025-05-29 17:26:55 +05:30