3385 Commits

Author SHA1 Message Date
Suman Maharana
2aa2282e03
Added project to datamodel (#21926) 2025-06-25 02:26:22 +05:30
Ayush Shah
11ac56356b
MINOR: Modify Sample data (#21599) 2025-06-24 17:16:13 +05:30
Mayur Singal
43863ae6f3
MINOR: Fix pytests jaraco (#21894) 2025-06-23 13:55:43 +05:30
harshsoni2024
f490406968
MINOR: pbi improve logging (#21868) 2025-06-20 16:32:56 +05:30
Keshav Mohta
73ea60b898
Refactor: Unity Catalog (#21801) 2025-06-20 16:04:34 +05:30
IceS2
5bac5f2509
MINOR: Fix Airflow API Test Connection (#21818)
* Fix Airflow API Test Connection

* Fix query_parser_source test_connection

* Already update all test_connection I could find

* Fix circular dependency

* Fix invalid variable

* Fix wrong import

---------

Co-authored-by: Aniket Katkar <aniketkatkar97@gmail.com>
2025-06-19 17:58:07 +05:30
Himanshu Khairajani
79c3d55128
Fix #21679: Added metadata ingest-dbt CLI Command for Direct DBT Artifacts Ingestion (#21680)
* metadata dbt

* fix:
 - default path to current directory
 - addional warning and exception handling for missing metadata config vars

* test: add unit tests for DBT Ingestion CLI

* refactor

* PR review:
 - using Pydantic to parse and validate the openmetadata config in dbt's .yml
 - extended test-cases
 - giving user more configuration options for ingestion

* py refactoring

* add: dbt-auto ingest docs

* Improvements:
 - using environement variables for loading sensitve variables
 - added docs for auto dbt-ingestion for dbt-core
 - more test cases

* fix:
 - test case for reading JWT token inside the the method

* refactor: py code formatting

* refactor: py formatting

* ingest-dbt docs updated

* refined test cases

* Chore:
 - sonar vulnerability issue review
 - using existing URL class for host validation

---------

Co-authored-by: Mayur Singal <39544459+ulixius9@users.noreply.github.com>
2025-06-19 17:57:10 +05:30
Suman Maharana
7be62f3ed9
Add: Tableau Hierarchy project filter (#21811) 2025-06-19 11:18:52 +05:30
IceS2
040a33117c
MINOR: Fix Profiler Infinite Loop (#21843) 2025-06-19 10:33:45 +05:30
Sriharsha Chintalapani
802438f0ea
Fix default boost score, improve fqn parsing (#21854)
* Fix explain turned by default, use dfs_query_then_fetch in cases of sharding of search cluster

* Add exact match configs

* Add exact match configs

* Update Logic to build search source builder with exact match priority

* Revert "Update Logic to build search source builder with exact match priority"

This reverts commit 175a2e9c6b67ee90d4b2a35af89bb035e8c45131.

* Revert "Add exact match configs"

This reverts commit 3fd52606610bbb97a676170004cab6d7adc31a0d.

* revert display name change

* make boost mode as sum by defaul

* add more fqnparts for schema and database

* revert DFS_QUERY_THEN_FETCH since sharding wasn the issue

* use fqn split

* refactor fqn parsing

---------

Co-authored-by: mohitdeuex <mohit.y@deuexsolutions.com>
2025-06-18 18:56:11 -07:00
Sriharsha Chintalapani
8adda4955c
Revert "Issues in Search Relevancy (#21841)" (#21853)
This reverts commit f388e570c1dac5b9eee31364870fb66e42715f18.
2025-06-18 16:43:34 -07:00
Mohit Yadav
f388e570c1
Issues in Search Relevancy (#21841)
* Fix explain turned by default, use dfs_query_then_fetch in cases of sharding of search cluster

* Add exact match configs

* Add exact match configs

* Update Logic to build search source builder with exact match priority

* Revert "Update Logic to build search source builder with exact match priority"

This reverts commit 175a2e9c6b67ee90d4b2a35af89bb035e8c45131.

* Revert "Add exact match configs"

This reverts commit 3fd52606610bbb97a676170004cab6d7adc31a0d.

* revert display name change

* make boost mode as sum by defaul

* add more fqnparts for schema and database

* revert DFS_QUERY_THEN_FETCH since sharding wasn the issue

* use fqn split

* Refactor FQN Parts

---------

Co-authored-by: Sriharsha Chintalapani <harsha@getcollate.io>
Co-authored-by: Sriharsha Chintalapani <harshach@users.noreply.github.com>
2025-06-18 16:33:46 -07:00
harshsoni2024
d38ee0ed52
feat-21712: PowerBI internal entities & cross workspace lineage (#21837) 2025-06-18 20:46:17 +05:30
Keshav Mohta
7c0eeef049
Fixes #19692: Implemented Nifi Pipeline Lineage (#21802)
* feat: implemented nifi pipeline lineage

* test: implemented tests for nifi pipeline lineage

* fix: yield_pipeline_bulk_lineage_details output type hinting

* fix: component check in connections

---------

Co-authored-by: Mayur Singal <39544459+ulixius9@users.noreply.github.com>
2025-06-18 07:31:04 +00:00
harshsoni2024
a09a696358
MINOR: Tableau proxy url for sourceurl (#21799) 2025-06-18 10:52:08 +05:30
Teddy
e4dffd281c
fix: preserve BQ struct field casing (#21716) 2025-06-17 23:58:35 +02:00
IceS2
cf288aa5de
Remove useless comment (#21819) 2025-06-17 14:27:41 -07:00
Sriharsha Chintalapani
c90138501f
Fix #21822: OpenSearch by default limits the number of characters it will analyze for highlighting to 1,000,000 characters. If your description field is very large (e.g. Markdown docs, embedded HTML, or verbose documentation), this limit gets exceeded. (#21821)
* Add sample data

* Fix index mappings to optimize the highlighter for OpenSearch
2025-06-17 14:22:11 -07:00
Mayur Singal
34c43eaea0
MINOR: Fix pytests (#21807) 2025-06-17 23:44:29 +05:30
IceS2
e79c54e6a5
MINOR: Add injection to profiler (#21738)
* Initial implementation for our Connection Class

* Implement the Initial Connection class

* Add Unit Tests

* Implement Dependency Injection for the Ingestion Framework

* Fix Test

* Fix Profile Test Connection

* Add Injection to Metrics in Profiler

* Add Injection to the Profiler

* Fix UnitTests

* Fix Pytests

* Fix Tests

* Fix types
2025-06-17 19:01:00 +02:00
harshsoni2024
0f79d8ea1d
MINOR: pytest opt out flaky test (#21800)
* remove mlflow test until fixed

* alationsink test count fixed

* pylint fix gx
2025-06-17 14:23:28 +05:30
IceS2
49df5fc9de
MINOR: Implement dependency injection on ingestion (#21719)
* Initial implementation for our Connection Class

* Implement the Initial Connection class

* Add Unit Tests

* Implement Dependency Injection for the Ingestion Framework

* Fix Test

* Fix Profile Test Connection

* Fix test, making the injection test run last

* Update connections.py

* Changed NewType to an AbstractClass to avoid linting issues

* remove comment

* Fix bug in service spec

* Update PyTest version to avoid importlib.reader wrong import
2025-06-16 08:03:38 +02:00
Sriharsha Chintalapani
074329418f
Fix #17244: Pagination for columns in UI (#21508) 2025-06-15 21:30:31 +05:30
Mayur Singal
64626dd4fd
MINOR: Implement Lineage Filter for UC (#21761)
Co-authored-by: Sriharsha Chintalapani <harshach@users.noreply.github.com>
2025-06-13 22:14:56 +05:30
Pere Menal-Ferrer
44e09e41a2
Revert "FIX #1464 (#21520)" (#21726)
This reverts commit 1e86f9870fd663122b9bbb64f3cf17cf32619c7f.
2025-06-13 17:27:32 +02:00
IceS2
891ff4184d
MINOR: Initial implementation for our Connection Class (#21581)
* Initial implementation for our Connection Class

* Implement the Initial Connection class

* Add Unit Tests

* Fix Test

* Fix Profile Test Connection

* Remove unit test

* Remove comment

* Fix tests and missing changes
2025-06-13 14:52:29 +02:00
IceS2
f44d81ddf2
Update test_deltalake.py (#21735)
* Update test_deltalake.py

* Ignore it while collecting the tests
2025-06-13 08:27:38 +02:00
Mayur Singal
93b5cec8f9
Fix #21099: fix Superset ingestion bad query (#21650) 2025-06-13 08:32:15 +05:30
Mayur Singal
f4e9d69930
Fix #21109: Unable to connect to Opensearch using AWS Credentials (#21441) 2025-06-13 08:30:44 +05:30
Keshav Mohta
cd24c0a69a
Feature: Microstrategy Lineage (#21678) 2025-06-13 08:28:29 +05:30
Mohit Tilala
2803e62f0b
Add missing Data space type in qlikcloud (#21698)
* Add missing `Data` space type in qlikcloud

* Fix broken json files
2025-06-12 14:49:10 -07:00
Mayur Singal
d20d278c4b
Minor: Improve UC owner ingestion (#21741)
* Minor: Improve UC owner ingestion

* lint
2025-06-12 14:48:29 -07:00
harshsoni2024
6a6180b2e3
powerbi change owner condition (#21724) 2025-06-12 16:11:43 +05:30
Suman Maharana
18f9f2cdb6
Fix: Tableau project id should always be a string (#21700) 2025-06-12 11:21:53 +05:30
Teddy
a680e2c802
fix: added profiler config when executing bundle suite (#21714) 2025-06-11 17:03:22 +02:00
Teddy
c09a8b27ae
ISSUE #16676 - Add Tag to CreateTestCase (#21366)
* refactor: removed testSuite field from CreateTestCase

BREAKING CHANGE: when creating a test case, testsuite is now derived from entityLink (fetch or created)

* feat: allow setting tags when creating a test case

* style: ran linters

* fix: compiling error

* fix: failing test case

* fix: failing tests

* removed testSuite from required filed

* fixed ui side

* style: ran java linting

* deprecation: remove testSuite param from ingestion

* fix: remove test suite filed

* fix: remove test_suite field

---------

Co-authored-by: Shailesh Parmar <shailesh.parmar.webdev@gmail.com>
2025-06-11 09:59:08 +02:00
Suman Maharana
0df058a53d
Fix: dbtcloud CL errors (#21685) 2025-06-10 21:45:07 +05:30
Mayur Singal
06ae2df2c3
MINOR: Fix bigquery import issue (#21444)
* MINOR: Fix bigquery import issue

* fix checkstyle
2025-06-09 16:08:16 -07:00
Mohit Tilala
8bda216a72
Fixes #21472: Add mention of why snowflake owners are not supported (#21519)
* Add mention of why snowflake owners are not supported

* Remove owners from docs as not supported
2025-06-09 16:05:54 -07:00
Pere Menal-Ferrer
1e86f9870f
FIX #1464 (#21520)
* Add PIICategoryTags and some utilities on top of them.

* Fix static-check

* Add test for fqn representation

* Add NEREntityGeneralTags.json from Collate

* Add test to check PIICategoryTags agree with the ones used by OM server

* Add LabelExtractor

* Fix style

* Add ignore superflous-parens for pylint

* Ass comment as per PR review

* Fix not-updated PII-IT

* Remove duplicated IT test for PII

---------

Co-authored-by: Pere Menal <pere.menal@getcollate.io>
Co-authored-by: Sriharsha Chintalapani <harshach@users.noreply.github.com>
2025-06-09 16:05:35 -07:00
Mayur Singal
53817c2182
Minor: Make trino query table configurable (#21665) 2025-06-09 16:03:57 -07:00
harshsoni2024
4a3b6f4934
issue-21370: db2 custom driver installation (#21638)
* db2 custom driver installation

* pylint changes

* typo fix
2025-06-09 19:52:35 +05:30
Ayush Shah
05e6a56b41
Add Databricks Sampler, Refactor Unity Catalog Sampler (#21612) 2025-06-09 14:18:35 +05:30
Keshav Mohta
b7a7023890
Fix #20665: BigQuery - Adding billing project (#21231) 2025-06-09 13:09:40 +05:30
Mohit Tilala
9e36cfe012
Remove existing entity source hash presence check (#21621) 2025-06-08 12:37:51 +05:30
Teddy
5078a2fbb9
DEPRECATION: Remove testCaseResults endpoint from testCaseResource (#21527)
* deprecation: remove testCaseResults endpoint from testCaseResource

* fix: path in test e2e test

* fix: endpoint name to testCaseResults

* style: fix java linting
2025-06-07 21:02:54 +02:00
Suman Maharana
161b4a8b2a
Chore: Tableau Improvements (#21620)
* Chore: Tableau Improvements

* Added apiVersion

* linting

* Addressed Comments
2025-06-07 21:38:48 +05:30
Suman Maharana
fd88a6d449
Add: dbt tags Filter (#21628) 2025-06-07 12:25:23 +05:30
Suman Maharana
2c657d6034
Fix: Looker cll parsing issue (#21630)
* Fix: Looker cll parsing issue

* Added checks
2025-06-07 12:21:32 +05:30
Mohit Tilala
ea63db993a
Add lineage dialect for Exasol, Trino and Vertica (#21604) 2025-06-06 11:48:52 +05:30