2307 Commits

Author SHA1 Message Date
sonika-shah
971225dbce
Fix #21414: Custom properties display issue in advanced search (#21873)
* fix: remove recursive extraction of nested fields for entityReference and entityReferenceList type custom properties

* get displayName field along with other fields in /customProperties api

* show display name instead of name for custom props

* fix extension type and add tests

* fix tests

* fix data model tests

---------

Co-authored-by: karanh37 <karanh37@gmail.com>
Co-authored-by: Karan Hotchandani <33024356+karanh37@users.noreply.github.com>
2025-06-26 19:35:45 +05:30
IceS2
94cf3e0fd6
MINOR: Extend profile workflow config to allow engine configuration (#21840)
* Update Profile Workflow to allow engine configuration

* Add ui generated schemas

* Add Repository Override mechanism based on annotations

* Implement logic to use the ProcessingEngine configuration

* Update SparkEngine to use remote and not master
2025-06-26 19:11:26 +05:30
Mohit Yadav
b8afbb4ed2
Fix failing Maven CI Tests (#21975)
* Fix Glossary Test Failures

* Fix Soft Deletion Causing issues
2025-06-26 16:03:36 +05:30
Mohit Yadav
3e3f1d8f3b
feat: implement entity lifecycle event delegation (#21921)
* feat: implement entity lifecycle event delegation framework

* added tests

* added reindex method to search clients

* Use virtual thread pool
2025-06-25 23:26:45 +05:30
Mohit Yadav
0c6ded3ae3
Soft-Deleted Entities are not removed from system, so relations are still valid if there is entry in db (#21957) 2025-06-25 21:54:02 +05:30
Mohit Yadav
2c696bb1e2
Remove Default: null from columns and let json to pojo handle (#21934)
* Remove Default: null from columns and let json to pojo handle

* Remove Setting table columns to null

* Fix Failing Tests

---------

Co-authored-by: Sriharsha Chintalapani <harshach@users.noreply.github.com>
2025-06-25 21:53:32 +05:30
Mohit Yadav
89fcac6316
feat: enhance relationshipCleanup with service hierarchy cleanup and Data Retention App (#21904)
* Extended relationshipCleanup command to clean up broken entities across all
  OpenMetadata service hierarchies (Database, Dashboard, API, Messaging, Pipeline,
  Storage, ML Model, and Search services), not just database services.

  Key changes:
  - Added performServiceHierarchyCleanup() method in OpenMetadataOperations
  - Added --skip-hierarchy-cleanup option for backward compatibility
  - Added getBrokenRelationFromParentToChild() and deleteBrokenRelationFromParentToChild() methods in CollectionDAO
  - Support for detecting and cleaning orphaned entities across any parent-child relationship
  - Maintains dry-run and actual cleanup modes with detailed reporting

  Usage: relationshipCleanup [--delete] [--skip-hierarchy-cleanup] [--batch-size N

* refactor: extract reusable cleanup utilities and eliminate code duplication

 Changes:
  - Add ServiceHierarchyCleanup.java for reusable hierarchy cleanup
  - Add ComprehensiveCleanupUtil.java for combined cleanup operations
  - Refactor OpenMetadataOperations to use reusable utilities
  - Refactor DataRetention to use reusable utilities
  - Single source of truth for all service hierarchy definitions

* - missing commit

* - Data Retention App test
2025-06-25 14:19:22 +05:30
Mohit Yadav
16aa38da8f
Fix Cache To have immutable entities (#21933) 2025-06-24 16:40:32 -07:00
Sriharsha Chintalapani
97cf131d19
Fix query cost aggregation (#21907)
* Fix query aggregation to use checksum not query text

* fix query text

* fix opensearch class

---------

Co-authored-by: ulixius9 <mayursingal9@gmail.com>
2025-06-23 14:02:58 -07:00
sonika-shah
cf6eaf8a1a
Revert : query_cost_record_search_index Search reindexing issue (#21906) 2025-06-23 22:11:29 +05:30
sonika-shah
594ae851f5
Fix : query_cost_record_search_index Search reindexing issue (#21900) 2025-06-23 06:51:37 -07:00
Ajith Prasad
b8887e4df1
Scim Bot can update display name (#21879)
* Scim Bot can update display name

* Updated by Bot checked for editdisplayname

* format

* Fixed TC

* Fixed policy tests

* comment

---------

Co-authored-by: Sriharsha Chintalapani <harshach@users.noreply.github.com>
2025-06-22 17:44:46 -07:00
Teddy
337ce90797
MINOR: Alert test message type (#21866)
* fix: align message with Content-Type for test webhook alert
2025-06-20 14:39:11 +02:00
Keshav Mohta
73ea60b898
Refactor: Unity Catalog (#21801) 2025-06-20 16:04:34 +05:30
Ajith Prasad
2f7c6ef05e
Swagger response code corrected for scim (#21871) 2025-06-20 15:49:18 +05:30
Ram Narayan Balaji
e7c0216b25
Fix:#21647 Inherited Tags from Glossary Term should be shown in put column response (#21832)
* Fix: Inherited Tags from Glossary Term should be shown in put/patch column response

* Placement of the statement fixed

---------

Co-authored-by: sonika-shah <58761340+sonika-shah@users.noreply.github.com>
2025-06-19 15:13:20 +05:30
Ajith Prasad
33711766e2
commons-beanutils vulnerability reported in security scan (#21830) 2025-06-19 08:43:19 +05:30
Sriharsha Chintalapani
802438f0ea
Fix default boost score, improve fqn parsing (#21854)
* Fix explain turned by default, use dfs_query_then_fetch in cases of sharding of search cluster

* Add exact match configs

* Add exact match configs

* Update Logic to build search source builder with exact match priority

* Revert "Update Logic to build search source builder with exact match priority"

This reverts commit 175a2e9c6b67ee90d4b2a35af89bb035e8c45131.

* Revert "Add exact match configs"

This reverts commit 3fd52606610bbb97a676170004cab6d7adc31a0d.

* revert display name change

* make boost mode as sum by default

* add more fqnparts for schema and database

* revert DFS_QUERY_THEN_FETCH since sharding wasn't the issue

* use fqn split

* refactor fqn parsing

---------

Co-authored-by: mohitdeuex <mohit.y@deuexsolutions.com>
2025-06-18 18:56:11 -07:00
Sriharsha Chintalapani
8adda4955c
Revert "Issues in Search Relevancy (#21841)" (#21853)
This reverts commit f388e570c1dac5b9eee31364870fb66e42715f18.
2025-06-18 16:43:34 -07:00
Mohit Yadav
f388e570c1
Issues in Search Relevancy (#21841)
* Fix explain turned by default, use dfs_query_then_fetch in cases of sharding of search cluster

* Add exact match configs

* Add exact match configs

* Update Logic to build search source builder with exact match priority

* Revert "Update Logic to build search source builder with exact match priority"

This reverts commit 175a2e9c6b67ee90d4b2a35af89bb035e8c45131.

* Revert "Add exact match configs"

This reverts commit 3fd52606610bbb97a676170004cab6d7adc31a0d.

* revert display name change

* make boost mode as sum by default

* add more fqnparts for schema and database

* revert DFS_QUERY_THEN_FETCH since sharding wasn't the issue

* use fqn split

* Refactor FQN Parts

---------

Co-authored-by: Sriharsha Chintalapani <harsha@getcollate.io>
Co-authored-by: Sriharsha Chintalapani <harshach@users.noreply.github.com>
2025-06-18 16:33:46 -07:00
Karan Hotchandani
631c6f58fe
Feat(ui): add owners field in tags and classification (#21757)
* add owners field in classification

* fix tests

* add owners field in tags page

* Tag Inherits owners from classification

* Added loadTags.ts for owners field

* add owners for tags page

* fix tests

* review comments

* add owners field in create form

* fix domain test

---------

Co-authored-by: Ram Narayan Balaji <ramnarayanb3005@gmail.com>
Co-authored-by: Mohit Yadav <105265192+mohityadav766@users.noreply.github.com>
2025-06-18 16:52:53 +05:30
sonika-shah
9e281d0ee5
fix #21394 Custom properties bug update with existing wrong values (#21825)
* fix #21394 allow only updated extension fields to be validated

* fix #21394 add tests
2025-06-18 14:03:45 +05:30
Sriharsha Chintalapani
fb5e45e6cd
Fix TableResourceTest failures for column updates (#21826) 2025-06-17 17:48:01 -07:00
Sriharsha Chintalapani
c90138501f
Fix #21822: OpenSearch by default limits the number of characters it will analyze for highlighting to 1,000,000 characters. If your description field is very large (e.g. Markdown docs, embedded HTML, or verbose documentation), this limit gets exceeded. (#21821)
* Add sample data

* Fix index mappings to optimize the highlighter for OpenSearch
2025-06-17 14:22:11 -07:00
Ram Narayan Balaji
bd948de115
fix: GlossaryTermRepository.java (#21816) 2025-06-17 20:40:49 +05:30
mgorsk1
48eac68a10
feat: add lineage tool in MCP Server for impact analysis (#21516)
* 🎉 Init

* fix required args

* fix after refactor

* fix linting

* revert mysql volume docker-compose.yml change

---------

Co-authored-by: Mohit Yadav <105265192+mohityadav766@users.noreply.github.com>
2025-06-17 12:09:19 +05:30
Sriharsha Chintalapani
acf0696c12
Search index app optimization (#21798)
* Use a fixed thread pool

* skip entity not found
2025-06-17 11:59:35 +05:30
Karan Hotchandani
2689676b9a
Feat(ui): add and edit certifications from asset page (#21758)
* Feat(ui): add and edit certifications from asset page (#21344)

* added styling to certification

* added tests

* changed icons for certification popup

---------

Co-authored-by: Karan Hotchandani <33024356+karanh37@users.noreply.github.com>

* #19529 Improve import to include certification for Assets (Database, DatabaseSchema, DatabaseService), Introduce EDIT_CERTIFICATION policy and Fix Elastic Search Indexes on Certification (#21564)

* #19529 - Adding certification field in json schema for databaseService.json, databaseServiceCsvDocumentation.json for bulk import facility, Implemented bulk import for non-recursive databaseService import async

* #19529 -Bulk async import - databaseSchema entity

* #19529 -Bulk async import - database entity

* #19529 - Initial Implementation of Search Indexes for Certificate during bulk Import

* #19529 - Edit Certification Policy Implementation, Attaching it to DataStewards and DataConsumers, also adding it to Migration

* #19529 - Updated ElasticSearch Index mapping name

* #19529 - Updated the CSV indices for recursive import

* #19529 - Single Test working as expected but not as a test suite.

* #19529 - Fixed DatabaseSchemaResourceTest

* #19529 - Fixed DatabaseResource Test for exportImportCSV

* #19529 - Fixed DatabaseServiceResource Test for exportImportCSV

* #19529 - Updated and improved ElasticSearch Certification functionality

* #19529 - Added postgres migration as well along with mysql migration

* #19529 - Removed migration scripts from v160 and placed it at v180, Included mappings for database_service in zh and jp,
Applied mvn spotless:apply

* Applied mvn spotless:apply

* Reused the Existing UPDATE_CERTIFICATION_SCRIPT for ElasticSearch Indexing

* Added field certification in the static String FIELDS

* fix playwright test around bulk action

* #19529 - Persisting Null or empty in the bulk import for certification

* #19529 - Persisting Null or empty in the bulk import for certification - Moved the if block to the top

* mvn spotless:apply

* Reverted an unimportant file

* mvn spotless:apply

* #19529 - Persisting the field Certification in clearFieldsInternal

* typescript files for edit_certification

* Revert "typescript files for edit_certification"

This reverts commit f5e5514a98008cbd0b62d7cb21fefe61659e97cb.

* typescript files for edit_certification

* mvn spotless:apply

* Removed correction

* needed typescript file for edit_certification

* Removed Unnecessary Comments

* Improved Test Cases - Added DATA_ASSET_SEARCH alias instead of GLOBAL_ALIAS

* Fixed csv values in order

---------

Co-authored-by: Ashish Gupta <ashish@getcollate.io>
Co-authored-by: System Administrator <root@192.168.1.4>
Co-authored-by: sonika-shah <58761340+sonika-shah@users.noreply.github.com>
Co-authored-by: Karan Hotchandani <33024356+karanh37@users.noreply.github.com>

* update certification component

* Fixed Certification Indexes

* Fixed Missing Certification Label for DBSchema

* add certification changes

* fix test errors

* Remove Certification field and mapping from database service asset

* Removed certification from schema and fixed test

* fix tests

---------

Co-authored-by: Dhruv Parmar <83108871+dhruvjsx@users.noreply.github.com>
Co-authored-by: Ram Narayan Balaji <81347100+yan-3005@users.noreply.github.com>
Co-authored-by: Ashish Gupta <ashish@getcollate.io>
Co-authored-by: System Administrator <root@192.168.1.4>
Co-authored-by: sonika-shah <58761340+sonika-shah@users.noreply.github.com>
Co-authored-by: Ram Narayan Balaji <ramnarayanb3005@gmail.com>
2025-06-17 11:28:07 +05:30
Ram Narayan Balaji
961a5357dd
Fix: #17965 Correct FQN values will be shown to the user in case of tag validation failures in Glossary Term (#21731)
* Correct FQN values will be shown to the user in case of validation failures in glossary term tags

* Applied mvn spotless:apply

* Added Test cases to verify the validation of assets
2025-06-17 10:40:38 +05:30
Mohit Yadav
0a6a1b37d2
Removed unnecessary fields from the ES doc (#21792) 2025-06-17 09:56:45 +05:30
sonika-shah
4143308f96
Fix #1681 - Deleted Filter in Automations Not Working as Expected (#21760)
* Fix #1681 - Deleted Filter in Automations Not Working as Expected

* Revert "Fix #1681 - Deleted Filter in Automations Not Working as Expected"

This reverts commit d941ad440b6c5bcb66d4ac8bb31c321fb23cbcfc.

* Set deleted param in /search/query API to type Boolean to allow default null values

* fix AUT test failures related to user search in settings/users page

* Make `deleted` param optional for search API call in explore page

* Fix the deleted users showing on the user listing page after user search

* Fix playwright tests

---------

Co-authored-by: Aniket Katkar <aniketkatkar97@gmail.com>
2025-06-16 12:02:19 -07:00
mohitdeuex
0e2c00bd00
Add cleanSearchResponseObject() 2025-06-16 23:03:49 +05:30
Sriharsha Chintalapani
074329418f
Fix #17244: Pagination for columns in UI (#21508) 2025-06-15 21:30:31 +05:30
Mohit Yadav
5b6e25999e
Disabled bot creating activity feeds (#21773)
* - Disabled bot creating feeds

* - nit
2025-06-14 19:21:00 +05:30
Mohit Yadav
059289a83b
Disable query indexing (#21767) 2025-06-13 09:23:44 -07:00
Sriharsha Chintalapani
7d07222635
Fix #21668: Add Auto Tune to configure Search Index Application Parameters (#21669)
* Fix #21668: Add Auto Tune to configure Search Index Application Parameters

* add auto tune param in app json

* add test

* use virtual threads

* Fix Failing Test

---------

Co-authored-by: karanh37 <karanh37@gmail.com>
Co-authored-by: Karan Hotchandani <33024356+karanh37@users.noreply.github.com>
Co-authored-by: mohitdeuex <mohit.y@deuexsolutions.com>
2025-06-13 09:23:22 -07:00
Pere Menal-Ferrer
44e09e41a2
Revert "FIX #1464 (#21520)" (#21726)
This reverts commit 1e86f9870fd663122b9bbb64f3cf17cf32619c7f.
2025-06-13 17:27:32 +02:00
mohitdeuex
9b7b5f2b91
Revert "Add Data Quality Test Case (#21764)"
This reverts commit c7e92d42f11f1fb2b3dbc7bf760494926b0f9d05.
2025-06-13 19:47:14 +05:30
Mohit Yadav
c7e92d42f1
Add Data Quality Test Case (#21764)
* Add Test Parameter Definition tool

* Add DQ create test case
2025-06-13 19:28:35 +05:30
Ram Narayan Balaji
580880e8c1
#17584 Improve Bulk add assets API in Tags, GlossaryTerms (#21673)
* Fix: Remove operations and glossaryTags from bulk asset add api

* Fix: Improved the test case

* addTagToAssets schema - Removed operations field in the payload

* Replacing the request payload for validating tags in the UI

* UI changes for add and remove assets to glossary term

* Using getByName and Entity.GLOSSARY

---------

Co-authored-by: sonika-shah <58761340+sonika-shah@users.noreply.github.com>
2025-06-13 16:04:05 +05:30
Ajith Prasad
023abfda1d
Scim Interface and resource in OM (#21512)
* SCIM interface

* removed unwanted code

* remove SCIM registration in OM

* reverted scim configuration settings, as it is not needed

* Added security context to create user

* Added scimusername in user jsons

* added externalid, scimUsername in createUser

* Added security context on create and update groups

* Added jakarta imports

* Authorization added

* Added role, policy and bot for SCIM
2025-06-13 13:12:41 +05:30
sonika-shah
ae5657c608
Add missing ngram fields for displayName and name in jp/zh indexes (#21750) 2025-06-12 20:26:13 -07:00
sonika-shah
69b45dcca2
Revert pr #21723 (#21749) 2025-06-12 20:25:53 -07:00
sonika-shah
d2c9952c9c
Bump jetty.version to 11.0.25 (#21699) 2025-06-12 18:24:42 -07:00
IceS2
a3968d40b0
Fix 172 migrations by avoiding throwing errors (#21697) 2025-06-12 14:49:26 -07:00
Mohit Tilala
2803e62f0b
Add missing Data space type in qlikcloud (#21698)
* Add missing `Data` space type in qlikcloud

* Fix broken json files
2025-06-12 14:49:10 -07:00
sonika-shah
6fd5778219
Fix #21653 - Tag search based on displayName (#21722)
* Fix #21653 - Tag search based on displayName

* added test

* update test to search by classification display name

---------

Co-authored-by: shrushti2000 <shrushtipolekar@gmail.com>
2025-06-12 14:48:14 -07:00
sonika-shah
66d2ef2c8b
Fix #21653 - Remove recently added delete param field in searchByField api (#21723) 2025-06-12 14:47:54 -07:00
Sriharsha Chintalapani
eb7ca6087e
Update reset-link.json (#21742) 2025-06-12 14:45:26 -07:00
Sriharsha Chintalapani
7e6e23b95a
Fix API pages (#21695)
* Fix API pages
2025-06-12 12:23:34 +05:30