1835 Commits

Author SHA1 Message Date
Ajith Prasad
2f7c6ef05e
Swagger response code corrected for scim (#21871) 2025-06-20 15:49:18 +05:30
Ram Narayan Balaji
e7c0216b25
Fix:#21647 Inherited Tags from Glossary Term should be shown in put column response (#21832)
* Fix: Inherited Tags from Glossary Term should be shown in put/patch column response

* Placement of the statement fixed

---------

Co-authored-by: sonika-shah <58761340+sonika-shah@users.noreply.github.com>
2025-06-19 15:13:20 +05:30
Sriharsha Chintalapani
802438f0ea
Fix default boost score, improve fqn parsing (#21854)
* Fix explain turned on by default, use dfs_query_then_fetch in cases of sharding of search cluster

* Add exact match configs

* Add exact match configs

* Update Logic to build search source builder with exact match priority

* Revert "Update Logic to build search source builder with exact match priority"

This reverts commit 175a2e9c6b67ee90d4b2a35af89bb035e8c45131.

* Revert "Add exact match configs"

This reverts commit 3fd52606610bbb97a676170004cab6d7adc31a0d.

* revert display name change

* make boost mode as sum by default

* add more fqnparts for schema and database

* revert DFS_QUERY_THEN_FETCH since sharding wasn't the issue

* use fqn split

* refactor fqn parsing

---------

Co-authored-by: mohitdeuex <mohit.y@deuexsolutions.com>
2025-06-18 18:56:11 -07:00
Sriharsha Chintalapani
8adda4955c
Revert "Issues in Search Relevancy (#21841)" (#21853)
This reverts commit f388e570c1dac5b9eee31364870fb66e42715f18.
2025-06-18 16:43:34 -07:00
Mohit Yadav
f388e570c1
Issues in Search Relevancy (#21841)
* Fix explain turned on by default, use dfs_query_then_fetch in cases of sharding of search cluster

* Add exact match configs

* Add exact match configs

* Update Logic to build search source builder with exact match priority

* Revert "Update Logic to build search source builder with exact match priority"

This reverts commit 175a2e9c6b67ee90d4b2a35af89bb035e8c45131.

* Revert "Add exact match configs"

This reverts commit 3fd52606610bbb97a676170004cab6d7adc31a0d.

* revert display name change

* make boost mode as sum by default

* add more fqnparts for schema and database

* revert DFS_QUERY_THEN_FETCH since sharding wasn't the issue

* use fqn split

* Refactor FQN Parts

---------

Co-authored-by: Sriharsha Chintalapani <harsha@getcollate.io>
Co-authored-by: Sriharsha Chintalapani <harshach@users.noreply.github.com>
2025-06-18 16:33:46 -07:00
Karan Hotchandani
631c6f58fe
Feat(ui): add owners field in tags and classification (#21757)
* add owners field in classification

* fix tests

* add owners field in tags page

* Tag Inherits owners from classification

* Added loadTags.ts for owners field

* add owners for tags page

* fix tests

* review comments

* add owners field in create form

* fix domain test

---------

Co-authored-by: Ram Narayan Balaji <ramnarayanb3005@gmail.com>
Co-authored-by: Mohit Yadav <105265192+mohityadav766@users.noreply.github.com>
2025-06-18 16:52:53 +05:30
sonika-shah
9e281d0ee5
fix #21394 Custom properties bug update with existing wrong values (#21825)
* fix #21394 allow only updated extension fields to be validated

* fix #21394 add tests

* fix #21394 add tests
2025-06-18 14:03:45 +05:30
Sriharsha Chintalapani
fb5e45e6cd
Fix TableResourceTest failures for column updates (#21826) 2025-06-17 17:48:01 -07:00
Ram Narayan Balaji
bd948de115
fix: GlossaryTermRepository.java (#21816) 2025-06-17 20:40:49 +05:30
mgorsk1
48eac68a10
feat: add lineage tool in MCP Server for impact analysis (#21516)
* 🎉 Init

* fix required args

* fix after refactor

* fix linting

* revert mysql volume docker-compose.yml change

---------

Co-authored-by: Mohit Yadav <105265192+mohityadav766@users.noreply.github.com>
2025-06-17 12:09:19 +05:30
Sriharsha Chintalapani
acf0696c12
Search index app optimization (#21798)
* Use a fixed thread pool

* skip entity not found
2025-06-17 11:59:35 +05:30
Karan Hotchandani
2689676b9a
Feat(ui): add and edit certifications from asset page (#21758)
* Feat(ui): add and edit certifications from asset page (#21344)

* added styling to certification

* added tests

* changed icons for certification popup

---------

Co-authored-by: Karan Hotchandani <33024356+karanh37@users.noreply.github.com>

* #19529 Improvise import to include certification for Assets (Database, DatabaseSchema, DatabaseService), Introduce EDIT_CERTIFICATION policy and Fix Elastic Search Indexes on Certification (#21564)

* #19529 - Adding certification field in json schema for databaseService.json, databaseServiceCsvDocumentation.json for bulk import facility, Implemented bulk import for non-recursive databaseService import async

* #19529 -Bulk async import - databaseSchema entity

* #19529 -Bulk async import - database entity

* #19529 - Initial Implementation of Search Indexes for Certificate during bulk Import

* #19529 - Edit Certification Policy Implementation, Attaching it to DataStewards and DataConsumers, also adding it to Migration

* #19529 - Updated ElasticSearch Index mapping name

* #19529 - Updated the CSV indices for recursive import

* #19529 - Single Test working as expected but not as a test suite.

* #19529 - Fixed DatabaseSchemaResourceTest

* #19529 - Fixed DatabaseResource Test for exportImportCSV

* #19529 - Fixed DatabaseServiceResource Test for exportImportCSV

* #19529 - Updated and Improvised ElasticSearch Certification functionality

* #19529 - Added postgres migration as well along with mysql migration

* #19529 - Removed migration scripts from v160 and placed it at v180, Included mappings for database_service in zh and jp,
Applied mvn spotless:apply

* #19529 - Adding certification field in json schema for databaseService.json, databaseServiceCsvDocumentation.json for bulk import facility, Implemented bulk import for non-recursive databaseService import async

* #19529 -Bulk async import - databaseSchema entity

* #19529 -Bulk async import - database entity

* #19529 - Initial Implementation of Search Indexes for Certificate during bulk Import

* #19529 - Edit Certification Policy Implementation, Attaching it to DataStewards and DataConsumers, also adding it to Migration

* #19529 - Updated ElasticSearch Index mapping name

* #19529 - Updated the CSV indices for recursive import

* #19529 - Single Test working as expected but not as a test suite.

* #19529 - Fixed DatabaseSchemaResourceTest

* #19529 - Fixed DatabaseResource Test for exportImportCSV

* #19529 - Fixed DatabaseServiceResource Test for exportImportCSV

* #19529 - Updated and Improvised ElasticSearch Certification functionality

* #19529 - Added postgres migration as well along with mysql migration

* #19529 - Removed migration scripts from v160 and placed it at v180, Included mappings for database_service in zh and jp,
Applied mvn spotless:apply

* Applied mvn spotless:apply

* Reused the Existing UPDATE_CERTIFICATION_SCRIPT for ElasticSearch Indexing

* Added field certification in the static String FIELDS

* fix playwright test around bulk action

* #19529 - Persisting Null or empty in the bulk import for certification

* #19529 - Persisting Null or empty in the bulk import for certification - Moved the if block to the top

* mvn spotless:apply

* Reverted an unimportant file

* mvn spotless:apply

* #19529 - Persisting the field Certification in clearFIeldsInternal

* typescript files for edit_certification

* Revert "typescript files for edit_certification"

This reverts commit f5e5514a98008cbd0b62d7cb21fefe61659e97cb.

* typescript files for edit_certification

* mvn:spotless:apply

* Removed correction

* needed typescript file for edit_certification

* Removed Unnecessary Comments

* Improved Test Cases - Added DATA_ASSET_SEARCH alias instead of GLOBAL_ALIAS

* Fixed csv values in order

---------

Co-authored-by: Ashish Gupta <ashish@getcollate.io>
Co-authored-by: System Administrator <root@192.168.1.4>
Co-authored-by: sonika-shah <58761340+sonika-shah@users.noreply.github.com>
Co-authored-by: Karan Hotchandani <33024356+karanh37@users.noreply.github.com>

* update certification component

* Fixed Certification Indexes

* Fixed Missing Certification Label for DBSchema

* add certification changes

* fix test errors

* Remove Certification field and mapping from database service asset

* Removed certification from schema and fixed test

* fix tests

---------

Co-authored-by: Dhruv Parmar <83108871+dhruvjsx@users.noreply.github.com>
Co-authored-by: Ram Narayan Balaji <81347100+yan-3005@users.noreply.github.com>
Co-authored-by: Ashish Gupta <ashish@getcollate.io>
Co-authored-by: System Administrator <root@192.168.1.4>
Co-authored-by: sonika-shah <58761340+sonika-shah@users.noreply.github.com>
Co-authored-by: Ram Narayan Balaji <ramnarayanb3005@gmail.com>
2025-06-17 11:28:07 +05:30
Ram Narayan Balaji
961a5357dd
Fix: #17965 Correct FQN values will be shown to the user in case of tag validation failures in Glossary Term (#21731)
* Correct FQN values will be shown to the user in case of validation failures in glossary term tags

* Applied mvn:spotless apply

* Added Test cases to verify the validation of assets
2025-06-17 10:40:38 +05:30
Mohit Yadav
0a6a1b37d2
Removed unnecessary fields from the ES doc (#21792) 2025-06-17 09:56:45 +05:30
sonika-shah
4143308f96
Fix #1681 - Deleted Filter in Automations Not Working as Expected (#21760)
* Fix #1681 - Deleted Filter in Automations Not Working as Expected

* Revert "Fix #1681 - Deleted Filter in Automations Not Working as Expected"

This reverts commit d941ad440b6c5bcb66d4ac8bb31c321fb23cbcfc.

* Set deleted param in /search/query API to type Boolean to allow default null values

* fix AUT test failures related to user search in settings/users page

* fix AUT test failures related to user search in settings/users page

* Make `deleted` param optional for search API call in explore page

* Fix the deleted users showing on the user listing page after user search

* Fix playwright tests

* Fix playwright tests

---------

Co-authored-by: Aniket Katkar <aniketkatkar97@gmail.com>
2025-06-16 12:02:19 -07:00
mohitdeuex
0e2c00bd00 Add cleanSearchResponseObject() 2025-06-16 23:03:49 +05:30
Sriharsha Chintalapani
074329418f
Fix #17244: Pagination for columns in UI (#21508) 2025-06-15 21:30:31 +05:30
Mohit Yadav
5b6e25999e
Disabled bot creating activity feeds (#21773)
* - Disabled bot creating feeds

* - nit
2025-06-14 19:21:00 +05:30
Sriharsha Chintalapani
7d07222635
Fix #21668: Add Auto Tune to configure Search Index Application Parameters (#21669)
* Fix #21668: Add Auto Tune to configure Search Index Application Parameters

* add auto tune param in app json

* add test

* use virtual threads

* Fix Failing Test

---------

Co-authored-by: karanh37 <karanh37@gmail.com>
Co-authored-by: Karan Hotchandani <33024356+karanh37@users.noreply.github.com>
Co-authored-by: mohitdeuex <mohit.y@deuexsolutions.com>
2025-06-13 09:23:22 -07:00
mohitdeuex
9b7b5f2b91 Revert "Add Data Quality Test Case (#21764)"
This reverts commit c7e92d42f11f1fb2b3dbc7bf760494926b0f9d05.
2025-06-13 19:47:14 +05:30
Mohit Yadav
c7e92d42f1
Add Data Quality Test Case (#21764)
* Add Test Parameter Definition tool

* Add Dq create test case
2025-06-13 19:28:35 +05:30
Ram Narayan Balaji
580880e8c1
#17584 Improve Bulk add assets API in Tags, GlossaryTerms (#21673)
* Fix: Remove operations and glossaryTags from bulk asset add api

* Fix: Improved the test case

* addTagToAssets schema - Removed operations field in the payload

* Replacing the request payload for validating tags in the UI

* UI changes for add and remove assets to glossary term

* Using getByName and Entity.GLOSSARY

---------

Co-authored-by: sonika-shah <58761340+sonika-shah@users.noreply.github.com>
2025-06-13 16:04:05 +05:30
Ajith Prasad
023abfda1d
Scim Interface and resource in OM (#21512)
* SCIM interface

* removed unwanted code

* remove SCIM registration in OM

* reverted scim configuration settings, as it is not needed

* Added security context to create user

* Added scimusername in user jsons

* added externalid, scimUsername in createUser

* Added security context on create and update groups

* Added jakarta imports

* Authorization added

* Added role, policy and bot for SCIM
2025-06-13 13:12:41 +05:30
sonika-shah
69b45dcca2
Revert pr #21723 (#21749) 2025-06-12 20:25:53 -07:00
IceS2
a3968d40b0
Fix 172 migrations by avoiding throwing errors (#21697) 2025-06-12 14:49:26 -07:00
sonika-shah
66d2ef2c8b
Fix #21653 - Remove recently added delete param field in searchByField api (#21723) 2025-06-12 14:47:54 -07:00
Sriharsha Chintalapani
7e6e23b95a
Fix API pages (#21695)
* Fix API pages

* Fix API pages

* Fix API pages
2025-06-12 12:23:34 +05:30
Teddy
1531130382
fix: failing DQ mvn test (#21702) 2025-06-11 14:56:03 +02:00
Mohit Yadav
9a0f614331
[MCP] Changed MCP as an APP (#21687)
* - Added Prompts

* - Add Prompts for Search

* Embedded Server Mcp as Application

* Add MCP Application

* Fix Prompts and Tool Context

* Get Wrapped Result

* Wrapped result Fixes

* Add Assets for App

* Document Update

* Add doc

* Update Doc

* Remove Config from yaml and use app

* Add Doc
2025-06-11 16:08:42 +05:30
Teddy
c09a8b27ae
ISSUE #16676 - Add Tag to CreateTestCase (#21366)
* refactor: removed testSuite field from CreateTestCase

BREAKING CHANGE: when creating a test case, testsuite is now derived from entityLink (fetch or created)

* feat: allow setting tags when creating a test case

* style: ran linters

* fix: compiling error

* fix: failing test case

* fix: failing tests

* removed testSuite from required field

* fixed ui side

* style: ran java linting

* deprecation: remove testSuite param from ingestion

* fix: remove test suite field

* fix: remove test_suite field

---------

Co-authored-by: Shailesh Parmar <shailesh.parmar.webdev@gmail.com>
2025-06-11 09:59:08 +02:00
Ram Narayan Balaji
febd195bfd
#16279 Update Classification Schema to include Governance Fields - Schema and Java Implementation (#21636)
* Update Classification Schema to include Governance Fields

* Removed Tags, Reviewers, Domain from Classification as they are needed and corrected tests

* Added Permission check for owners in Classification Resource Test

* Added LoadTags.ts generated from createClassificationSchema.json

* Only have my schema changes in the typescript files, ignore other changes.

---------

Co-authored-by: Mohit Yadav <105265192+mohityadav766@users.noreply.github.com>
2025-06-10 07:06:20 -07:00
Mohit Yadav
b8cb82c25c
[Fix-21275] Service pipeline lineage not appearing (#21664)
* Fix Pipeline Service Lineage

* Unique Id issue
2025-06-10 17:57:57 +05:30
Ram Narayan Balaji
d065d85e0a
Revert "Fix: #17584 Improve Glossary Bulk Asset Upload APIs (#21652)" (#21672)
This reverts commit 5b30aa94e2e018eb4c486d14424a5d34b2877979.
2025-06-10 11:36:42 +05:30
Ram Narayan Balaji
5b30aa94e2
Fix: #17584 Improve Glossary Bulk Asset Upload APIs (#21652)
* Fix: Remove operations and glossaryTags from bulk asset add api

* Fix: Improved the test case

* addTagToAssets schema - Removed operations field in the payload

---------

Co-authored-by: Mohit Yadav <105265192+mohityadav766@users.noreply.github.com>
2025-06-10 11:09:07 +05:30
Mohit Yadav
d1ea459a81
Queries not tableQueries (#21670) 2025-06-10 10:08:42 +05:30
Mohit Yadav
dc25350ea2
MCP Core Items Improvements (#21643)
* Search Util fix and added tableQueries

* some json input fix

* Add team and user

* WIP : Add Streamable HTTP

* - Add proper tools/list schema and tools/call

* - auth filter exact match

* - Add Tools Class to dynamically build tools

* Add Origin Validation Mandate

* Refactor MCP Stream

* comment

* Cleanups

* Typo

* Typo
2025-06-10 09:42:24 +05:30
Sriharsha Chintalapani
c2b2d37f95
Fix #21457: SearchIndexing: Limit of total fields [1000] has been exceeded (#21503)
* Fix #21457: SearchIndexing: Limit of total fields [1000] has been exceeded

* Fix #21457: SearchIndexing: Limit of total fields [1000] has been exceeded

* Fix BulkItemException throwing back to the UI

* add upstreamLineage to all appropriate indexes

* Remove conflicting lines and clean up index mappings for api_collection and stored_procedure JSON files

---------

Co-authored-by: Mohit Yadav <105265192+mohityadav766@users.noreply.github.com>
Co-authored-by: Ayush Shah <ayush@getcollate.io>
2025-06-09 16:07:39 -07:00
Sriharsha Chintalapani
98aff312da
Fix #21502: TooManyNestedClauses: Query contains too many nested clauses (#21505)
* Fix #21502: TooManyNestedClauses: Query contains too many nested clauses; maxClauseCount is set to 4096 error in table_search_index for complex search queries in Opensearch

* Fix #21502: TooManyNestedClauses: Query contains too many nested clauses; maxClauseCount is set to 4096 error in table_search_index for complex search queries in Opensearch

* Update openmetadata-service/src/test/java/org/openmetadata/service/resources/search/SearchResourceTest.java

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

* fix mvn build

* fix checkstyle

* fix tests in SearchResourceTest

* fix tests in SearchResourceTest

* instead of wrapping baseQuery+functionScore in a new BoolQuery, use functionScore directly as the final query - avoid duplication of multi_match query

* add requestSchema fields in APIEndpointIndex, and remove array fields in searchSettings to use fuzzyfields avoiding clause explosion

* - Set fuzzyMaxExpansions/maxExpansions to 10 instead of default 50
- Turn off fuzziness (Fuzziness.ZERO) for `.ngram` fields
- Update QUERY_SYNTAX_PATTERN so only uppercase AND, OR, NOT are treated as boolean operators

* use non Fuzzy query for ngram Fields in QueryStringBuilder , remove duplication of query in should phrase

* add fuzzy/nonFuzzy Fields for each asset in searchSettings.json

* add fuzzy/nonFuzzy Fields for each asset in searchSettings.json

* update isFuzzyField function

* fix playwright

---------

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: sonikashah <sonikashah94@gmail.com>
Co-authored-by: sonika-shah <58761340+sonika-shah@users.noreply.github.com>
2025-06-09 16:07:23 -07:00
Siddhant
f30eff4dc5
Fixes #19466: Cleanup Activity Feed In Retention App (#21640)
* add activityThreadsRetentionPeriod in the DataRetentionApplication jsons

* clean up activity threads of type conversation
2025-06-09 19:22:40 +05:30
Siddhant
1511871fc1
Fixes #20142 : Add QueryParams To The Webhook Request URL (#21637)
* queryParams to the webhook request URL

* add activityThreadsRetentionPeriod in the DataRetentionApplication jsons

* clean up activity threads of type conversation

* Revert "clean up activity threads of type conversation"

This reverts commit e66ea8623be6aa1c5d07afdec04b4807eb21c7e7.

* Revert "add activityThreadsRetentionPeriod in the DataRetentionApplication jsons"

This reverts commit 4180be18a8d5e0d4fd4b3d4662df0532bb16dbed.
2025-06-09 19:21:53 +05:30
Ajith Prasad
9a9db8129c
Fixes #21511 :Domain not getting removed through import (#21616)
* Minor: Domain not getting removed through bulk import/ recursive import, bug fix

* Added tag, owner, domain removal in TCs

---------

Co-authored-by: sonika-shah <58761340+sonika-shah@users.noreply.github.com>
2025-06-09 19:19:30 +05:30
Teddy
5078a2fbb9
DEPRECATION: Remove testCaseResults endpoint from testCaseResource (#21527)
* deprecation: remove testCaseResults endpoint from testCaseResource

* fix: path in test e2e test

* fix: endpoint name to testCaseResults

* style: fix java linting
2025-06-07 21:02:54 +02:00
Sriharsha Chintalapani
1c5772d6f8
Fix #21533 - Add a tool to openmetadata-ops.sh to delete orphaned relations (#21534)
* Fix #21533 - Add a tool to openmetadata-ops.sh to delete orphaned relations

* Fix #21533 - Add a tool to openmetadata-ops.sh to delete orphaned relations

* Add exit 1 and make dry-run default

* Add exit 1 and make dry-run default

* Fix checkstyle

---------

Co-authored-by: Pere Miquel Brull <peremiquelbrull@gmail.com>
2025-06-06 16:19:39 -07:00
Pere Miquel Brull
a51187d46b
FIX CL-1641 - Allow to search by descriptionSources (#21487)
* FIX CL-1641 - Allow to search by descriptionSources

* Add new fields for advanced search filters

* feat: added createdBy params

* Move the tag label type options to constants

* Add tier & tag source

---------

Co-authored-by: Aniket Katkar <aniketkatkar97@gmail.com>
Co-authored-by: Teddy Crepineau <teddy.crepineau@gmail.com>
Co-authored-by: ulixius9 <mayursingal9@gmail.com>
2025-06-06 14:10:25 +02:00
Pere Miquel Brull
635382dd1d
Revert "[WIP] MCP Core Items Improvements (#21598)" (#21614)
This reverts commit 0b3bf4ac0d3a7ac74e39552ad49896d37e469516.
2025-06-06 07:32:20 +02:00
IceS2
472b95516d
MINOR: Add data insights migrations to remove stale objects (#21249)
* Adding Data Insights Migrations to remove Stale Objects

* Adding Data Insights Migrations to remove Stale Objects

* Adding detachment to ES ILM policy from Index and template

* Adding detachment to ES ILM policy from Index and template

* Adding detachment to ES ILM policy from Index and template

* Fix Checkstyle

* Moved migrations to 172

* Removed code that was left over on 171 migration

* Improve SearchClient error handling
2025-06-05 16:08:38 +02:00
Mayur Singal
fabf352522
Fix #21226: External ingestion not visible on UI without Airflow (#21412) 2025-06-05 16:52:23 +05:30
Mohit Yadav
0b3bf4ac0d
[WIP] MCP Core Items Improvements (#21598)
* Search Util fix and added tableQueries

* some json input fix

* Add team and user

* WIP : Add Streamable HTTP

* - Add proper tools/list schema and tools/call
2025-06-05 12:30:50 +02:00
sonika-shah
cfd548556e
Support followers in Domain and DataProduct (#21478)
* Support followers in DataProduct

* Support followers in domain
2025-06-05 11:17:13 +05:30
IceS2
8540884ab1
MINOR: Add method to filter ingestion pipeline based on metadata (#21449)
* Add logic to handle WorkflowContext on Ingestion

* Revert base.py changes

* Removed comment

* Fix basedpyright complaints

* Make ContextManager automatically add its context to the PipelineStatus

* Small changes

* Only dump non-null keys

* Add Method to Filter Ingestion Pipeline based on Metadata

* Reduce the scope to filter only specifically on metadata->workflow->serviceName
2025-06-04 16:13:39 +02:00