441 Commits

Author SHA1 Message Date
Ram Narayan Balaji
9cc0441da2
Feat# Implementation of Custom Workflows (#23023)
* Draft Implementation of Custom Workflows

* Multiple Entities in the Same Trigger for Workflow, along with draft implementation of signal id

* Improved User Approval Task and Impl

* Custom Workflows - Draft Implementation 2, improved periodic batch entity filter and other improvements

* feat(governance): Implement transactional custom workflows - improved

This commit introduces a robust, transactional, and extensible framework for custom governance workflows in OpenMetadata.

Key features and improvements include:

Transactional Workflow Management: A new WorkflowTransactionManager ensures atomic operations for creating, updating, and deleting workflow definitions, maintaining consistency between the OpenMetadata database and the Flowable engine.

Safe ID Encoding: Implemented a WorkflowIdEncoder to generate safe, Base64-encoded, and collision-resistant IDs for Flowable processes, preventing errors from ID truncation.

Rollback and Deprecation Tasks:

Added RollbackEntityTask to revert entities to their last approved state.

Introduced DeprecateStaleEntityTask for automated lifecycle management of stale assets.

Enhanced Workflow Engine:

Improved WorkflowHandler to validate workflow definitions before deployment.

Added new custom functions to the rule engine for checking entity update timestamps and calculating field completeness scores.

CI/CD and Build Improvements:

Updated the CI Dockerfile with a multi-stage build and refined dependency installation.

Modified POM files to include necessary dependencies for new features.

* Adding DataCompleteness Task Node, Flowable Debug logs

* Transaction handling for Custom Workflow - Initial Draft

* add new tasks to node definition interface

* Update generated TypeScript types

* Draft Implementation of Multi Reviewer Approval Task with Migration

* Update generated TypeScript types

* Transaction handling fix, id truncation fix by migration, feed repo fix for multi reviewer pattern, copilot comments

* Update generated TypeScript types

* Fixed Multi Reviewer approval to take consideration of the namespaced variables, Fixed RollBackEntity task to follow subprocess like other automated tasks, copilot nitpicks

* Remove conditionalSetEntityAttributes as it is not needed anymore

* Update generated TypeScript types

* Completely remove the setConditionalAttributes to fix compilation errors

* Removed the comments in the schemaChanges

* Created a new Task called CreateDetailedApprovalTaskImpl, Fixed RollBackEntityTask to roll back to either Approved or Rejected state, use namespaced variables, Updated the workflow handler to resolve the tasks and remove them from the user who has approved the task in the feed during multi reviewer approval, TransactionManager updated to keep Transactions in place, Improve the Validation Node input in the Mainworkflow java to handle proper graph traversal, Find Proper Termination Message for the two user events that are conflicting, Include Message in the request approval thread task for proper messages

* Update generated TypeScript types

* Rendering of messages in task thread

* Fix PeriodicBatchEntityTriggerTask to separate workflows based on entities Draft, Fix: SetEntityAttributes Impl to handle multiple fields, Fix: DataCompletenessTask Draft

* Fix DataCompletenessTask BoundaryEvent in Flowable

* Introduced Wf Deployment Strategy for transaction Handling, Improved user tasks for better termination event naming, fix periodic batch entity trigger to fetch the entity instead of all the entities in the trigger, Migrated GlossaryApprovalWorkflow.json with new nodes and edges, Fixed test cases and some edge cases

* Update generated TypeScript types

* Added performTasks for TagRepository and DataProductRepository, Removed test api, removed unnecessary change from workflow json, improved DataCompletenessImpl, Improved CreateDetailedApprovalTaskImpl to show what updated and not updated in the thread response

* Remove Entity Certification and Entity Status tasks and make use of generic set entity attribute task

* Update generated TypeScript types

* Fix the compilation issues!

* Remove setCertification and setEntityAttributes from createWorkflowDefinition

* Test cases for custom workflows related to glossaryTermApprovalWorkflow

* Test cases for custom workflows

* Changed info to debug logs

* Update generated TypeScript types

* DetailedUserApprovalTask changed to reviewChange task, Have validations for workflows where user approval tasks can only be available for entities that support reviewers

* Fix compilation issues and mvn spotless apply

* Update generated TypeScript types

* Remove Extra assignees from user tasks

* Update generated TypeScript types

* Replace Tags/GlossaryTerms during Mutual Exclusivity and Append when there is no Mutual Exclusivity

* Workflow Transaction Manager to handle authorization as part of its functionality, Added Validation for workflows using /validate endpoint

* Increase the Flowable's polling time to 60 seconds to poll timer and async jobs table

* Update generated TypeScript types

* Enum for UserTask Termination Messages, Removed WorkflowUtils to use WorkflowHandler's terminateDuplicate Instances, Approval and rejecters list in the variables, using namespaced variables for updatedBy

* Reverted the enum for userTaskType for now

* Added new tests for dataContract, dataProduct and Tag for user approval tasks

* Glossary Approval Workflow changed to handle jsonLogic according to UI expectations

* Added a new Task type for change description tasks to review and suggest the changes, Added checkUpdatedByReviewer incase of perfom tasks, increase wf test timeout

* Update generated TypeScript types

* TaskWorkflow constructor public to be used by Collate Repo

* AutoApproveServiceTaskImpl incase the assignees are not there for the userTask with ExlusiveGateway builder with hasAssignees

* Fix Compilation Issues - Upgrade of deps to lang3

* ExclusiveGatewayBuilder set exclusive as true, and other minor changes for test and approval tasks

* Added a different wait mechanism for user tasks in WorkflowDefinitionResourceTest.java

* Combined UserApprovalTask and ChangeReviewTask into one to handle suggestions in the same task!

* Update generated TypeScript types

* Test Case Approval Workflow, Removing orphaned changeReviewTask, Test case fix and new test cases

* Update generated TypeScript types

* Treat empty strings, arrays as missing by default

* Update generated TypeScript types

* fix compilation issues by changing the schema properly

* Remove Stale ChangeReview Task

* Update generated TypeScript types

* Enhanced User Approval task to show changed fields along with what changed and the task resolve function as well

* Update generated TypeScript types

* Add Knowledge center page to workflows event consumer, remove legacy backward compatibility for triggers, lenient validations for workflows when there are no nodes

* EntitySpecific Filters for EventBased Trigger, Removing Backward Compatibility logic for filters, Removed unnecessary comments and execution variables, Handle Structured task resolver for dataProduct, dataContract, tag and testCase, Modified GlossaryApprovalWorkflow.json and improved the MigrationUtil

* Update generated TypeScript types

* Bring back entitycertification and glosarystatus task for backward compatibility

* Update generated TypeScript types

* Filter is a map, entity specific filters are stringified, have certification and glossary status tasks in nodeinterface

* Merge Main

* Remove Suggestions for RequestApproval

* Update generated TypeScript types

* Remove Old Deployments of Periodic Batch Entity Trigger and use hiphen to trigger to avoid accidental triggering

* Resolve Merge conflicts, Java Checkstyle

* Update generated TypeScript types

* Fix Migrations

* Added alter table queries in 1.6.0 to avoid cached plan queries in flowable

* Increase timeout in workflow definition resource test

* Increase polling in workflow definition resource test

* Fix java Checkstyle

* comment the flaky test

* COmmented out Flaky Test Cases, fixed a bug on team reviewers

* Resolve java checkstyle after resolving conflicts

* add updated at field in json logic for glossary

* update fields

* add version field

* Update the rule config of new term node with "and" and update the migration as well

* add equal not equal op

* Delete the trigger workflows using like condition

* Defensive Fallback for multiple task instances created by race condition, Terminate duplicate instances of main workflows

* reverting operators

* Approval Capabilities for Metrics

* Update generated TypeScript types

* Move Migrations to 1.10.1 from 1.10.0

* Removed the extra spaces in Migration 1.10

---------

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: karanh37 <karanh37@gmail.com>
2025-10-08 18:57:44 +05:30
Eugenio
af0672e4cf
Fixes #22302: add table2.keyColumns parameter for table diff validation (#23667)
* Update `TableDiffParamsSetter` to move data at table level

This means that `key_columns` and `extra_columns` will be defined per table instead of "globally", just like `data_diff` expects

* Update `TableDiffValidator` to use table's `key_columns`

Call `data_diff` and run validations using each table's `key_columns`

* Create migration to update `tableDiff` test definition

* Fix Playwright test
2025-10-08 09:32:00 +02:00
Eugenio
a6ac42371d
Ensure recognizers are created (#23645)
* Add the migration classes and data for recognizers

This is so that we can run a migration that sets `json->recognizers` of `PII.Sensitive` and `PII.NonSensitive` tags from json values.

The issue with normal migrations was that the value of recognizers was too long to be persisted in the server migrations log.

Created a common `migration.utils.v1110.MigrationProcessBase`

* Ensure building automatically with the right parameters

* Update typescript types
2025-10-07 15:13:35 +00:00
Ram Narayan Balaji
5b59b34c1d
App Screenshots Migration (#23722)
* App Screenshots Migration queries

* Replace the png only in the appScreenshots

* Cast json to jsonb for apps_Data_Store in the migration
2025-10-06 18:56:11 +05:30
Keshav Mohta
3d49b6689d
Fixes #23356: Databricks & UnityCatalog OAuth and Azure AD Auth (#23561)
* feat: databricks oauth and azure ad auth setup

* refactor: add auth type changes in databricks.md

* fix: test after oauth changes

* refactor: unity catalog connection to databricks connection code

* feat: added oauth and azure ad for unity catalog

* fix: unitycatalog tests, doc & required type in connection.json

* fix: generated tx files

* fix: exporter databricksConnection file

* refactor: unitycatalog example file

* fix: usage example files

* fix: unity catalog sqlalchemy connection

* fix: unity catalog client headers

* refactor: make common auth.py for dbx and unitycatalog

* fix: auth functions import

* fix: test unity catalog tags as None

* fix: type hinting and sql migration

* fix: migration for postgres
2025-10-03 19:53:19 +05:30
Eugenio
dff2b394d5
Fix classification scoring (#23523)
* Add `reason` property to `TagLabel`

This is to understand what score was used for selecting the entity

* Build `TagLabel`s with `reason`

* Increase `PIIProcessor._tolerance`

This is so we correctly filter out low scores from classifiers while still maintaining the normalization that filters out confusing outcomes.

e.g: an output with scores 0.3, 0.7 and 0.75, would initially filter the 0.3 and then discard the other two because they're both relatively high results.

* Make database and DAO changes needed to persist `TagLabel.reason`

* Update generated TypeScript types

---------

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-10-01 12:11:14 +00:00
Sriharsha Chintalapani
18677afd39
Add support for Tags customizable rules, capturing feedback (#23289)
* Add support for translations in multi lang

* Add Tag Feedback System

* Update generated TypeScript types

* Fix typing issues and add tests to reocgnizer factory

* Updated `TagResourceTest.assertFieldChange` to fix broken test

This is because change description values had been serialized into strings and for some reason the keys ended up in a different order. So instead of performing String comparison, we do Json comparisons

---------

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Eugenio Doñaque <eugenio.donaque@getcollate.io>
2025-09-30 07:17:18 +02:00
sonika-shah
92c30f2a0a
Fix: PostgreSQL migration immutability issue with databaseSchemaHash … (#23606)
* Fix: PostgreSQL migration immutability issue with databaseSchemaHash column

* Fix: PostgreSQL migration immutability issue with databaseSchemaHash column
2025-09-29 17:07:09 +05:30
Sriharsha Chintalapani
ed144ab0e8
Add threat model and incident response (#23603) 2025-09-28 13:17:23 -07:00
sonika-shah
648a8f86de
add entityType.keyword aggregation in searchSettings.json (#23559)
* add searchSettings aggregation in searchSettings.json

* refactor migrations to only update the agg in searchSettings

* fix migrations
2025-09-25 17:04:49 +05:30
Sriharsha Chintalapani
e77d1940cb
Domain policy must be evaluated during PolicyEvaluator (#23302)
* Domain Only Access Role to be evaluated as part of PolicyEvaluator

* Domain Only Access Role to be evaluated as part of PolicyEvaluator

* Cleanup unnecessary comments

* Add migration fix for domain only policy

* fix playwright domain rbac

* fix hasDomain role issues for TestCaseResourceContext

* allow user to create only with their domain

* move DomainOnlyAccessPolicy migrations to 1.9.10 from 1.9.9

* For resources that don't support domains (like DataInsights), always returns true

* fix adding test case to bundleSuite

* revert supportsDomains changes in RuleEvaluator and handle it in postFiltering steps

* Inherit domains when creating task from test case incident, and skip few entities for domain check + other remaining DQ related domain fixes

* for CreateResourceContext consider assigned domains + inherited domains

* fix permission for table metrics page

* remove SKIP_DOMAIN_CHECK_ENTITY_LIST

* fix test

---------

Co-authored-by: sonika-shah <58761340+sonika-shah@users.noreply.github.com>
Co-authored-by: sonikashah <sonikashah94@gmail.com>
Co-authored-by: Ayush Shah <ayush@getcollate.io>
2025-09-24 09:56:55 +05:30
Keshav Mohta
cb26c91442
Revert "Fixes #23356: Databricks OAuth & Azure AD Auth (#23482)" (#23530)
This reverts commit f1afe8f5f114ee58090168fd7ae5d66b38a01ab0.
2025-09-23 17:44:16 +02:00
Keshav Mohta
f1afe8f5f1
Fixes #23356: Databricks OAuth & Azure AD Auth (#23482)
* feat: databricks oauth and azure ad auth setup

* refactor: add auth type changes in databricks.md

* fix: test after oauth changes

* refactor: unity catalog connection to databricks connection code
2025-09-23 15:22:50 +05:30
Teddy
71f993a2fc
Minor fix broken 1.9.8 migrations (#23487)
* chore: fix duplicate migrations from 1.9.8

* fix: psql update query
2025-09-22 13:13:25 +00:00
Teddy
4be7d65fbf
Minor my sql migration (#23476)
* chore: fix migration performance MySQL

* fix: remove comment
2025-09-18 20:59:42 +02:00
Teddy
f8f57fc4af
chore: change memory engine to innodb in migrations (#23460) 2025-09-18 09:53:49 +02:00
Sriharsha Chintalapani
ffc4dcf643
Fix post profiler time series migration (#23444)
* Fix post profiler time series migration

* add prefix index

* Update prefix index length for entityFQNHash

* Update prefix index length for LIKE queries

* add prefix index

* fix mysql syntax
2025-09-17 19:17:25 -07:00
Adrià Manero
840f7e2a68
Setup NotificationTemplate CRUD (#23293) 2025-09-16 15:21:11 +02:00
sonika-shah
cc18ba1554
feat: Add configurable filters aggregation for tag source segregation in search (#23363)
* feat: Add configurable filters aggregation for tag source segregation in search

* Update generated TypeScript types

* use script based aggregation

* use script based aggregation

* Update generated TypeScript types

* update aggregations with script

* add migrations to fetch updated searchSettings

---------

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Karan Hotchandani <33024356+karanh37@users.noreply.github.com>
2025-09-16 15:16:06 +05:30
Ram Narayan Balaji
9fd34c8f89
Feat #20586 Implementation of Custom Metrics Measurement Units (#22876)
* Initial Implementation of Custom Metrics Measurement Units

* Update generated TypeScript types

* Removed Regex patterns and length validations as they are not needed

* Add a new column with index for custom units

* Remove comments in the sql

* update ui and add playwright

* fix metric selector

* fix tests

* address feedbacks

* remove unused field

---------

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Karan Hotchandani <33024356+karanh37@users.noreply.github.com>
Co-authored-by: karanh37 <karanh37@gmail.com>
2025-09-11 15:14:38 +05:30
Teddy
1ef191a2aa
ISSUE #1534 - Profiler Refactor for Metadata Extraction Application (#23200)
* feat: added exporter app config

* refactor: added entityprofile resource & added backward compatibility to existing API

* feat: added tests to get_profile_data_by_type

* feat: remove non supported event types

* chore: added migrations to 1.9.7

* chore: added application creation readme

* chore: move migrations to 1.9.8

* fix: failing java test

* style: ran java linting
2025-09-05 13:07:04 +02:00
Sriharsha Chintalapani
cb42409999
GlossaryTerm Search endpoint (#23005)
* Glossary Term load issues

* Glossary Term load issues

* Fix #20081: Infinite scroll Pagination support for Glossary Term Table in UI

* Fix #20081: Infinite scroll Pagination support for Glossary Term Table in UI

* fixed glossary infinite scroll

* removed script

* fixed glossary fialing playwright

* fixed expand and collapse for entire hierarchy

* Add search endpoint to support glossary term search

* Add search endpoint to support glossary term search

* fixed fetchAllTerm glitch (#23037)

* resolve conflicts and fix style

* minor prettier fix

* Fix(ui): Infinity Scroll Playwright  (#23088)

* fixed comments

* removed unncessary files

* addressed ui issue

---------

Co-authored-by: Ashish Gupta <ashish@getcollate.io>

* Fix(ui):Infinite Scroll Unit Tests (#23131)

* fixed failing unit tests

* fixed mock type field

* Minor: Move migrations to 1.9.6

* Revert unintended changes in 1.9.5 migration files

* fix build

---------

Co-authored-by: Dhruv Parmar <83108871+dhruvjsx@users.noreply.github.com>
Co-authored-by: sonika-shah <58761340+sonika-shah@users.noreply.github.com>
Co-authored-by: sonikashah <sonikashah94@gmail.com>
Co-authored-by: Ashish Gupta <ashish@getcollate.io>
Co-authored-by: Karan Hotchandani <33024356+karanh37@users.noreply.github.com>
2025-09-02 09:32:16 +05:30
Pere Miquel Brull
abcdc4e3d6
MINOR - Domain Independent DP Rule (#23067)
* MINOR - Domain Independent DP Rule

* handle DP

* Handle DP

* add migration

* improve rule mgmt

* improve rule mgmt

* add test for bulk op

* fix test

* handle in bulk

---------

Co-authored-by: sonika-shah <58761340+sonika-shah@users.noreply.github.com>
2025-08-29 17:28:29 +02:00
Pere Miquel Brull
8c028b81c0
MINOR - Move migrations to 1.9.5 (#23095) 2025-08-28 09:23:21 +02:00
Pere Miquel Brull
d054f386c7
MINOR - tag_usage performance (#22950)
* MINOR WIP - tag_usage performance

* Update EntityRepository.java

remove unnecessary call

* update query and indexes

* Merge 1.9.3 migration queries from main branch

- Add performance indexes for tag_usage and entity_relationship tables
- Add entity relationship update for dataContract relations
- Merge queries from both main and tag-usage-perf branches

Co-authored-by: Pere Miquel Brull <pmbrull@users.noreply.github.com>

---------

Co-authored-by: claude[bot] <209825114+claude[bot]@users.noreply.github.com>
Co-authored-by: Pere Miquel Brull <pmbrull@users.noreply.github.com>
2025-08-25 17:45:39 +02:00
Pere Miquel Brull
bdf3659b41
MINOR - Data Contract fixes and improvements (#23043)
* fix change event handling for alerts

* contract is deleted when asset is deleted

* add support for custom properties

* Update generated TypeScript types

* handle suite index deletion

* validate owner is not coming back if not requested

---------

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-08-22 12:58:06 +02:00
sonika-shah
b1a7d4d8ae
Add missing domain migrations for entity version history (#23032) 2025-08-21 14:33:37 +05:30
Teddy
d58b8a63d6
ISSUE #1753 - Add Row Count to Custom SQL Test (#22697)
* feat: add count rows support for custom SQL

* style: ran python linting

* feat: added logic for partitioned custom sql row count

* migration: partitionExpression parameter

* chore: resolve conflicts
2025-08-19 06:40:49 +02:00
Pere Miquel Brull
8607f10d95
MINOR - Add Tests & fix migrations (#22714) 2025-08-03 15:19:54 +02:00
sonika-shah
d5515ded3b
Add migrations to update widget descriptions (#22707)
* Add migrations to update widget descriptions

* move to postDataMigrationSQLScript.sql

* move to postDataMigrationSQLScript.sql

* move to postDataMigrationSQLScript.sql
2025-08-02 16:55:02 +05:30
sonika-shah
4cf5a410fe
Feed migrations to support multi-domain (#22601)
* WIP : feed migrations to support multi-domain

* fix domain field in ChangeEvent and thread.json , move to maintaining list<uuid>
rather than list<entityRef>

* Update generated TypeScript types

* fix query migrations

* add default for domains field in thread and changeEvent

* add backend test , and allow domain param in createThread

* Update generated TypeScript types

* Apply suggestions from code review

handling null issue in JSON_ARRAY

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

* fix flakiness in test, add pending migration

* fix sql query

---------

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2025-08-02 04:59:05 +05:30
Copilot
5057f29efc
Fix alert events ordering to show latest events first (#22309)
* Initial plan

* Initial analysis: identify ordering issue in alert events

Co-authored-by: harshach <38649+harshach@users.noreply.github.com>

* Fix PostgreSQL query ordering in listAllEventsWithStatuses

Co-authored-by: harshach <38649+harshach@users.noreply.github.com>

* Add comprehensive test coverage and performance indexes for event ordering

- Add CollectionDAOEventOrderingTest with 4 comprehensive test methods
- Test chronological ordering, pagination, edge cases, and cross-database consistency
- Add critical timestamp indexes for consumers_dlq and successful_sent_change_events tables
- Indexes prevent table scans on ORDER BY timestamp DESC queries
- Migration 1.9.1 includes both MySQL and PostgreSQL index definitions

Addresses performance concerns and test coverage gaps identified in PR review.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>

* Fix tests

---------

Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: harshach <38649+harshach@users.noreply.github.com>
Co-authored-by: Sriharsha Chintalapani <harshach@users.noreply.github.com>
Co-authored-by: claude[bot] <209825114+claude[bot]@users.noreply.github.com>
Co-authored-by: Claude <noreply@anthropic.com>
Co-authored-by: Sriharsha Chintalapani <harsha@getcollate.io>
Co-authored-by: Teddy <teddy.crepineau@gmail.com>
2025-07-31 13:31:53 +02:00
Keshav Mohta
f8b588e6f0
Feature: Security Service (#22450)
* feat: security service initial code

* support secrity type for service from ui

* refactor: added missing java files for security service and require json files

* refactor: added doc file, oneOf java changes for ranger

* Merge branch 'main' of https://github.com/open-metadata/OpenMetadata into issue-20145

* update the security services schema to have domains instead of domain

* fix: added security service metadata pipeline

* Make changes for security service UI

* Update missing translation

* update type

* Fix the java checkstyle

* fix: added ingestionRunner in security service json

* Hide security service for OSS

* fix: reverse metadata documentation

* refactor: added securityConnection in test connection

* fix: documentation and ts file changes

* fix: deleted __init__ file and added SSISConnection again in the workflowclassconverter.java

* fix: convert domain to domains

* fix: added new file for basicAuth for ranger and change in rangerConnectionClassConverter.java file

* fix: java checkstyle

* fix: mysql security service sql

* fix: security service index mapping

* fix: moved migrations in new version and fix classConverterFactory

---------

Co-authored-by: Chirag Madlani <12962843+chirag-madlani@users.noreply.github.com>
Co-authored-by: Aniket Katkar <aniketkatkar97@gmail.com>
2025-07-31 06:38:21 +02:00
sonika-shah
79e00ebc94
Add missing migrations for changes related to searchSettings introduced in version 1.8.7. (#22634) 2025-07-29 18:49:19 +05:30
IceS2
bad772db39
FIX #22099: enable 'Column values to be in set' test case for boolean columns (#22491)
* fix(dq): enable ''Column values to be in set'' test case for boolean columns

Add BOOLEAN to supportedDataTypes array in columnValuesToBeInSet.json
to allow boolean column validation with predefined allowed values.

This enables users to enforce strict true/false validation on boolean
columns directly at the column level, resolving issue #22099.

Co-authored-by: IceS2 <IceS2@users.noreply.github.com>

* Add tests to the new feature

* Add migrations and columnValuesToBeNotInSet

---------

Co-authored-by: claude[bot] <209825114+claude[bot]@users.noreply.github.com>
Co-authored-by: IceS2 <IceS2@users.noreply.github.com>
2025-07-25 15:17:38 +02:00
Sriharsha Chintalapani
d1bf66052b
Fix #22422: Classification term count showing 0 in UI … (#22470)
* Fix #22422: Classification term count showing 0 in UI                                                                                                                   │
│                                                                                                                                                                                          │
│   - Added bulk fetch methods to ClassificationDAO for efficient term count retrieval                                                                                                     │
│   - Implemented proper batch fetching in ClassificationRepository.setFieldsInBulk                                                                                                        │
│   - Added ConnectionAwareSqlQuery annotations for MySQL and PostgreSQL compatibility                                                                                                     │
│   - Fixed duplicate key error in TagRepository.batchFetchUsageCounts                                                                                                                     │
│   - Added Playwright test to verify classification term counts display correctly

* Fix java checkstyle

* remove unused query

* refactor query :  switch to join-based classification hash matching and add generated column for classificationHash

---------

Co-authored-by: sonikashah <sonikashah94@gmail.com>
Co-authored-by: sonika-shah <58761340+sonika-shah@users.noreply.github.com>
2025-07-24 11:38:49 +05:30
Sriharsha Chintalapani
e3ba945630
Update databaseServiceMetadataPipeline.json (#22518)
* Update databaseServiceMetadataPipeline.json

markDeletedSchemas and markDeletedDatabases should be false by default

* add migrations

---------

Co-authored-by: Pere Miquel Brull <peremiquelbrull@gmail.com>
Co-authored-by: sonika-shah <58761340+sonika-shah@users.noreply.github.com>
2025-07-23 11:14:18 +02:00
Chirag Madlani
b098395602
Data contracts support for tables & Multi Domain Migration (#22108)
* WIP - MINOR - Rule Engine

* WIP - MINOR - Rule Engine

* WIP - MINOR - Rule Engine

* WIP - MINOR - Rule Engine

* rules

* rules

* rules

* fix retrieval by entity

* test dc

* test dc

* WIP: Data contract feature

* destructure component to it's own files

* WIP contract tab

* update local

* fix test

* First iteration for multi domain support

* fix inheritance fields

* fix inheritance fields

* fix create interface

* fix few more tests

* fix indexing updates

* fix domain rel

* update domain --> domains

* merge

* fix merge

* fix csv tests and createEntity interface

* Update generated TypeScript types

* Trigger Build

* migrations

* fix tests

* fix tests

* fix tests

* Update generated TypeScript types

* Trigger Build

* handle drive service

* fix pg migration

* fix domains ref after merge and clean python tests

* Update generated TypeScript types

* fix merge domains

* format

* add missing migrations

* Update generated TypeScript types

* tests

* Update generated TypeScript types

* Trigger Build

* tests

* tests

* fix py test

* migrate domain to domains and fix compilation errors

* fix domain assignement

* fix domain spec

* fix py tests

* fix data product creation issue

* fix domain tests

* fix bulk import

* fix tests

* fix tests

* fix query and domain migration

* fix py test

* fix playwrights

* fix getEntitiesWithDisplayName indexing quotes

* fix domain prapogation tests

* fix domain propagation

* Fix patch api

* fix domain schema build edit playwright

* fix test

* fix test

* fix domain selection issue and console errors

* quick fix landing page changes

* fix remaining tests

* fix ui tests

* Fix adding data products

* format

---------

Co-authored-by: Pere Miquel Brull <peremiquelbrull@gmail.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Sriharsha Chintalapani <harsha@getcollate.io>
Co-authored-by: Sriharsha Chintalapani <harshach@users.noreply.github.com>
2025-07-22 09:34:50 +02:00
sonika-shah
4d473c8204
Fix 1.7.3 pending migrations (#22434)
* Fix 1.7.3 pending migrations

* move to 1.8.7

* move to 1.8.7
2025-07-21 20:58:11 -07:00
Sriharsha Chintalapani
e3e248ef0a
Fix #22440: Performance Optimization for Database Service Cascade Deletion (#22441)
* Fix #22440: Performance Optimization for Database Service Cascade Deletion

* Fix stylecheck

* Fix #22440: Performance Optimization for Database Service Cascade Deletion

* Fix stylecheck

* Fix styling
2025-07-20 07:32:48 -07:00
Sriharsha Chintalapani
1c092ea743
Fix #22403: Smart Reindex (#22404)
* Fix #22403: Smart Reindex

* Update generated TypeScript types

* Docs: Artifacts Removal (#22442)

Co-authored-by: “Rounak <“rounakpreet.d@deuexsolutions.com”>

* Docs: Meta Tags Updation (#22444)

Co-authored-by: “Rounak <“rounakpreet.d@deuexsolutions.com”>

* Fixes #22363 #22369: Stored procedure temp table processing and lineage filtering with db & schema (#22416)

* Process temp table graph in stored procedure processor and add db/schema filtering on lineage

* Add tests for stored procedure lineage processing

* Fix tests and py_format

* Fix the filters and log stored proc query count info

* Playwright Update: Bulk Import (#22436)

Co-authored-by: Satish <satish@Satishs-MacBook-Pro.local>
Co-authored-by: Karan Hotchandani <33024356+karanh37@users.noreply.github.com>

* Docs: Databrick Features Updation (#22448)

Co-authored-by: “Rounak <“rounakpreet.d@deuexsolutions.com”>

* Docs: Meta Updation (#22451)

Co-authored-by: “Rounak <“rounakpreet.d@deuexsolutions.com”>
Co-authored-by: Prajwal214 <167504578+Prajwal214@users.noreply.github.com>

* Docs: 18.4 to 1.8.6 Release Updation (#22455)

Co-authored-by: “Rounak <“rounakpreet.d@deuexsolutions.com”>

* fix the flaky glossary bulk edit failure (#22446)

* Move schema changes to 1.8.7

---------

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Pere Miquel Brull <peremiquelbrull@gmail.com>
Co-authored-by: Rounak Dhillon <162090200+RounakDhillon@users.noreply.github.com>
Co-authored-by: “Rounak <“rounakpreet.d@deuexsolutions.com”>
Co-authored-by: Mohit Tilala <63147650+mohittilala@users.noreply.github.com>
Co-authored-by: satish <satishmwaghela@gmail.com>
Co-authored-by: Satish <satish@Satishs-MacBook-Pro.local>
Co-authored-by: Karan Hotchandani <33024356+karanh37@users.noreply.github.com>
Co-authored-by: Prajwal214 <167504578+Prajwal214@users.noreply.github.com>
Co-authored-by: Ashish Gupta <ashish@getcollate.io>
2025-07-18 21:53:43 -07:00
Mohit Yadav
060df56aaf
Fix Entity Relationship Cleanup to not deleted entities without repo… (#22400)
* - Fix Entity Relationship Cleanup to not delted entities without repositories

* - Add Migration from Test Case to Test Case Resolution

* - Typo should be resolution id

* Add Tests and Fix Condition

* Update BatchSize
2025-07-16 14:26:38 -07:00
Pere Miquel Brull
397c41457c
MINOR - Add columns.description in search settings (#22299)
* MINOR - Add columns.description in search settings

* remove from defaults
2025-07-15 09:21:57 +02:00
Sriharsha Chintalapani
58cd7daf7e
Add drive as service (#22159)
* Fix #22157: Add Drive Service Support to OpenMetadata

* Fix #22157: Add Drive Service Support to OpenMetadata

* Update generated TypeScript types

* Fix tests

* Update generated TypeScript types

* Fix tests

* Tests are passing

* Tests are passing

* Update generated TypeScript types

* Address comments

* Update generated TypeScript types

* Add sample data and ingestion for drives, directories, spreadsheets, worksheets, files

* Fix postgres migrations

* Fix postgres tests, optimize queries

---------

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Pere Miquel Brull <peremiquelbrull@gmail.com>
2025-07-11 10:00:54 -07:00
Keshav Mohta
6e40f976e7
Fix #20145: Implemented Prefix For Dashboard Service (#21585)
* feat: implemented microstrategy lineage & dbServicePrefix

* feat: added dbServicePrefixes support in other dashboards

* fix: test_metabase and powerbi extra code remove

* fix: python checkstyle

* refactor: added prefix support for other connectors - superset, tableau, etc

* refactor: added migration for prefix change and fix dbServicePrefixes field description

* refactor: added prefix changes in superset db source

* doc: add prefix in tableau doc

* fix: typescript files and postgres migration for prefix

* fix: moved migration in 1.8.2

---------

Co-authored-by: Pere Miquel Brull <peremiquelbrull@gmail.com>
2025-07-08 18:54:35 +02:00
Sriharsha Chintalapani
02a5a5fcc2
Bulk load fields for paginations and indexing (#21771)
* Add db max connections to used while auto tune search index app

* Add more logging

* Add batch reading of entities

* Fix bulk loading issues

* Fix bulk loading issues

* Fix bulk loading issues

* Increase the default max concurrent requests

* Remove entitynotfound from errors

* Add higher no.of threads

* Fix KPI, TestCase, Policy repository fixes

* Fix Merge Issues

* Remove column tests

* Add bulk load for Tags

* Fix tests

* Fix tests for TestSuiteRepository

* Improve bulk loading in Table, DatabaseSchema

* Improve bulk loading in all entities

* fix merge error

* Improve indexing for entities

* Improve performance while pagination

* fix tests

* fix tests

* fix tests

* fix markdown test

* Address issues with bulkloading owners,inherited owners in database schema, tables, roles

---------

Co-authored-by: Mohit Yadav <105265192+mohityadav766@users.noreply.github.com>
Co-authored-by: mohitdeuex <mohit.y@deuexsolutions.com>
Co-authored-by: Karan Hotchandani <33024356+karanh37@users.noreply.github.com>
Co-authored-by: karanh37 <karanh37@gmail.com>
2025-07-07 07:46:42 -07:00
Sriharsha Chintalapani
0162f42a1d
Fix #20621: User Status Tracking in the System (#21911)
* Fix #20621: User Status Tracking in the System

* Fix #20621: User Status Tracking in the System

* Fix #20621: Add Playwright tests

* Update generated TypeScript types

* fix import

* remove unused imports and fix broken ui for settings user page

* revert path  change for PersonaDetailsPage

* fix unit tests

* fix tests

* Add separate api for last login time

* update last login time api

* remove last login time patch

* remove updateLoginTime

---------

Co-authored-by: shrushti2000 <shrushtipolekar@gmail.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: sonikashah <sonikashah94@gmail.com>
Co-authored-by: karanh37 <karanh37@gmail.com>
Co-authored-by: mohitdeuex <mohit.y@deuexsolutions.com>
2025-07-02 14:37:36 -07:00
Sriharsha Chintalapani
db8e2fa548
Improve indexing auto tune based on heap mem, parse custer metrics for new versions of OpenSearch (#21931)
* Improve indexing auto tune based on heap mem, parse custer metrics for new versions of OpenSearch

* Use native bulk processors

* lower the batch size, add composite index for app_runs table

* while updated secrets, if there is entityNotFoundException ignore and move on

* Fix deadlock

* Fix deadlock

* Fix gzip headers

* remove unnecessary code to check compression, batch sizes etc..

* Improved performance

* Improved performance

* fix merge error

* fix styling

* fix tests

* Fix build issues

* improve queue size recommendations

* put max limits on threads

* fix checkstyle
2025-06-30 07:40:25 -07:00
Mohit Yadav
5b6e25999e
Disabled bot creating activity feeds (#21773)
* - Disabled bot creating feeds

* - nit
2025-06-14 19:21:00 +05:30
Mohit Yadav
6124c5573e
fix tests (#21751) 2025-06-12 20:47:41 -07:00