* Add PII Tag and Sensitivity Level enums.
* Add feature-extraction for PII classification tasks
* Add faker as test dependency
* Add unit tests for presidio tag extractor
* Add PIISensitivityTags enum and update sensitivity mapping logic
* Add Presidio utility functions for PII analysis
* Extend column name regexs for PII
* Add tests for PAN, NIF, SSN entities
* Fix version of faker to prevent flaky tests. Fix failing tests.
* Add Generated to State enum
* Integrate PIISensitive classifier to PIIProcessor
* Add PII Tag and Sensitivity Level enums.
* Add feature-extraction for PII classification tasks
* Add faker as test dependency
* Add unit tests for presidio tag extractor
* Add PIISensitivityTags enum and update sensitivity mapping logic
* Add Presidio utility functions for PII analysis
* Extend column name regexs for PII
* Add colum name split
* Move pii algorithms to dedicated package
* Add tests for PAN, NIF, SSN entities
* Fix linting
* Add comment on why we need to set specific lanaguage to Presidio recognizers, as per PR suggestion.
* Fix version of faker to prevent flaky tests. Fix failing tests.
* Fix wrong import
---------
Co-authored-by: Pere Menal <pere.menal@getcollate.io>
* Add missing workflow config status migrations
* Moved to Java based migrations to redeploy pipeline
* Moved migrations to Java. Updated Listener based on config
* Improved update on the migrations
(cherry picked from commit b761efbfd33e1888dc59e0ef7fe636a9a98aa8c5)
* wip
* feat: trigger external apps with override config
- Added in openmetadata-airflow-apis functionality to trigger DAG with feature.
- Modified openmetadata-airflow-apis application runner to accept override config from params.
- Added overloaded runPipeline with `Map<String,Object> config` to allow triggering apps with configuration. We might want to expand this to all ingestion pipelines. For now its just for apps.
- Implemented an example external app that can be used to test functionality of external apps. The app can be enabled by setting the `ENABLE_APP_HelloPipelines=true` environment variable.
* fix class doc for application
* fixed README for airflow apis
* fixes
* set HelloPipelines to disabeld by default
* fixed basedpywright errros
* fixed app schema
* reduced airflow client runPipeline to an overload with null config
removed duplicate call to runPipeline in AppResource
* Update openmetadata-docs/content/v1.7.x-SNAPSHOT/developers/applications/index.md
Co-authored-by: Matias Puerta <matias@getcollate.io>
* deleted documentation file
---------
Co-authored-by: Matias Puerta <matias@getcollate.io>
(cherry picked from commit d91273a30d28de793941de74636963e29aa9e5d4)
* fix the data asset header spacing in Metric entity
* fix the spacing between floating cards on homepage
(cherry picked from commit 992d3f966407369cabba2271a749180d2c17b2b9)
* fix: test case alerts and un-indexable fields
* fix: test case alerts and un-indexable fields
(cherry picked from commit b1edb964699c1d365b9e1154f05591e4b26ebb21)
* fix: properly close connection on sampler ingestion
* fix: dangling connection test
* style: ran python linting
* fix: revert to 9
(cherry picked from commit cd6434dd73cd7c60ef22a11972740f12686b3558)
* fixed BulkEditEntity flaky test
* fixed enter click issue
* removed changes that does not belongs to me in pr
* removed changes that does not belongs to me in pr
* removed unwated code
(cherry picked from commit c24a2dbdca9a8e9a99b0f53dc2373bdae1edb351)
* added edit display name for dashboard Data model
* used reusbale component
* used reusbale component
* removed unncessary code
(cherry picked from commit e886d708dfa179c747990569420fc233192ef842)
* Adds followers to services and database schema
* Add delete and followers fields
* Add missing fields
* Fix failing test for get entity with different fields
* Fix ingestion test case failure