OpenMetadata/.pre-commit-config.yaml
Pere Menal-Ferrer 5d2dfa712a
feature/pii-processor-improvement (#21248)
* Add PII Tag and Sensitivity Level enums.

* Add feature-extraction for PII classification tasks

* Add faker as test dependency

* Add unit tests for presidio tag extractor

* Add PIISensitivityTags enum and update sensitivity mapping logic

* Add Presidio utility functions for PII analysis

* Extend column name regexes for PII

* Add tests for PAN, NIF, SSN entities

* Fix version of faker to prevent flaky tests. Fix failing tests.

* Add Generated to State enum

* Integrate PIISensitive classifier to PIIProcessor
2025-05-19 17:52:17 +00:00

default_language_version:
  python: python3
repos:
  - repo: https://github.com/pre-commit/pre-commit-hooks
    rev: v2.3.0
    hooks:
      - id: check-json
        exclude: vscode
  - repo: https://github.com/hadialqattan/pycln
    rev: v2.5.0
    hooks:
      - id: pycln
        files: ^(ingestion|openmetadata-airflow-apis)/
        args: [ "--config", "ingestion/pyproject.toml" ]
  - repo: https://github.com/timothycrosley/isort
    rev: 5.12.0
    hooks:
      - id: isort
        files: ^(ingestion|openmetadata-airflow-apis)/
        args: [ "--settings-file", "ingestion/pyproject.toml" ]
  - repo: https://github.com/ambv/black
    rev: 22.3.0
    hooks:
      - id: black
        files: ^(ingestion|openmetadata-airflow-apis)/
        args: [ "--config", "ingestion/pyproject.toml" ]
  - repo: https://github.com/pre-commit/mirrors-prettier
    rev: v2.5.1
    hooks:
      - id: prettier
        files: ^openmetadata-service/src/main/resources/json/schema/
  - repo: local
    hooks:
      - id: google-style-java
        name: Google Java Code Style for Java
        description: Formats code in Google's Java codestyle with 120 line length.
        entry: scripts/format-code.sh
        language: script
        files: \.java$
        require_serial: true
  - repo: local
    hooks:
      - id: lint-staged
        name: lint-staged
        description: Formats TS and JS code.
        entry: scripts/lint-staged.sh
        language: script
        files: \.(js|jsx|ts|tsx)$
        require_serial: true