mirror of
https://github.com/open-metadata/OpenMetadata.git
synced 2025-11-02 11:39:12 +00:00
* Refactor presidio utils Extract the spacy model functionality from the analyzer building function * Added a new `TagClassifier` This classifier uses tags to dynamically build presidio `RecognizerRegistry`s * Added a new `TagProcessor` This processor uses `TagClassifier` to label a column based on the tags' recognizers * Create `TagProcessor` based on workflow configuration * Create decorator to apply threshold to recognizers This is so that we can apply thresholds on recognizer results without subclassing or having to keep a map between the presidio recognizer and the recognizer configuration * Fix broken test