Pere Menal-Ferrer
44e09e41a2
Revert "FIX #1464 (#21520)" (#21726)
...
This reverts commit 1e86f9870fd663122b9bbb64f3cf17cf32619c7f.
2025-06-13 17:27:32 +02:00
Pere Menal-Ferrer
1e86f9870f
FIX #1464 (#21520)
...
* Add PIICategoryTags and some utilities on top of them.
* Fix static-check
* Add test for fqn representation
* Add NEREntityGeneralTags.json from Collate
* Add test to check PIICategoryTags agree with the ones used by OM server
* Add LabelExtractor
* Fix style
* Add ignore superfluous-parens for pylint
* Add comment as per PR review
* Fix not-updated PII-IT
* Remove duplicated IT test for PII
---------
Co-authored-by: Pere Menal <pere.menal@getcollate.io>
Co-authored-by: Sriharsha Chintalapani <harshach@users.noreply.github.com>
2025-06-09 16:05:35 -07:00
Mayur Singal
7760663b22
MINOR: Change ingestion licence header (#20549)
2025-04-03 10:39:47 +05:30
Pere Miquel Brull
c309906a1b
MINOR - Bump Presidio Analyzer and validate support for legal entities (#17750)
2024-09-06 16:07:08 +02:00
Pere Miquel Brull
8191202850
MINOR - Better PII classification for JSON data (#17734)
...
* MINOR - Better PII classification for JSON data
* linting
2024-09-06 08:54:23 +02:00
Pere Miquel Brull
2237d5a8d5
MINOR - PII Scanner tests and log levels (#17686)
...
* MINOR - PII Scanner tests and log levels
2024-09-04 12:11:07 +02:00
Pere Miquel Brull
0282574bdd
Create ometa client once and pass it around & improve pycln config (#13310)
...
* Create ometa client once and pass it around & improve pycln config
* Fix
* Fix
* Fix tests
* Fix maven ci
* Fix tests
* Fix tests
* Fix tests
* Format
* Fix DI
2023-10-04 09:14:03 +02:00
Pere Miquel Brull
de7e06d024
Update structure for PII processing (#13079)
...
* Update structure for PII processing
* Fix tests
* Fix tests
* Lint
* Remove typo
2023-09-06 11:30:46 +02:00
Pere Miquel Brull
a3bfd4e696
Part of #11968 - Restructure Profiler Workflow and PII Processor (#13059)
...
* Structure PII
* Restructure Profiler Workflow
* Update signature for abc
* remove profiler sink
* Fix tests
* Fix lint
* Fix test
* Fix test
2023-09-04 11:02:57 +02:00
Pere Miquel Brull
0eb2201f94
Restructure NER Scanner internals (#11690)
...
* Simplify col name scanner
* Restructure NER Scanner internals
2023-05-19 18:21:01 +02:00
Pere Miquel Brull
8795337f88
Clean NER Scanner imports (#11653)
2023-05-18 12:53:22 +02:00
Pere Miquel Brull
1b90badd0e
Restructure PII processor (#11640)
...
* Restructure PII processor
* Format
2023-05-17 15:58:17 +02:00