Eugenio a49b720482
Improvements to accuracy of auto-classification (#24337)
* Add test cases

* Change preprocessing for autoclassification

* Change patterns of recognizers

Affected recognizers:
- `CreditCardRecognizer`: also included a sanitation step before analyzing
- `UsLicenseRecognizer`: pattern that avoids version strings like `V1`
- `AuTfnNumber`: be more specific with patterns

* Remove normalization in processor

The goal is to remove a normalization that doesn't make much sense in an environment where we may get only one result.

Use `tolerance` to filter noise and depend on `confidence_threshold` to remove low score results

* Fix tests
2025-11-17 10:48:12 +01:00
..
2025-05-27 10:56:52 +02:00
2025-06-17 23:44:29 +05:30