mirror of
				https://github.com/open-metadata/OpenMetadata.git
				synced 2025-10-26 00:04:52 +00:00 
			
		
		
		
	
		
			
				
	
	
	
		
			2.8 KiB
		
	
	
	
	
	
	
	
			
		
		
	
	
			2.8 KiB
		
	
	
	
	
	
	
	
| title | description | slug | 
|---|---|---|
| DatabaseServiceAutoClassificationPipeline | Official Documentation | Connect Databaseserviceautoclassificationpipeline to enable streamlined access, monitoring, or search of enterprise data using secure and scalable integrations. | /main-concepts/metadata-standard/schemas/metadataingestion/databaseserviceautoclassificationpipeline | 
DatabaseServiceAutoClassificationPipeline
DatabaseService AutoClassification & Auto Classification Pipeline Configuration.
Properties
- type: Pipeline type. Refer to #/definitions/autoClassificationConfigType. Default:- "AutoClassification".
- classificationFilterPattern: Regex to only compute metrics for table that matches the given tag, tiers, gloassary pattern. Refer to ../type/filterPattern.json#/definitions/filterPattern.
- schemaFilterPattern: Regex to only fetch tables or databases that matches the pattern. Refer to ../type/filterPattern.json#/definitions/filterPattern.
- tableFilterPattern: Regex exclude tables or databases that matches the pattern. Refer to ../type/filterPattern.json#/definitions/filterPattern.
- databaseFilterPattern: Regex to only fetch databases that matches the pattern. Refer to ../type/filterPattern.json#/definitions/filterPattern.
- includeViews(boolean): Optional configuration to turn off fetching metadata for views. Default:- true.
- useFqnForFiltering(boolean): Regex will be applied on fully qualified name (e.g service_name.db_name.schema_name.table_name) instead of raw name (e.g. table_name). Default:- false.
- storeSampleData(boolean): Option to turn on/off storing sample data. If enabled, we will ingest sample data for each table. Default:- true.
- enableAutoClassification(boolean): Optional configuration to automatically tag columns that might contain sensitive information. Default:- false.
- confidence(number): Set the Confidence value for which you want the column to be tagged as PII. Confidence value ranges from 0 to 100. A higher number will yield less false positives but more false negatives. A lower number will yield more false positives but less false negatives. Default:- 80.
- sampleDataCount(integer): Number of sample rows to ingest when 'Generate Sample Data' is enabled. Default:- 50.
Definitions
- autoClassificationConfigType(string): Profiler Source Config Pipeline type. Must be one of:- ["AutoClassification"]. Default:- "AutoClassification".
Documentation file automatically generated at 2025-01-15 09:05:41.923720+00:00.
