OpenMetadata/ingestion/examples/workflows/redshift_profiler.yaml
Teddy 78b5f8c8e2
Part 1 of #5831 -- Profiler workflow implementation (#6809)
* Added database filter in workflow

* Removed association between profiler and data quality

* fixed tests with removed association

* Fixed sonar code smells and bugs

* Updated profiler workflow to:
- support only running profiler (removed test run)
- support column inclusion and exclusion
- added back support for partitioned table and sample

* moved status to workflow

* Fixed tests

* removed test logic from profiler sink

* Added logic to return sample from workflow sample value

* Added profiler examples

* Updated documentation for profiler

* Fixed code smells
2022-08-19 10:52:08 +02:00


source:
  type: redshift
  serviceName: local_redshift
  serviceConnection:
    config:
      hostPort: my-host:5439
      username: username
      password: strongPassword
      database: databseToConnect
      type: Redshift
  sourceConfig:
    config:
      type: Profiler
      generateSampleData: true
      databaseFilterPattern:
        includes:
          - database
      schemaFilterPattern:
        includes:
          - schema_one
        excludes:
          - schema_two
      tableFilterPattern:
        includes:
          - orders
          - customers
processor:
  type: "orm-profiler"
  config:
    tableConfig:
      - fullyQualifiedName: local_redshift.database.schema_one.orders
        profileSample: 85
        columnConfig:
          includeColumns:
            - columnName: order_id
            - columnName: order_date
            - columnName: status
sink:
  type: metadata-rest
  config: {}
workflowConfig:
  openMetadataServerConfig:
    hostPort: http://localhost:8585/api
    authProvider: no-auth
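
The `schemaFilterPattern` and `tableFilterPattern` entries above select which schemas and tables the profiler visits. The following is a minimal sketch of how such include/exclude lists could be evaluated. It assumes each entry is a regular expression matched from the start of the name and that an exclude match takes precedence over an include match; the actual OpenMetadata ingestion logic may differ. The helper `is_included` and the candidate names `schema_three` and `payments` are hypothetical and used only for illustration.

```python
import re
from typing import Iterable, Optional


def is_included(
    name: str,
    includes: Optional[Iterable[str]] = None,
    excludes: Optional[Iterable[str]] = None,
) -> bool:
    """Hypothetical helper: decide whether `name` passes a filter pattern.

    Assumption: patterns are regexes anchored at the start of the name
    (re.match), and an exclude match always wins over an include match.
    """
    if excludes and any(re.match(pattern, name) for pattern in excludes):
        return False
    # With no include list, anything that was not excluded passes.
    if not includes:
        return True
    return any(re.match(pattern, name) for pattern in includes)


# Filter patterns copied from the workflow file above.
schema_filter = {"includes": ["schema_one"], "excludes": ["schema_two"]}
table_filter = {"includes": ["orders", "customers"]}

# Hypothetical candidate names, as a source might report them.
schemas = ["schema_one", "schema_two", "schema_three"]
tables = ["orders", "customers", "payments"]

kept_schemas = [s for s in schemas if is_included(s, **schema_filter)]
kept_tables = [t for t in tables if is_included(t, **table_filter)]

print(kept_schemas)
print(kept_tables)
```

Because `re.match` only anchors at the start, a pattern such as `orders` in this sketch would also admit a table named `orders_archive`; writing `^orders$` would restrict it to the exact name.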