Nahuel 3156f39d73
Fix #6020: Doc migration - Dev & SDK (#6116)
* Add SDK documentation

* Add developers documentation
2022-07-17 18:18:22 +02:00

---
title: Bulk Sink
slug: /sdk/python/build-connector/bulk-sink
---
# BulkSink
**BulkSink** is an optional component in the workflow. It can be used to bulk update the records generated in a workflow. It must be used in conjunction with a Stage.
## API
```python
@dataclass  # type: ignore[misc]
class BulkSink(Closeable, metaclass=ABCMeta):
    ctx: WorkflowContext

    @classmethod
    @abstractmethod
    def create(cls, config_dict: dict, metadata_config_dict: dict, ctx: WorkflowContext) -> "BulkSink":
        pass

    @abstractmethod
    def write_records(self) -> None:
        pass

    @abstractmethod
    def get_status(self) -> BulkSinkStatus:
        pass

    @abstractmethod
    def close(self) -> None:
        pass
```
**create** is called during workflow instantiation and creates an instance of the bulk sink.

**write_records** is called only once in the workflow. It is the developer's responsibility to perform the bulk actions inside this method, such as reading the entire file or store to generate the API calls to external services.

**get_status** reports the status of the bulk sink, for example how many records were processed and any failures or warnings.

**close** is called before the workflow stops. It can be used to clean up any connections or other resources.
## Example
[Example implementation](https://github.com/open-metadata/OpenMetadata/blob/main/ingestion/src/metadata/ingestion/bulksink/metadata_usage.py#L52)
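
For a quick feel of the lifecycle described above (`create`, then a single `write_records`, then `get_status` and `close`), here is a minimal, self-contained sketch. It does not import the OpenMetadata package: `BulkSink` and `BulkSinkStatus` below are simplified stand-ins written for this illustration, and `JsonLinesBulkSink`, the `filename` config key, the `sent` list (standing in for an external service), and `run_demo` are all hypothetical names, not part of the SDK.

```python
import json
import os
import tempfile
from abc import ABCMeta, abstractmethod
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass
class BulkSinkStatus:
    """Stand-in status object (assumption: not the SDK's real class)."""
    records: int = 0
    warnings: List[str] = field(default_factory=list)
    failures: List[str] = field(default_factory=list)


class BulkSink(metaclass=ABCMeta):
    """Simplified stand-in for the interface shown in the API section."""

    @classmethod
    @abstractmethod
    def create(cls, config_dict: dict, metadata_config_dict: dict, ctx: Any) -> "BulkSink":
        pass

    @abstractmethod
    def write_records(self) -> None:
        pass

    @abstractmethod
    def get_status(self) -> BulkSinkStatus:
        pass

    @abstractmethod
    def close(self) -> None:
        pass


class JsonLinesBulkSink(BulkSink):
    """Hypothetical bulk sink: reads a staged JSON-lines file in one pass."""

    def __init__(self, filename: str, ctx: Any = None) -> None:
        self.filename = filename
        self.ctx = ctx
        self.status = BulkSinkStatus()
        self.sent: List[Dict[str, Any]] = []  # stands in for an external service
        self.closed = False

    @classmethod
    def create(cls, config_dict: dict, metadata_config_dict: dict, ctx: Any) -> "JsonLinesBulkSink":
        # "filename" is an assumed config key for this sketch.
        return cls(config_dict["filename"], ctx)

    def write_records(self) -> None:
        # Called once: read the whole staged file and "send" each record.
        with open(self.filename, encoding="utf-8") as staged:
            for line_no, line in enumerate(staged, start=1):
                line = line.strip()
                if not line:
                    continue
                try:
                    record = json.loads(line)
                except json.JSONDecodeError as err:
                    self.status.failures.append(f"line {line_no}: {err.msg}")
                    continue
                self.sent.append(record)
                self.status.records += 1

    def get_status(self) -> BulkSinkStatus:
        return self.status

    def close(self) -> None:
        # A real sink would release connections or file handles here.
        self.closed = True


def run_demo() -> BulkSinkStatus:
    """Write a small staged file, then drive the sink through its lifecycle."""
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "staged.jsonl")
        with open(path, "w", encoding="utf-8") as out:
            out.write('{"table": "orders", "count": 3}\n')
            out.write("not json\n")
            out.write('{"table": "users", "count": 7}\n')
        sink = JsonLinesBulkSink.create({"filename": path}, {}, ctx=None)
        try:
            sink.write_records()
            return sink.get_status()
        finally:
            sink.close()
```

Calling `run_demo()` returns the status object after one bulk pass over the staged file. In this sketch a malformed line is recorded as a failure rather than aborting the run, which is one possible design choice; a real sink may prefer to stop on the first error.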