tarunpandey23 69342d4783
Feat: update generate docs schema (#19372)
* Updated generate_docs_schema

* Updated generate_docs_schema

* Updated generate_docs_schema

* applied generate_docs_schema script

* Revert the changes in this file

---------

Co-authored-by: Tarun <tarun.p@deuexsolutions.com>
Co-authored-by: Prajwal214 <167504578+Prajwal214@users.noreply.github.com>
2025-01-15 17:46:55 +05:30

6.2 KiB

title slug
container /main-concepts/metadata-standard/schemas/entity/data/container

Container

This schema defines the Container entity. A Container is an abstraction for any path(including the top level eg. bucket in S3) storing data in an Object store such as S3, GCP, Azure. It maps a tree-like structure, where each Container can have a parent and a list of sub-folders, and it can be structured - where it contains structured data, or unstructured where no schema for its data is defined.

Properties

Definitions

  • containerDataModel (object): This captures information about how the container's data is modeled, if it has a schema. . Cannot contain additional properties.
    • isPartitioned (boolean): Whether the data under this container is partitioned by some property, eg. eventTime=yyyy-mm-dd. Default: false.
    • columns (array, required): Columns belonging to this container's schema.
  • fileFormat (string): This schema defines the file formats for the object/files within a container. Must be one of: ["zip", "gz", "zstd", "csv", "tsv", "json", "parquet", "avro"].

Documentation file automatically generated at 2025-01-15 09:05:25.266839+00:00.