docling/docs/integrations/data_prep_kit.md
Oleg Lavrovsky 844babb390
docs: update links in data_prep_kit (#1559)
Update data_prep_kit.md

The links were broken, since the repository was renamed. I also noticed that PDF2Parquet is now referred to as Docling2Parquet.

Signed-off-by: Oleg Lavrovsky <31819+loleg@users.noreply.github.com>
2025-05-11 20:38:25 +02:00

741 B

Docling is used by the Data Prep Kit open-source toolkit for preparing unstructured data for LLM application development ranging from laptop scale to datacenter scale.

Components

PDF ingestion to Parquet

Document chunking