Description
Toolkit for linearizing PDFs for LLM datasets/training
Readme Apache-2.0 375 MiB
Languages
Python 87.9%
Shell 6.4%
HTML 5.6%
Dockerfile 0.1%