Description
Toolkit for linearizing PDFs for LLM datasets/training
License: Apache-2.0
Size: 359 MiB
Languages
Python 90.2%
HTML 5.7%
Shell 3.9%
Dockerfile 0.2%