Description
Toolkit for linearizing PDFs for LLM datasets/training
License: Apache-2.0
Repository size: 360 MiB
Languages
Python 90.1%
HTML 5.7%
Shell 4.0%
Dockerfile 0.2%