Jake Poznanski c00e40d1c4 More fixes
2024-09-26 23:10:07 +00:00
2024-09-26 23:10:07 +00:00
2024-09-25 22:08:36 +00:00
2024-09-26 23:10:07 +00:00
2024-09-17 07:53:43 -07:00
2024-09-17 07:53:43 -07:00
2024-09-17 07:53:43 -07:00
bnb
2024-09-26 03:30:35 +00:00
2024-09-17 07:53:43 -07:00

pdelfin

Description
Toolkit for linearizing PDFs for LLM datasets/training
Readme Apache-2.0 349 MiB
Languages
Python 91.7%
HTML 5.6%
Shell 2.5%
Dockerfile 0.2%