Jake Poznanski c00e40d1c4 More fixes
2024-09-26 23:10:07 +00:00
2024-09-26 23:10:07 +00:00
2024-09-25 22:08:36 +00:00
2024-09-26 23:10:07 +00:00
2024-09-17 07:53:43 -07:00
2024-09-17 07:53:43 -07:00
2024-09-17 07:53:43 -07:00
bnb
2024-09-26 03:30:35 +00:00
2024-09-17 07:53:43 -07:00

pdelfin

Description
Toolkit for linearizing PDFs for LLM datasets/training
Readme Apache-2.0 359 MiB
Languages
Python 90.2%
HTML 5.7%
Shell 3.9%
Dockerfile 0.2%