mirror of
https://github.com/Unstructured-IO/unstructured.git
synced 2025-09-06 07:12:37 +00:00

Refactor the evaluation scripts, including `unstructured/ingest/evaluation.py` and `test_unstructured_ingest/evaluation-metrics.sh`, for more structured code and usage.

- The shell script now makes only one Python script call, with parameters.
- Adds a function to build the strings for output_args (`--output_dir`, `--output_list`) and source_args (`--source_dir`, `--source_args`).
- Now accepts the evaluation to call as a parameter; currently only `text-extraction` and `element-type` are accepted.

Example call:

```sh
evaluation-metrics.sh text-extraction
```

---------

Co-authored-by: ryannikolaidis <1208590+ryannikolaidis@users.noreply.github.com>
Co-authored-by: Klaijan <Klaijan@users.noreply.github.com>
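The argument-building and evaluation-selection behaviour described in the commit message could be sketched roughly as below. This is an illustration only, not the repository's code: `build_arg_string` and `select_evaluation` are hypothetical names, and only the two evaluation names and the flag names are taken from the commit message.

```python
import shlex
from typing import List, Sequence

# Evaluation names listed in the commit message; nothing else is accepted.
SUPPORTED_EVALUATIONS = ("text-extraction", "element-type")


def build_arg_string(
    dir_flag: str,
    directory: str,
    list_flag: str,
    names: Sequence[str] = (),
) -> str:
    """Hypothetical helper: join a directory flag and repeated list flags."""
    parts: List[str] = [dir_flag, directory]
    for name in names:
        # Repeat the list flag once per file name.
        parts += [list_flag, name]
    # Quote each piece so paths with spaces survive a shell call.
    return " ".join(shlex.quote(part) for part in parts)


def select_evaluation(name: str) -> str:
    """Hypothetical helper: reject evaluation names that are not supported."""
    if name not in SUPPORTED_EVALUATIONS:
        raise ValueError(
            f"unknown evaluation {name!r}; expected one of {SUPPORTED_EVALUATIONS}"
        )
    return name


if __name__ == "__main__":
    evaluation = select_evaluation("text-extraction")
    output_args = build_arg_string(
        "--output_dir", "structured-output", "--output_list", ["a.json", "b.json"]
    )
    print(evaluation, output_args)
```

Keeping the flag names as parameters means one helper can serve both the output and the source arguments, which is one plausible reading of the refactor described above.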
150 B
| filename | connector | cct-accuracy | cct-%missing |
|---|---|---|---|
| handbook-1p.docx | box | 0.974 | 0.03 |
| example-10k.html | local | 0.686 | 0.04 |
| IRS-form-1987.pdf | azure | 0.783 | 0.13 |