diff --git a/CHANGELOG.md b/CHANGELOG.md index e91cca51c..d359164a6 100644 --- a/CHANGELOG.md +++ b/CHANGELOG.md @@ -1,3 +1,11 @@ +## 0.10.22-dev0 + +### Enhancements + +### Features + +### Fixes + ## 0.10.21 * **Adds Scarf analytics**. diff --git a/Makefile b/Makefile index 1601bc99a..25b4a56d9 100644 --- a/Makefile +++ b/Makefile @@ -29,7 +29,7 @@ install-base-ci: install-base-pip-packages install-nltk-models install-test .PHONY: install-base-pip-packages install-base-pip-packages: python3 -m pip install pip==${PIP_VERSION} - python3 -m pip install -r requirements/base.txt --extra-index-url https://packages.unstructured.io/simple/ + python3 -m pip install -r requirements/base.txt .PHONY: install-huggingface install-huggingface: diff --git a/README.md b/README.md index f2fbafb42..f342c74b2 100644 --- a/README.md +++ b/README.md @@ -110,9 +110,9 @@ python3 Use the following instructions to get up and running with `unstructured` and test your installation. -- Install the Python SDK to support all document types with `pip install "unstructured[all-docs]" --extra-index-url https://packages.unstructured.io/simple/` - - For plain text files, HTML, XML, JSON and Emails that do not require any extra dependencies, you can run `pip install unstructured --extra-index-url https://packages.unstructured.io/simple/` - - To process other doc types, you can install the extras required for those documents, such as `pip install "unstructured[docx,pptx]" --extra-index-url https://packages.unstructured.io/simple/` +- Install the Python SDK to support all document types with `pip install "unstructured[all-docs]"` + - For plain text files, HTML, XML, JSON and Emails that do not require any extra dependencies, you can run `pip install unstructured` + - To process other doc types, you can install the extras required for those documents, such as `pip install "unstructured[docx,pptx]"` - Install the following system dependencies if they are not already available on your system. Depending on what document types you're parsing, you may not need all of these. - `libmagic-dev` (filetype detection) @@ -192,7 +192,7 @@ The **Connectors** 🔗 in `unstructured` serve as vital links between the pre-p ### PDF Document Parsing Example The following examples show how to get started with the `unstructured` library. You can parse over a dozen document types with one line of code! Use this [Colab notebook](https://colab.research.google.com/drive/1U8VCjY2-x8c6y5TYMbSFtQGlQVFHCVIW) to run the example below. -The easiest way to parse a document in unstructured is to use the `partition` brick. If you use `partition` brick, `unstructured` will detect the file type and route it to the appropriate file-specific partitioning brick. If you are using the `partition` brick, you may need to install additional parameters via `pip install unstructured[local-inference] --extra-index-url https://packages.unstructured.io/simple/`. Ensure you first install `libmagic` using the instructions outlined [here](https://unstructured-io.github.io/unstructured/installing.html#filetype-detection) `partition` will always apply the default arguments. If you need advanced features, use a document-specific brick. +The easiest way to parse a document in unstructured is to use the `partition` brick. If you use `partition` brick, `unstructured` will detect the file type and route it to the appropriate file-specific partitioning brick. If you are using the `partition` brick, you may need to install additional parameters via `pip install unstructured[local-inference]`. Ensure you first install `libmagic` using the instructions outlined [here](https://unstructured-io.github.io/unstructured/installing.html#filetype-detection) `partition` will always apply the default arguments. If you need advanced features, use a document-specific brick. ```python from unstructured.partition.auto import partition diff --git a/unstructured/__version__.py b/unstructured/__version__.py index 737b6043f..9be3804ea 100644 --- a/unstructured/__version__.py +++ b/unstructured/__version__.py @@ -1 +1 @@ -__version__ = "0.10.21" # pragma: no cover +__version__ = "0.10.22-dev0" # pragma: no cover