108 Commits

Author SHA1 Message Date
dependabot[bot]
6b15d706fd
build(deps): Bump huggingface-hub from 0.10.1 to 0.11.1 in /requirements (#94)
Bumps [huggingface-hub](https://github.com/huggingface/huggingface_hub) from 0.10.1 to 0.11.1.
- [Release notes](https://github.com/huggingface/huggingface_hub/releases)
- [Commits](https://github.com/huggingface/huggingface_hub/compare/v0.10.1...v0.11.1)

---
updated-dependencies:
- dependency-name: huggingface-hub
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: qued <64741807+qued@users.noreply.github.com>
2022-12-12 11:32:50 -06:00
dependabot[bot]
87a77abe45
build(deps): Bump argilla from 1.1.0 to 1.1.1 in /requirements (#93)
Bumps [argilla](https://github.com/argilla-io/argilla) from 1.1.0 to 1.1.1.
- [Release notes](https://github.com/argilla-io/argilla/releases)
- [Changelog](https://github.com/argilla-io/argilla/blob/develop/release.Dockerfile)
- [Commits](https://github.com/argilla-io/argilla/compare/v1.1.0...v1.1.1)

---
updated-dependencies:
- dependency-name: argilla
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-12-12 16:56:51 +00:00
dependabot[bot]
a111c3bef3
build(deps): Bump urllib3 from 1.26.12 to 1.26.13 in /requirements (#95)
Bumps [urllib3](https://github.com/urllib3/urllib3) from 1.26.12 to 1.26.13.
- [Release notes](https://github.com/urllib3/urllib3/releases)
- [Changelog](https://github.com/urllib3/urllib3/blob/main/CHANGES.rst)
- [Commits](https://github.com/urllib3/urllib3/compare/1.26.12...1.26.13)

---
updated-dependencies:
- dependency-name: urllib3
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Matt Robinson <mrobinson@unstructured.io>
2022-12-12 16:43:47 +00:00
dependabot[bot]
adf48f2530
build(deps): Bump filelock from 3.8.0 to 3.8.2 in /requirements (#96)
Bumps [filelock](https://github.com/tox-dev/py-filelock) from 3.8.0 to 3.8.2.
- [Release notes](https://github.com/tox-dev/py-filelock/releases)
- [Changelog](https://github.com/tox-dev/py-filelock/blob/main/docs/changelog.rst)
- [Commits](https://github.com/tox-dev/py-filelock/compare/3.8.0...3.8.2)

---
updated-dependencies:
- dependency-name: filelock
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-12-12 11:35:27 -05:00
asymness
2170a2aae2
feat: Implement Argilla staging brick (#81)
* Add argilla to dependencies and run pip-compile

* Implement Argilla staging brick and add unit tests

* Update version and changelog

* Update docs with description and usage for Argilla staging brick

* Remove unused fixtures and fix typo in Argilla tests

* add missing quote in docs

* changelog tweak

* doc tweaks

Co-authored-by: Matt Robinson <mrobinson@unstructuredai.io>
Co-authored-by: Matt Robinson <mrobinson@unstructured.io>
2022-11-28 14:41:48 +00:00
dependabot[bot]
d5bd44bee4
build(deps): Bump tokenizers from 0.13.1 to 0.13.2 in /requirements (#55)
Bumps [tokenizers](https://github.com/huggingface/tokenizers) from 0.13.1 to 0.13.2.
- [Release notes](https://github.com/huggingface/tokenizers/releases)
- [Changelog](https://github.com/huggingface/tokenizers/blob/main/RELEASE.md)
- [Commits](https://github.com/huggingface/tokenizers/compare/v0.13.1...v0.13.2)

---
updated-dependencies:
- dependency-name: tokenizers
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Matt Robinson <mrobinson@unstructured.io>
2022-11-07 11:40:02 -05:00
dependabot[bot]
ee9ead57c4
build(deps): Bump regex from 2022.9.13 to 2022.10.31 in /requirements (#48)
Bumps [regex](https://github.com/mrabarnett/mrab-regex) from 2022.9.13 to 2022.10.31.
- [Release notes](https://github.com/mrabarnett/mrab-regex/releases)
- [Commits](https://github.com/mrabarnett/mrab-regex/compare/2022.9.13...2022.10.31)

---
updated-dependencies:
- dependency-name: regex
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: cragwolfe <cragcw@gmail.com>
2022-10-31 10:52:51 -07:00
Matt Robinson
fb16847946
feat: Staging brick for attention window chunking (#34)
* add huggingface dependencies and re pip-compile

* first pass on chunk by attention window

* test for chunking function

* completed tests for chunk_by_attention_window

* change default buffer size to 2

* wrapper function for staging

* added docs for transformers

* fix wording and typos

* updated change log and bumped the version

* added docs on huggingface dependencies

* fix typo

* re pip-compile
2022-10-13 11:18:27 -04:00