mirror of
https://github.com/FlagOpen/FlagEmbedding.git
synced 2026-01-05 19:51:31 +00:00
dataset
parent ca5bf23c89
commit 23fd41436c
@@ -7,6 +7,6 @@ This will point to the training data we use for training various models.
 | [MLDR](https://huggingface.co/datasets/Shitao/MLDR) | Document Retrieval Dataset, covering 13 languages |
 | [bge-m3-data](https://huggingface.co/datasets/Shitao/bge-m3-data) | Fine-tuning data used by [bge-m3](https://huggingface.co/BAAI/bge-m3) |
 | [public-data](https://huggingface.co/datasets/cfli/bge-e5data) | Public data identical to [e5-mistral](https://huggingface.co/intfloat/e5-mistral-7b-instruct) |
-| [full-data](https://huggingface.co/datasets/cfli/bge-full-data) | The full dataset we used for training [bge-en-icl](BAAI/bge-en-icl) |
+| [full-data](https://huggingface.co/datasets/cfli/bge-full-data) | The full dataset we used for training [bge-en-icl](https://huggingface.co/BAAI/bge-en-icl) |
 | [reranker-data](Shitao/bge-reranker-data) | a mixture of multilingual datasets |