C-MTEB
======

Introduction
------------

`C-MTEB <https://github.com/FlagOpen/FlagEmbedding/tree/master/C_MTEB>`_ is a benchmark for Chinese text embeddings. It contains 35
datasets across 6 different tasks, providing a comprehensive evaluation of an embedding model's quality on Chinese text.

.. image:: ../_static/img/C_MTEB.png
   :width: 700
   :align: center


Installation
------------

C-MTEB is built on MTEB. You can install it with pip:

.. code:: bash

   pip install -U C_MTEB

Or install it from the FlagEmbedding repository:

.. code:: bash

   git clone https://github.com/FlagOpen/FlagEmbedding.git
   cd FlagEmbedding/C_MTEB
   pip install -e .

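
After either installation route, a quick import check can confirm that the packages are visible to the current Python environment. The snippet below is a minimal sketch, not part of C-MTEB itself: the helper names ``is_importable`` and ``check_install`` are hypothetical, and the import names ``mteb`` and ``C_MTEB`` are assumptions based on the package names above.

.. code:: python

   import importlib.util


   def is_importable(module_name: str) -> bool:
       """Return True if `module_name` can be located by the import system."""
       try:
           return importlib.util.find_spec(module_name) is not None
       except (ImportError, ValueError):
           # Raised for missing parent packages or modules without a spec.
           return False


   def check_install(modules=("mteb", "C_MTEB")) -> dict:
       """Map each module name to whether it appears to be installed.

       The default names are assumed import names, not confirmed by this page.
       """
       return {name: is_importable(name) for name in modules}


   if __name__ == "__main__":
       for name, ok in check_install().items():
           print(f"{name}: {'found' if ok else 'missing'}")

If either name is reported as missing, rerunning the corresponding install command in the same environment is a reasonable first step.
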

Citing the Work
---------------

More details are available in our publication. If you find C-MTEB useful, please cite it as:

.. code::

   @misc{c-pack,
         title={C-Pack: Packaged Resources To Advance General Chinese Embedding},
         author={Shitao Xiao and Zheng Liu and Peitian Zhang and Niklas Muennighoff},
         year={2023},
         eprint={2309.07597},
         archivePrefix={arXiv},
         primaryClass={cs.CL}
   }