2024-11-08 15:36:04 +08:00

47 lines
1.2 KiB
ReStructuredText

.. C-MTEB
.. ======
.. Introduction
.. ------------
.. `C-MTEB <https://github.com/FlagOpen/FlagEmbedding/tree/master/C_MTEB>`_ is a benchmark for chinese text embedding. It contains 35
.. datasets in 6 different tasks, providing a comprehensive evaluation to the quality of an embedding model on Chinese.
.. .. image:: ../_static/img/C_MTEB.png
.. :width: 700
.. :align: center
.. Installation
.. ------------
.. C-MTEB is developed based on MTEB, you can install C-MTEB by:
.. .. code:: bash
.. pip install -U C_MTEB
.. or install by FlagEmbedding's repo:
.. .. code:: bash
.. git clone https://github.com/FlagOpen/FlagEmbedding.git
.. cd FlagEmbedding/C_MTEB
.. pip install -e .
.. Citing the Work
.. ---------------
.. There are more details in our publication. If you find C-MTEB useful, you can cite it by:
.. .. code::
.. @misc{c-pack,
.. title={C-Pack: Packaged Resources To Advance General Chinese Embedding},
.. author={Shitao Xiao and Zheng Liu and Peitian Zhang and Niklas Muennighoff},
.. year={2023},
.. eprint={2309.07597},
.. archivePrefix={arXiv},
.. primaryClass={cs.CL}
.. }