6 Commits

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| chenbin11200 | ff948d5403 | feat(nn4k): not load model to cuda when using deepspeed (#207) | 2024-04-19 11:20:25 +08:00 |
| chenbin11200 | bf57b3319f | feat(nn4k): support huggingface decode only model local inference (#128). Co-authored-by: xionghuaidong <huaidong.xhd@antgroup.com> | 2024-03-08 13:54:15 +08:00 |
| chenbin11200 | eb2590aada | feat(nn4k): add huggingface decode only model local sft feature (#1) (#109). Co-authored-by: xionghuaidong <huaidong.xhd@antgroup.com> | 2024-02-22 14:08:21 +08:00 |
| xionghuaidong | 945cf8fbbd | feat(nn4k): implement text embeddings (#104) | 2024-02-02 16:29:24 +08:00 |
| baifuyu | 042c4e1ed1 | chore(license): update license (#88) | 2024-01-16 14:38:37 +08:00 |
| xionghuaidong | 6c3f8584ec | feat(nn4k): implement openai invoker and local hf executor (#57). Co-authored-by: 基尔 <qy266141@antgroup.com>. Co-authored-by: didicout <julin.jl@antgroup.com> | 2024-01-06 12:12:12 +08:00 |