shibing624/text2vec
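text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models including Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.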
Python · Apache-2.0
Issues
Warning when running the second example snippet: FutureWarning: `resume_download` is deprecated and will be removed in version 1.0.0. Downloads always resume when possible. If you want to force a new download, use `force_download=True`. warnings.warn(
#151 opened by Phoenix233333 - 1
Is ollama supported?
#155 opened by smileyboy2019 - 5
How can I specify the model's local path so it does not have to be downloaded from Hugging Face?
#153 opened by omaiyiwa - 4
loss function
#152 opened by riyajatar37003 - 1
Where can I find a 1024-dimension model?
#150 opened by wmvm0 - 1
What format is the input.txt file?
#148 opened by DokiDoki1103 - 6
Training loss stays at 0. What is going on?
#146 opened by ann22 - 2
Question about BGE fine-tuning
#125 opened by CuteMing - 2
Question about the Spearman evaluation method
#133 opened by hellopahe - 1
Running sim.get_score("hello","hello") raises an error.
#147 opened by melisa81 - 1
Converting between tokens and Chinese characters in text2vec
#145 opened by cutelitchi - 3
Question about fine-tuning bge on my own data
#144 opened by LanceLuoyuan - 1
Similarity question.
#142 opened by chunnan6666 - 1
examples/gradio_demo.py
#136 opened by realnghon - 1
Code question: matrix computation of the in-batch cos loss
#143 opened by CathyKitten - 3
Model fine-tuning question
#141 opened by CathyKitten - 2
About model selection
#140 opened by Xiao-congxi - 2
Can the vector dimension of text2vec-base-multilingual be changed from 384 to 768?
#138 opened by dingcb - 1
Model fine-tuning hangs
#139 opened by CathyKitten - 2
Question about BGE distillation
#137 opened by hgwu4869 - 2
CoSENT loss computation question
#135 opened by YingchaoX - 1
Multi-process encode does not run properly
#131 opened by imempty - 2
Question: is simcse the same thing as sbert?
#129 opened by TanXiang7o - 2
Question about v1.2.0 optimization details
#130 opened by MingFL - 1
triton server Deployment
#122 opened by gaoyuan5251 - 1
Could you test whether text vectors computed by LLMs (large language models) substantially improve semantic matching accuracy?
#128 opened by doptime - 7
How to run im_model.model.encode() on multiple GPUs
#126 opened by newbietuan - 13
Running the training_sup_text_matching_model_mydata.py script from the command line fails with: You have to specify either input_ids or inputs_embeds
#124 opened by 1006076811 - 1
Training data
#127 opened by HaoRenkk123 - 2
Using the model offline
#116 opened by Huanyongji - 4
Loss backpropagation yields all zeros
#120 opened by rangehow - 3
Beginner question
#121 opened by xuyang20232333 - 1
Title image does not display correctly
#117 opened by peilongchencc - 6
Training on a single machine with 8 GPUs is slower than with sentence-transformers.
#114 opened by sangyongjia - 3
Problem after converting to ONNX, looking for help
#115 opened by dtMndas - 0
Confusion about the loss in cosnet
#112 opened by lightislost - 1
The difference between STSB and STSBenchmark
#111 opened by staoxiao - 3
Loss plateaus at a high value and stops decreasing; cannot reproduce the results
#113 opened by Mike4Ellis - 1
AutoTokenizer fails to save ernie3.0
#110 opened by yuankunW - 2
How do I enable multi-GPU training?
#109 opened by sangyongjia