princeton-nlp/SimCSE
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
PythonMIT
Issues
- 1
4 x 24 GB OOM for sup-SimCSE-BERT-base
#286 opened by Anonymous-AI1 - 0
[Question] Pre-trained models for RoBERTa
#290 opened by fahos - 2
Sampling Negatives for Sentiment Analysis
#288 opened by konan009 - 0
3.10及以上的python无法安装simcse0.4版本
#289 opened by ganlinganlin - 4
[Question] The optimizer used for training
#287 opened by fahos - 1
Installation Issues
#284 opened by Diamondterritory - 4
Cannot reproduce the result for `bert-base-uncased`, `avg_first_last` setting
#285 opened by kuriyan1204 - 2
The function ‘search’ only returns one result
#282 opened by MeiNanXue - 4
- 2
- 7
couldn't install cimcse
#280 opened by vincent507cpu - 2
drpout
#279 opened by riyajatar37003 - 1
An error when max_seq_length is set too long
#278 opened by Madilynalisa - 2
setting an array element with a sequence. The requested array has an inhomogeneous shape after 1 dimensions. The detected shape was (750,) + inhomogeneous part.
#276 opened by JYru - 2
- 2
AttributeError: 'OurTrainingArguments' object has no attribute 'distributed_state'
#275 opened by zongxindu - 4
- 2
关于 Supervised SimCSE 的 GPU Memory Usage
#277 opened by AnonymXXXXX - 4
Can I replace the base model with longformer? Is it a simple replacement or does the code also need to be changed? Thank you for your answer.
#271 opened by Struggle-lsl - 2
Can I load saved index to GPU?
#270 opened by Maydaytyh - 0
ValueError: Mixed precision training with AMP or APEX (`--fp16`) can only be used on CUDA devices.
#273 opened by LittleZ2022 - 1
- 3
- 4
- 6
关于simcse build_index 的速度问题
#266 opened by Maydaytyh - 4
About model format conversion
#262 opened by fzxxg - 6
Problems pretraining simcse on custom dataset
#260 opened by mphomokoatle - 2
- 2
- 3
Why add two sentences in prepare_features?
#263 opened by FinalFlowers - 1
no response with training customer dataset
#261 opened by BarryC7 - 2
unsup-SimCSE-BERTbase复现不出paper的结果
#258 opened by oufangwei - 5
Support python 3.10 (current Google Colab runtime)
#248 opened by mocobeta - 2
求roberta中文预训练模型
#256 opened by XiangwenNing - 3
Training dimension error for supervised SIMCSE
#251 opened by ko120 - 2
questions related to the comparison with other common data augmentation in Table 1
#259 opened by Shuwen27 - 2
Cannot install transformers==4.2.1
#239 opened by namin-an - 0
Google Colab error
#252 opened by NorahUseringithub - 2
请问非对称语义的数据集可否应用在有监督的simCSE模型上呢?
#253 opened by zhuzhushiw - 0
求一份中文版albert预训练模型
#255 opened by XiangwenNing - 0
求一份中文roberta预训练模型
#254 opened by XiangwenNing - 1
training code error OSerror
#250 opened by shyzzz521 - 1
你好,请问我用model.encode为什么获取不到句子的长度信息
#245 opened by Reset-aa - 1
- 1
- 1
如何用于层级多标签数据呢?
#243 opened by littttttlebird - 2
Question About bad results from trained model
#240 opened by Alison-starbeat - 2
How do you get the supervised nli dataset?
#242 opened by leoozy - 3
How to train your model to better fit SentEval
#241 opened by ZBWpro - 2
Number of rows in NLI dataset
#238 opened by xlpczv