simcse

There are 16 repositories under simcse topic.

  • shawroad/Semantic-Textual-Similarity-Pytorch

    experiments of some semantic matching models and comparison of experimental results.

    Language:Python16311216
  • muyuuuu/E-commerce-Search-Recall

    天池阿里灵杰问天引擎电商搜索算法赛非官方 baseline,又名 NLP 从入门到 22/2771。

    Language:Python912211
  • luozhouyang/transformers-keras

    Transformer-based models implemented in tensorflow 2.x(using keras).

    Language:Python764413
  • hellonlp/sentence-similarity

    文本相似度,语义向量,文本向量,text-similarity,similarity, sentence-similarity,BERT,SimCSE,BERT-Whitening,Sentence-BERT, PromCSE, SBERT

    Language:Python742113
  • hppRC/simple-simcse-ja

    Exploring Japanese SimCSE

    Language:Python69204
  • perceptiveshawty/RankCSE

    Implementation of "RankCSE: Unsupervised Sentence Representation Learning via Learning to Rank" (ACL 2023)

    Language:Python47547
  • daekeun-ml/KoSimCSE-SageMaker

    This is a hands-on for ML beginners to perform SimCSE step-by-step. Implemented both supervised SimCSE and unsupervisied SimCSE, and distributed training is possible with Amazon SageMaker.

    Language:Jupyter Notebook22207
  • jifei/simcse-tf2

    A TensorFlow 2 Keras implementation of SimCSE with unsupervised and supervised.

    Language:Python18122
  • MoleculeTransformers/smiles-featurizers

    Extract Molecular SMILES embeddings from language models pre-trained with various objectives architectures.

    Language:Python18131
  • langml

    4AI/langml

    A Keras-based and TensorFlow-backend NLP Models Toolkit.

    Language:Python12111
  • DolbyUUU/Reinforcement-Calibration-SimCSE

    Reinforcement Calibration SimCSE, combining contrastive learning, artificial potential fields, perceptual loss, and RLHF to achieve improved Semantic Textual Similarity (STS) embeddings. PyTorch-based implementations of PerceptualBERT and ForceBasedInfoNCE, along with fine-tuning capabilities via RLHF and evaluation using SentEval.

    Language:Python11100
  • luozhouyang/DeepSE

    Sentence Embeddings using Deep Nerual Networks in PRODUCTION!

    Language:Python9231
  • Lollipop/CRLT

    CRLT: A Unified Contrastive Learning Toolkit for Unsupervised Text Representation Learning

    Language:Python8100
  • sn2727/finetuning-embedding-models

    Domain adaption for an embedding model using unsupervised and supervised finetuning on scientific texts for the SciFact retrieval task.

    Language:Jupyter Notebook2100
  • naivenlp/rapidnlp-datasets

    Data pipelines for both TensorFlow and PyTorch!

    Language:Python1100
  • qbxlvnf11/SimCSE

    SimCSE (Simple Contrastive Learning of Sentence Embeddings) Sample Code: Generating Data & Train

    Language:Python