Zoe-xy's Stars
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
lizhe2004/chatbot-list
行业内关于智能客服、聊天机器人的应用和架构、算法分享和介绍
PaddlePaddle/TrustAI
飞桨可信AI
weaiken/ebook
classic books of computer science!
DSXiangLi/BertManual
godweiyang/GrabGPU
Grab GPU whenever available
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
entropy2333/awesome-multi-label-paper-list
A curated list of papers for multi-label classification.
optuna/optuna
A hyperparameter optimization framework
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
yongzhuo/Pytorch-NLU
Pytorch-NLU,一个中文文本分类、序列标注工具包,支持中文长文本、短文本的多类、多标签分类任务,支持中文命名实体识别、词性标注、分词、抽取式文本摘要等序列标注任务。 Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi label classification tasks of Chinese long text and short text, and supports sequence annotation tasks such as Chinese named entity recognition, part of spee
shibing624/text2vec
text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。
suolyer/PyTorch_BERT_Biaffine_NER
论文复现《Named Entity Recognition as Dependency Parsing》
Doragd/Awesome-Sentence-Embedding
A curated list of research papers in Sentence Reprsentation Learning and a sts leaderboard of sentence embeddings.
WeiHongLee/Awesome-Multi-Task-Learning
An up-to-date list of works on Multi-Task Learning
PaddlePaddle/PaddleNLP
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
zhusleep/pytorch_chinese_lm_pretrain
pytorch中文语言模型预训练
Macielyoung/sentence_representation_matching
句子匹配模型,包括无监督的SimCSE、ESimCSE、PromptBERT,和有监督的SBERT、CoSENT。
qf6101/topwords
Implementation of paper: Deng K, Bol P K, Li K J, et al. On the unsupervised analysis of domain-specific Chinese texts[J]. Proceedings of the National Academy of Sciences, 2016: 201516510.
bytedance/byteps
A high performance and generic framework for distributed DNN training
Wchoward/IECE
Implicit Emotion Cause Extraction
subhadarship/learning-to-unjumble
learning to unjumble as a pretraining objective for RoBERTa
lonePatient/awesome-pretrained-chinese-nlp-models
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合
luozhouyang/AutoPhraseX
Automated Phrase Mining from Massive Text Corpora in Python.
shangjingbo1226/AutoPhrase
AutoPhrase: Automated Phrase Mining from Massive Text Corpora
lucidworks/auto-phrase-tokenfilter
Lucene Auto Phrase TokenFilter implementation
bojone/word-discovery
速度更快、效果更好的中文新词发现
smoothnlp/SmoothNLP
专注于可解释的NLP技术 An NLP Toolset With A Focus on Explainable Inference