mathCrazyy's Stars
deepglint/V-SWIFT
V-SWIFT: Training a Small VideoMAE Model on a Single Machine in a Day
predibase/llm_distillation_playbook
Best practices for distilling large language models.
mst272/LLM-Dojo
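Welcome to LLM-Dojo, an open-source place for learning about large models. It uses concise, easy-to-read code to build a model training framework (supporting mainstream models such as Qwen, Llama, and GLM), an RLHF framework (DPO/CPO/KTO/PPO), and other features. 👩🎓👨🎓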
modelscope/data-juicer
Making data higher-quality, juicier, and more digestible for foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
Yuliang-Liu/Monkey
[CVPR 2024 Highlight] Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
liucongg/ChatGLM-Finetuning
Fine-tuning on specific downstream tasks based on the ChatGLM-6B, ChatGLM2-6B, and ChatGLM3-6B models, covering Freeze, LoRA, P-tuning, full-parameter fine-tuning, and more.
mtchetch/wiztree-csv-generator
A Linux shell script for generating WizTree-compatible CSV listings
pytorch/text
Models, data loaders and abstractions for language processing, powered by PyTorch
finlay-liu/tianchi-multi-task-nlp
NLP Chinese Pre-trained Model Generalization Ability Challenge
enze5088/WenTianSearch
A baseline for WenTianSearch
mathCrazyy/pretrained-models
Open Language Pre-trained Model Zoo
ZhuiyiTechnology/pretrained-models
Open Language Pre-trained Model Zoo
MachineLP/TextMatch
QAmatch (qa_match) / text matching / text classification / text embedding / text clustering / text retrieval (bow/tfidf/ngramtf-df/bert/albert/bm25/…/nn/gbdt/xgb/kmeans/dbscan/faiss/….)
wellinxu/EasyTransfer
EasyTransfer is designed to make the development of transfer learning in NLP applications easier.
sgrvinod/Deep-Tutorials-for-PyTorch
In-depth tutorials for implementing deep learning models on your own with PyTorch.
ymcui/Chinese-BERT-wwm
Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm model series)
xiaoqi6666/NYCSDE
Categorized collection of posts from the WeChat official account 码农田小齐
keep-steady/NER_pytorch
Named Entity Recognition on the CoNLL dataset using BiLSTM+CRF, implemented with PyTorch
achernodub/targer
BiLSTM-CNN-CRF tagger
CarryChang/Customer_Satisfaction_Analysis
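An opinion-mining project based on UGC data from online homestays, covering data mining and NLP processing, including data collection, topic extraction, and sentiment analysis. The goal is to overcome inconsistencies between user ratings and reviews and to evaluate online homestay satisfaction in real time, including online review collection and sentiment visualization. A Baidu Maps POI query entry point was built to support automated batch querying of POI information. An LDA automatic topic clustering model was built on the online homestay corpus, and its topic keywords can be used to find the corresponding topic attribute dictionary. User ratings serve as labels, and the character-level TextCNN bundled with litNlp performs sentiment analysis, with the sentiment classification probability distribution used as the sentiment trend. Finally, homestay satisfaction across regions is displayed as a POI heat map. See the link for software versions.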
xuhaiming1996/BERT-T2T
A model for addressing insufficient diversity in generation tasks (translation, paraphrasing, and so on).
AnubhavGupta3377/Text-Classification-Models-Pytorch
Implementation of State-of-the-art Text Classification Models in PyTorch
pbloem/former
Simple transformer implementation from scratch in PyTorch.
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
mttk/rnn-classifier
Minimal RNN classifier with self-attention in PyTorch
cic1love/ChangeNumToEng
Change Number to English
TerryX-Lee/AlfredWorkflow_CnyTransform
Alfred workflow for converting between currency numerals, Chinese capital (financial) numerals, and English
shibing624/pycorrector
pycorrector is a toolkit for text error correction. It applies models such as KenLM, T5, MacBERT, ChatGLM3, and Qwen2.5 to error-correction scenarios and works out of the box.
ZhixiuYe/NER-pytorch
LSTM+CRF NER
zylo117/Yet-Another-EfficientDet-Pytorch
A PyTorch re-implementation of the official EfficientDet with SOTA real-time performance and pretrained weights.