ShupengHu's Stars
facebookresearch/cc_net
Tools to download and cleanup Common Crawl data
ymcui/Chinese-BERT-wwm
Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)
UKPLab/sentence-transformers
State-of-the-Art Text Embeddings
trevorhobenshield/twitter-api-client
Implementation of X/Twitter v1, v2, and GraphQL APIs
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
OpenMOSS/MOSS
An open-source tool-augmented conversational language model from Fudan University
Stability-AI/StableLM
StableLM: Stability AI Language Models
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
THUDM/GLM-130B
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
NJUNLP/GTS
Code and data for paper "Grid Tagging Scheme for Aspect-oriented Fine-grained Opinion Extraction". Aspect opinion pair datasets and aspect triplet datasets.
hoangdzung/GTS-ASOTE
GTS-ASOTE
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
renmada/t5-pegasus-pytorch
ZhuiyiTechnology/t5-pegasus
中文生成式预训练模型
IDEA-CCNL/Fengshenbang-LM
Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。
irina1nik/context_data
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
vdumoulin/conv_arithmetic
A technical report on convolution arithmetic in the context of deep learning
openslide/openslide-python
Python bindings to OpenSlide
PaddlePaddle/PaddleNLP
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
monologg/R-BERT
Pytorch implementation of R-BERT: "Enriching Pre-trained Language Model with Entity Information for Relation Classification"
mattzheng/ChineseWiki
维基百科中文语料整理
limccn/cacl2
Lexicon for Chinese lexical analyzing, 中文语言分词词库
robert-bor/aho-corasick
Java implementation of the Aho-Corasick algorithm for efficient string matching
InsaneLife/NLPDataAugmentation
Chinese NLP Data Augmentation, BERT Contextual Augmentation
abusix/ahocorapy
Pure python Aho-Corasick library.