Sshuoshuo's Stars
LearnPrompt/LLMs-cookbook
Examples and guides for using LLMs
zjunlp/KnowLM
An Open-sourced Knowledgeable Large Language Model Framework.
modelscope/facechain
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
HqWu-HITCS/Awesome-Chinese-LLM
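A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and are cheaper to train, including base models, vertical-domain fine-tuning and applications, datasets, and tutorials.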
DSXiangLi/DecryptPrompt
A summary of Prompt & LLM papers, open-source data & models, and AIGC applications
okankop/vidaug
Effective Video Augmentation Techniques for Training Convolutional Neural Networks
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
HarderThenHarder/transformers_tasks
⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.
lucidrains/PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
allenai/RL4LMs
A modular RL library to fine-tune language models to human preferences
Timothyxxx/Chain-of-ThoughtsPapers
A trend that starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".
shayanalibhatti/Video-Augmentation-Code
In this repository, a simple implementation of Video augmentation is provided to augment videos for machine learning training tasks.
jssprz/video_captioning_datasets
Summary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*
yanqiangmiffy/GoGPT
GoGPT: a Chinese-English enhanced large model trained on Llama/Llama 2 | Chinese-Llama2
phellonchen/X-LLM
X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages
BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
yanqiangmiffy/how-to-train-tokenizer
How to train an LLM tokenizer
ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models + local CPU/GPU training and deployment
microsoft/torchscale
Foundation Architecture for (M)LLMs
chen1310054465/SBN
Code for ‘A Span-level Bidirectional Network for Aspect Sentiment Triplet Extraction’
Avmb/marian-mBART
Training harness to pretrain an mBART model using Marian
chiayewken/Span-ASTE
Code Implementation of "Learning Span-Level Interactions for Aspect Sentiment Triplet Extraction".
qftie/MiduCTC-competition
17th-place solution for the Midu & Paddle intelligent text proofreading competition
xv44586/ccf_2020_qa_match
CCF 2020 QA match competition, top-1 solution
CoinCheung/pytorch-loss
label-smooth, amsoftmax, partial-fc, focal-loss, triplet-loss, lovasz-softmax. Maybe useful
AI-confused/CGEC-with-Pointer-Generator-Network-Bart
A pointer-generator network based on the BART language model, for Chinese grammatical error correction
nilboy/rlt2t
Text to text with reinforcement learning
OptimalScale/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
AI-confused/Sequence-to-Action
Grammar correction project based on Tencent's paper (Sequence to Action)
LZ-CH/GAIIC2023
GAIIC Track 1: Radiology NLP, medical imaging diagnostic report generation [Rank-12 solution from team A100换你大棚甜瓜]