Pashisfisuta's Stars
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
eosphoros-ai/DB-GPT
AI Native Data App Development framework with AWEL (Agentic Workflow Expression Language) and Agents
NVIDIA/DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
state-spaces/mamba
Mamba SSM architecture
InstantID/InstantID
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
jadore801120/attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
wgwang/awesome-LLMs-In-China
** large models
649453932/Chinese-Text-Classification-Pytorch
Chinese text classification with TextCNN, TextRNN, FastText, TextRCNN, BiLSTM_Attention, DPCNN, and Transformer; based on PyTorch and ready to use out of the box.
kaieye/2022-Machine-Learning-Specialization
dalinvip/Awesome-ChatGPT
A curated collection of ChatGPT learning resources, continuously updated.
facebookresearch/fairscale
PyTorch extensions for high performance and large scale training.
X-D-Lab/LangChain-ChatGLM-Webui
Automatic question answering over a local knowledge base, built on LangChain and LLMs such as the ChatGLM-6B series.
openai/weak-to-strong
zyds/transformers-code
A hands-on, step-by-step guide to Hugging Face Transformers; course videos are updated in sync on Bilibili and YouTube.
microsoft/DeBERTa
The implementation of DeBERTa
bloc97/CrossAttentionControl
Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion
jackaduma/awesome_LLMs_interview_notes
LLMs interview notes and answers: this repository mainly records interview questions and reference answers for large language model (LLM) algorithm engineers.
AIoT-MLSys-Lab/Efficient-LLMs-Survey
[TMLR 2024] Efficient Large Language Models: A Survey
nomic-ai/contrastors
Train Models Contrastively in PyTorch
HazyResearch/m2
Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"
EvilPsyCHo/Play-with-LLMs
Tutorial on training and evaluating LLMs, and on using RAG, Agent, and Chain to build entertaining applications with LLMs.
PacktPublishing/Mastering-Transformers
Mastering Transformers, published by Packt
fanqiwan/FuseLLM
FuseLLM & FuseChat Project
ghsama/ConvTransformerTimeSeries
Convolutional Transformer for time series
CVxTz/time_series_forecasting
DAMO-DI-ML/AI-for-Time-Series-Papers-Tutorials-Surveys
A professional list of Papers, Tutorials, and Surveys on AI for Time Series in top AI conferences and journals.