sytmxl's Stars
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
quilljs/quill
Quill is a modern WYSIWYG editor built for compatibility and extensibility.
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
meta-llama/llama3
The official Meta Llama 3 GitHub site
karpathy/llm.c
LLM training in simple, raw C/CUDA
ManimCommunity/manim
A community-maintained Python framework for creating mathematical animations.
wandb/openui
OpenUI let's you describe UI using your imagination, then see it rendered live.
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
artidoro/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
InternLM/InternLM
Official release of InternLM2.5 base and chat models. 1M context support
jackhawks/rectg
我们从5000多个Telegram群组、频道和机器人中精心挑选了最优质的资源。本项目中的所有内容均来自互联网,仅用于学习和技术研究目的。
lavague-ai/LaVague
Large Action Model framework to develop AI Web Agents
developersdigest/llm-answer-engine
Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
mshumer/gpt-llm-trainer
CrazyBoyM/llama3-Chinese-chat
Llama3、Llama3.1 中文仓库(随书籍撰写中... 各种网友及厂商微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档)
BingyanStudio/LapisCV
📃 开箱即用的 Obsidian / Typora 简历
langchain-ai/rag-from-scratch
MLGroupJLU/LLM-eval-survey
The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
McGill-NLP/webllama
Llama-3 agents that can browse the web by following instructions and talking to you
SkyworkAI/Skywork
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation methods, etc. 天工系列模型在3.2TB高质量多语言和代码数据上进行预训练。我们开源了模型参数,训练数据,评估数据,评估方法。
mistralai-sf24/hackathon
Technion-Kishony-lab/data-to-paper
data-to-paper: Backward-traceable AI-driven scientific research
seanzhang-zhichen/llama3-chinese
Llama3-Chinese是以Meta-Llama-3-8B为底座,使用 DORA + LORA+ 的训练方法,在50w高质量中文多轮SFT数据 + 10w英文多轮SFT数据 + 2000单轮自我认知数据训练而来的大模型。
Tongjilibo/build_MiniLLM_from_scratch
从0到1构建一个MiniLLM (pretrain+sft+dpo实践中)
Academic-Hammer/HammerLLM
1.4B sLLM for Chinese and English - HammerLLM🔨