Pinned Repositories
3D-ptychography
alignment-handbook
Robust recipes to align language models with human and AI preferences
alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
alpaca_farm
A Simulation Framework for RLHF and alternatives.
AnimateDiff
Official implementation of AnimateDiff.
animatediff-cli-prompt-travel
animatediff prompt travel
API-Pack
Dr.LLaMA
Noise-resilience-deep-reconstruction-for-X-ray-Tomography
Physics-assisted-Generative-Adversarial-Network-for-X-Ray-Tomography
zguo0525's Repositories
zguo0525/API-Pack
zguo0525/alignment-handbook
Robust recipes to align language models with human and AI preferences
zguo0525/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
zguo0525/animatediff-cli-prompt-travel
animatediff prompt travel
zguo0525/annotated_deep_learning_paper_implementations
🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
zguo0525/chatgpt_system_prompt
store all agent's system prompt
zguo0525/DeepSeek-Coder
DeepSeek Coder: Let the Code Write Itself
zguo0525/AppAgent
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
zguo0525/bagel
A bagel, with everything.
zguo0525/bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
zguo0525/dspy
DSPy: The framework for programming—not prompting—foundation models
zguo0525/EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.
zguo0525/facechain
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
zguo0525/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.
zguo0525/Fooocus
Focus on prompting and generating
zguo0525/LLaMA-Factory
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
zguo0525/LLM-Blender
[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cut the weaknesses through ranking and integrate the strengths through fusing generation to enhance the capability of LLMs.
zguo0525/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练、有监督微调、RLHF(奖励建模、强化学习训练)和DPO(直接偏好优化)。
zguo0525/Mind2Web
[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web"
zguo0525/mistral
Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging Face 🤗 Transformers.
zguo0525/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
zguo0525/nanotron
Minimalistic large language model 3D-parallelism training
zguo0525/Online-RLHF
zguo0525/OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
zguo0525/open-webui
User-friendly WebUI for LLMs (Formerly Ollama WebUI)
zguo0525/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (Support 70B+ full tuning & LoRA & Mixtral & KTO)
zguo0525/RLHF-Reward-Modeling
A recipe to train reward models for RLHF.
zguo0525/trl
Train transformer language models with reinforcement learning.
zguo0525/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
zguo0525/zguo0525.github.io