zguo0525

Pinned Repositories

3D-ptychography
Language: Python | 0 1 00
alignment-handbook
Robust recipes to align language models with human and AI preferences
Language: Python | 0 0 00
alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Language: Jupyter Notebook | 0 0 00
alpaca_farm
A Simulation Framework for RLHF and alternatives.
Language: Python | 0 0 00
AnimateDiff
Official implementation of AnimateDiff.
Language: Python | 0 0 00
animatediff-cli-prompt-travel
animatediff prompt travel
Language: Python | 0 0 00
API-Pack
Language: Jupyter Notebook | 110
Dr.LLaMA
Language: Jupyter Notebook | 57 1 23
Noise-resilience-deep-reconstruction-for-X-ray-Tomography
Language: Jupyter Notebook | 2 1 00
Physics-assisted-Generative-Adversarial-Network-for-X-Ray-Tomography
Language: Python | 1 2 01

zguo0525's Repositories

zguo0525/API-Pack
Language: Jupyter Notebook | 110
zguo0525/alignment-handbook
Robust recipes to align language models with human and AI preferences
Language: Python | 0 0 00
zguo0525/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Language: Jupyter Notebook | 0 0 00
zguo0525/animatediff-cli-prompt-travel
animatediff prompt travel
Language: Python | 0 0 00
zguo0525/annotated_deep_learning_paper_implementations
🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Language: Jupyter Notebook | 0 0 00
zguo0525/chatgpt_system_prompt
Stores all agents' system prompts
Language: C | 0 0 00
zguo0525/DeepSeek-Coder
DeepSeek Coder: Let the Code Write Itself
Language:Python0 0 00
zguo0525/AppAgent
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
zguo0525/bagel
A bagel, with everything.
zguo0525/bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
Language: Python
zguo0525/dspy
DSPy: The framework for programming—not prompting—foundation models
zguo0525/EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.
zguo0525/facechain
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
Language: Python | 0 0
zguo0525/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.
Language: Python | 0 0
zguo0525/Fooocus
Focus on prompting and generating
Language: Python | 0 0
zguo0525/LLaMA-Factory
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
zguo0525/LLM-Blender
[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cuts the weaknesses through ranking and integrates the strengths through fusing generation to enhance the capability of LLMs.
Language: Python | 0 0
zguo0525/MedicalGPT
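MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large language models, implementing incremental pretraining, supervised fine-tuning, RLHF (reward modeling and reinforcement learning training), and DPO (direct preference optimization).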
Language: Python | 0 0
zguo0525/Mind2Web
[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web"
zguo0525/mistral
Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging Face 🤗 Transformers.
zguo0525/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
zguo0525/nanotron
Minimalistic large language model 3D-parallelism training
Language: Python
zguo0525/Online-RLHF
zguo0525/OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
zguo0525/open-webui
User-friendly WebUI for LLMs (Formerly Ollama WebUI)
zguo0525/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (Support 70B+ full tuning & LoRA & Mixtral & KTO)
zguo0525/RLHF-Reward-Modeling
A recipe to train reward models for RLHF.
zguo0525/trl
Train transformer language models with reinforcement learning.
zguo0525/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language: Python | 0 0
zguo0525/zguo0525.github.io
Language: SCSS