ZihanWang314
PhD student at Northwestern University. Previously @deepseek-ai @uiucnlp & Renmin University
Pinned Repositories
ESFT
Expert Specialized Fine-Tuning
mint-bench
Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Zihan Wang*, Jiateng Liu, Yangyi Chen, Lifan Yuan, Hao Peng and Heng Ji.
AgentGym
Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.
awesome-llm-powered-agent
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
ESFT
Expert Specialized Fine-Tuning
lab-website-template
min-p-physics
NOVO
RAGEN
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
ViT-for-medical-image
project for Berkeley CS182/282A.
ZihanWang314's Repositories
ZihanWang314/RAGEN
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
ZihanWang314/CoE
Chain of Experts (CoE) enables communication between experts within Mixture-of-Experts (MoE) models
ZihanWang314/coeCheck
ZihanWang314/NOVO
ZihanWang314/SETUP
ZihanWang314/min-p-physics
ZihanWang314/awesome-llm-powered-agent
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
ZihanWang314/code-repo-instructions
ZihanWang314/ESFT
Expert Specialized Fine-Tuning
ZihanWang314/lab-website-template
ZihanWang314/ViT-for-medical-image
project for Berkeley CS182/282A.
ZihanWang314/ZihanWang314.github.io
ZihanWang314/AgentGym
Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.
ZihanWang314/AI-wrench
A toolkit of simple, powerful tools to boost productivity in AI development
ZihanWang314/comments
ZihanWang314/CSrankings
A web app for ranking computer science departments according to their research output in selective venues, and for finding active faculty across a wide range of areas.
ZihanWang314/homework_fall2022
Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2022)
ZihanWang314/LLaVA-NeXT
ZihanWang314/RUC-recruitment
ZihanWang314/trl
Train transformer language models with reinforcement learning.
ZihanWang314/dump-to-gpt
a super simple tool to share your entire codebase with GPT models in just one line of code
ZihanWang314/RAGENv2-Dev
We present a development version of a refactored second-generation codebase of RAGEN.
ZihanWang314/verl
verl: Volcano Engine Reinforcement Learning for LLMs
ZihanWang314/VideoAgent