ZihanWang314

PhD student at Northwestern University. Previously @deepseek-ai @uiucnlp & Renmin University

Pinned Repositories

ESFT
Expert Specialized Fine-Tuning
Language:Python589 15 11244
mint-bench
Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Zihan Wang*, Jiateng Liu, Yangyi Chen, Lifan Yuan, Hao Peng and Heng Ji.
Language:Python117 4 47
AgentGym
Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.
Language:Python0 0 00
awesome-llm-powered-agent
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
10
ESFT
Expert Specialized Fine-Tuning
Language:Python1 0 00
lab-website-template
Language:CSS11
min-p-physics
Language:TeX31
NOVO
5 1 11
RAGEN
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
Language:Python1.2k 19 2380
ViT-for-medical-image
project for Berkeley CS182/282A.
Language:Jupyter Notebook1 1 02

ZihanWang314's Repositories

ZihanWang314/RAGEN
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
Language:Python1.2k 19 2380
ZihanWang314/CoE
Chain of Experts (CoE) enables communication between experts within Mixture-of-Experts (MoE) models
Language:Python13515
ZihanWang314/coeCheck
161
ZihanWang314/NOVO
5 1 11
ZihanWang314/SETUP
Language:Shell4
ZihanWang314/min-p-physics
Language:TeX31
ZihanWang314/awesome-llm-powered-agent
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
10
ZihanWang314/code-repo-instructions
1
ZihanWang314/ESFT
Expert Specialized Fine-Tuning
Language:Python1 0 00
ZihanWang314/lab-website-template
Language:CSS11
ZihanWang314/ViT-for-medical-image
project for Berkeley CS182/282A.
Language:Jupyter Notebook1 1 02
ZihanWang314/ZihanWang314.github.io
Language:HTML1 1 02
ZihanWang314/AgentGym
Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.
Language:Python0 0 00
ZihanWang314/AI-wrench
A toolkit of simple, powerful tools to boost productivity in AI development
01
ZihanWang314/comments
0 0 01
ZihanWang314/CSrankings
A web app for ranking computer science departments according to their research output in selective venues, and for finding active faculty across a wide range of areas.
Language:Python0 0 01
ZihanWang314/homework_fall2022
Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2022)
Language:Jupyter Notebook0 0 01
ZihanWang314/LLaVA-NeXT
Language:Python0 0 00
ZihanWang314/RUC-recruitment
Language:Jupyter Notebook01
ZihanWang314/trl
Train transformer language models with reinforcement learning.
Language:Python0 0 01
ZihanWang314/dump-to-gpt
a super simple tool to share your entire codebase with GPT models in just one line of code
Language:Python1
ZihanWang314/RAGENv2-Dev
We present a development version of a refactored second-generation codebase of RAGEN.
ZihanWang314/verl
verl: Volcano Engine Reinforcement Learning for LLMs
Language:Python
ZihanWang314/VideoAgent
Language:Python1

ZihanWang314

Pinned Repositories

ESFT

mint-bench

AgentGym

awesome-llm-powered-agent

ESFT

lab-website-template

min-p-physics

NOVO

RAGEN

ViT-for-medical-image

ZihanWang314's Repositories

ZihanWang314/RAGEN

ZihanWang314/CoE

ZihanWang314/coeCheck

ZihanWang314/NOVO

ZihanWang314/SETUP

ZihanWang314/min-p-physics

ZihanWang314/awesome-llm-powered-agent

ZihanWang314/code-repo-instructions

ZihanWang314/ESFT

ZihanWang314/lab-website-template

ZihanWang314/ViT-for-medical-image

ZihanWang314/ZihanWang314.github.io

ZihanWang314/AgentGym

ZihanWang314/AI-wrench

ZihanWang314/comments

ZihanWang314/CSrankings

ZihanWang314/homework_fall2022

ZihanWang314/LLaVA-NeXT

ZihanWang314/RUC-recruitment

ZihanWang314/trl

ZihanWang314/dump-to-gpt

ZihanWang314/RAGENv2-Dev

ZihanWang314/verl

ZihanWang314/VideoAgent