YifeiZhou02

personal homepage: https://yifeizhou02.github.io/

YifeiZhou02's Stars

gimme1dollar/b-moca
Benchmarking Mobile Device Control Agents across Diverse Configurations (ICLR 2024 workshop GenAI4DM spotlight presentation)
Language:Python19
ServiceNow/BrowserGym
BrowserGym, a gym environment for web task automation in the Chromium browser.
Language:Python22626
YifeiZhou02/ArCHer
Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"
Language:Python7311
snu-mllab/Achievement-Distillation
Official PyTorch implementation of "Discovering Hierarchical Achievements in Reinforcement Learning via Contrastive Learning" (NeurIPS 2023)
Language:Python25
OSU-NLP-Group/TravelPlanner
[ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"
Language:Python19024
princeton-nlp/WebShop
[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents
Language:Python23347
princeton-nlp/intercode
[NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898
Language:Python17931
THUDM/AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Language:Python2k138
PKU-Alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Language:Python1.3k115
Jiayi-Pan/GPT-V-on-Web
👀🧠 GPT-4 Vision x 💪⌨️ Vimium = Autonomous Web Agent
Language:Python1617
microsoft/TextWorld
TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.
Language:Jupyter Notebook1.2k188
ikostrikov/jaxrl
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
Language:Jupyter Notebook60062
mingkaid/rl-prompt
Accompanying repo for the RLPrompt paper
Language:Python28652
young-geng/EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.
Language:Python2.3k247
Sea-Snell/JAXSeq
Train very large language models in Jax.
Language:Python18817
Sea-Snell/Implicit-Language-Q-Learning
Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"
Language:Python19317
fengyuli-dev/distribution-normalization
Test-Time Distribution Normalization For Contrastively Learned Vision-language Models
Language:Python24
ikostrikov/rlpd
Language:Python18821
ikostrikov/pytorch-trpo
PyTorch implementation of Trust Region Policy Optimization
Language:Python42894
kairproject/kair_algorithms_draft
Reinforcement learning algorithms for robot control tasks
Language:Python2810
YifeiZhou02/generalized_paraphrase_identification
Research code for "GAPX: Generalized Autoregressive Paraphrase-identification X", NeurIPS 2022
Language:Python3
jungokasai/THumB
15
yudasong/HyQ
Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient.
Language:Python213
LexiFi/csml
High-level bindings between .Net and OCaml
Language:OCaml645
SimplifyJobs/Summer2025-Internships
Collection of Summer 2025 tech internships!
32.2k2.6k
northwesternfintech/2025QuantInternships
Public quant internship repository, maintained by NUFT but available for everyone.
1.1k74
aviralkumar2907/CQL
Code for conservative Q-learning
Language:Python38269
YifeiZhou02/Improve-Discourse-Dependency-Parsing-with-Contextualized-Representations
Implementation of the paper 'Improve Discourse Dependency Parsing with Contextualized Representations', Findings of NAACL 2022
Language:Python142
liaopeiyuan/artbench
Benchmarking Generative Models with Artworks
Language:Python2199
dangkhoasdc/awesome-ai-residency
List of AI Residency Programs
3k270

YifeiZhou02

YifeiZhou02's Stars

gimme1dollar/b-moca

ServiceNow/BrowserGym

YifeiZhou02/ArCHer

snu-mllab/Achievement-Distillation

OSU-NLP-Group/TravelPlanner

princeton-nlp/WebShop

princeton-nlp/intercode

THUDM/AgentBench

PKU-Alignment/safe-rlhf

Jiayi-Pan/GPT-V-on-Web

microsoft/TextWorld

ikostrikov/jaxrl

mingkaid/rl-prompt

young-geng/EasyLM

Sea-Snell/JAXSeq

Sea-Snell/Implicit-Language-Q-Learning

fengyuli-dev/distribution-normalization

ikostrikov/rlpd

ikostrikov/pytorch-trpo

kairproject/kair_algorithms_draft

YifeiZhou02/generalized_paraphrase_identification

jungokasai/THumB

yudasong/HyQ

LexiFi/csml

SimplifyJobs/Summer2025-Internships

northwesternfintech/2025QuantInternships

aviralkumar2907/CQL

YifeiZhou02/Improve-Discourse-Dependency-Parsing-with-Contextualized-Representations

liaopeiyuan/artbench

dangkhoasdc/awesome-ai-residency