fzpshuaia

I am Zipeng Fang, a student in Shanghai Jiaotong University. I want to study in Github and try to make some contributions to the community.

shanghaijiaotong universityshanghaijiaotong university

fzpshuaia's Stars

openai/openai-python
The official Python library for the OpenAI API
Language:Python24k 311 8443.4k
huggingface/trl
Train transformer language models with reinforcement learning.
Language:Python10.6k 77 1.3k1.4k
eureka-research/Eureka
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)
Language:Jupyter Notebook2.9k 25 39257
allenai/RL4LMs
A modular RL library to fine-tune language models to human preferences
Language:Python2.3k 24 59191
araffin/rl-tutorial-jnrr19
Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019
Language:Jupyter Notebook628 12 13119
ryanbgriffiths/ICRA2024PaperList
ICRA2024 Paper List
479 4 531
WindyLab/LLM-RL-Papers
Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.
254 3 013
flowersteam/Grounding_LLMs_with_online_RL
We perform functional grounding of LLMs' knowledge in BabyAI-Text
Language:Python233 8 2325
flowersteam/lamorel
Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).
Language:Python208 4 2418
leggedrobotics/wild_visual_navigation
Wild Visual Navigation: A system for fast traversability learning via pre-trained models and online self-supervision
Language:Python146 9 17514
xlang-ai/text2reward
[ICLR 2024 Spotlight] Code for the paper "Text2Reward: Reward Shaping with Language Models for Reinforcement Learning"
Language:Jupyter Notebook139 7 38
namoshizun/PyPOMDP
Python implementation of POMDP framework and PBVI & POMCP algorithms.
Language:Python106 3 626
UT-Austin-RPL/amago
a simple and scalable agent for training adaptive policies with sequence-based RL
Language:Python105 2 115
nke001/causal_learning_unknown_interventions
Code for "Neural causal learning from unknown interventions"
Language:C100 7 218
NithishkumarS/DWA-RL
Novel reinforcement learning based local planner that accounts for the dynamic constraints of the robot to enable smooth robot trajectories. Reward shaping is done to enable a spatially aware navigation.
Language:Python69 3 611
ZJLAB-AMMI/LLM4RL
A RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM
Language:Python66 2 413
Yuchen413/AnomalyRuler
Implementation for paper "Follow the Rules: Reasoning for Video Anomaly Detection with Large Language Model"
Language:Python48 2 83
Traffic-Alpha/iLLM-TSC
This repository contains the code for the paper“iLLM-TSC: Integration reinforcement learning and large language model for traffic signal control policy improvement”
Language:Python47 3 119
jhejna/few-shot-preference-rl
Language:Python32 2 46
hmz-15/Interactive-Predicate-Learning
InterPreT: Interactive Predicate Learning from Language Feedback for Generalizable Task Planning (RSS 2024)
Language:Python27 2 01
jingGM/DTG
Language:Python27 3 22
yassinekebbati/Optimized_adaptive_MPC
Optimized adaptive MPC for lateral control for autonomous vehicles
Language:MATLAB26 1 03
thu-rllab/LESR
LLM-Empowered State Representation for Reinforcement Learning (ICML2024 Accepted paper)
Language:Python24 5 01
GAMMA-UMD-Outdoor-Navigation/BehAV
BehAV: Behavioral Rule Guided Autonomy Using VLM for Robot Navigation in Outdoor Scenes
Language:Python171
jingGM/adaptiveON
Language:Python17 1 10
HareshKarnan/sterling_corl23
Language:Python11 3 11
holken/polite
code for polite
Language:Python11 2 00
agentification/Language-Integrated-VI
Language:Python10 1 10
jingGM/GND
Language:C++10 1 10
1umos/Msc-Project-Reinforcement-Learning
TAMER: Training an Agent Manually via Evaluative Reinforcement
Language:Jupyter Notebook3 1 00

fzpshuaia

fzpshuaia's Stars

openai/openai-python

huggingface/trl

eureka-research/Eureka

allenai/RL4LMs

araffin/rl-tutorial-jnrr19

ryanbgriffiths/ICRA2024PaperList

WindyLab/LLM-RL-Papers

flowersteam/Grounding_LLMs_with_online_RL

flowersteam/lamorel

leggedrobotics/wild_visual_navigation

xlang-ai/text2reward

namoshizun/PyPOMDP

UT-Austin-RPL/amago

nke001/causal_learning_unknown_interventions

NithishkumarS/DWA-RL

ZJLAB-AMMI/LLM4RL

Yuchen413/AnomalyRuler

Traffic-Alpha/iLLM-TSC

jhejna/few-shot-preference-rl

hmz-15/Interactive-Predicate-Learning

jingGM/DTG

yassinekebbati/Optimized_adaptive_MPC

thu-rllab/LESR

GAMMA-UMD-Outdoor-Navigation/BehAV

jingGM/adaptiveON

HareshKarnan/sterling_corl23

holken/polite

agentification/Language-Integrated-VI

jingGM/GND

1umos/Msc-Project-Reinforcement-Learning