fzpshuaia
I am Zipeng Fang, a student at Shanghai Jiao Tong University. I want to learn on GitHub and contribute to the community.
Shanghai Jiao Tong University
fzpshuaia's Stars
openai/openai-python
The official Python library for the OpenAI API
huggingface/trl
Train transformer language models with reinforcement learning.
eureka-research/Eureka
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)
allenai/RL4LMs
A modular RL library to fine-tune language models to human preferences
araffin/rl-tutorial-jnrr19
Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019
ryanbgriffiths/ICRA2024PaperList
ICRA2024 Paper List
WindyLab/LLM-RL-Papers
Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.
flowersteam/Grounding_LLMs_with_online_RL
We perform functional grounding of LLMs' knowledge in BabyAI-Text
flowersteam/lamorel
Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).
leggedrobotics/wild_visual_navigation
Wild Visual Navigation: A system for fast traversability learning via pre-trained models and online self-supervision
xlang-ai/text2reward
[ICLR 2024 Spotlight] Code for the paper "Text2Reward: Reward Shaping with Language Models for Reinforcement Learning"
namoshizun/PyPOMDP
Python implementation of POMDP framework and PBVI & POMCP algorithms.
UT-Austin-RPL/amago
a simple and scalable agent for training adaptive policies with sequence-based RL
nke001/causal_learning_unknown_interventions
Code for "Neural causal learning from unknown interventions"
NithishkumarS/DWA-RL
Novel reinforcement learning based local planner that accounts for the dynamic constraints of the robot to enable smooth robot trajectories. Reward shaping is done to enable a spatially aware navigation.
ZJLAB-AMMI/LLM4RL
An RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM
Yuchen413/AnomalyRuler
Implementation for paper "Follow the Rules: Reasoning for Video Anomaly Detection with Large Language Model"
Traffic-Alpha/iLLM-TSC
This repository contains the code for the paper "iLLM-TSC: Integration reinforcement learning and large language model for traffic signal control policy improvement"
jhejna/few-shot-preference-rl
hmz-15/Interactive-Predicate-Learning
InterPreT: Interactive Predicate Learning from Language Feedback for Generalizable Task Planning (RSS 2024)
jingGM/DTG
yassinekebbati/Optimized_adaptive_MPC
Optimized adaptive MPC for lateral control for autonomous vehicles
thu-rllab/LESR
LLM-Empowered State Representation for Reinforcement Learning (ICML2024 Accepted paper)
GAMMA-UMD-Outdoor-Navigation/BehAV
BehAV: Behavioral Rule Guided Autonomy Using VLM for Robot Navigation in Outdoor Scenes
jingGM/adaptiveON
HareshKarnan/sterling_corl23
holken/polite
code for polite
agentification/Language-Integrated-VI
jingGM/GND
1umos/Msc-Project-Reinforcement-Learning
TAMER: Training an Agent Manually via Evaluative Reinforcement