Marroh
PhD student at Chinese Academic of Science, Intitute of Automation(CASIA). | CV, RL, MARL, and causel inference.
UCAS CASIABeijing, China
Marroh's Stars
chauncygu/Safe-Multi-Agent-Isaac-Gym
Safe Multi-Agent Isaac Gym benchmark for safe multi-agent reinforcement learning research.
chauncygu/Safe-Multi-Agent-Mujoco
Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.
abizovnuralem/go2_omniverse
Unitree Go2, Unitree G1 support for Nvidia Isaac Lab (Isaac Gym / Isaac Sim)
RayYoh/Awesome-Robot-Learning
This repo contains a curative list of robot learning (mainly for manipulation) resources.
denisgriaznov/CustomMuJoCoEnviromentForRL
This is a very simple example of creating and training your own MuJoCo environment using RL algorithms through the Gymnasium.
Farama-Foundation/Gymnasium-Robotics
A collection of robotics simulation environments for reinforcement learning
PaulDanielML/MuJoCo_RL_UR5
A MuJoCo/Gym environment for robot control using Reinforcement Learning. The task of agents in this environment is pixel-wise prediction of grasp success chances.
YingqingHe/Awesome-LLMs-meet-Multimodal-Generation
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
yuqingd/ellm
PegasusSimulator/PegasusSimulator
A framework built on top of NVIDIA Isaac Sim for simulating drones with PX4 support and much more
polixir/OfflineRL
A collection of offline reinforcement learning algorithms.
tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
btx0424/OmniDrones
abdulhaim/LMRL-Gym
tencent-ailab/hok_env
Honor of Kings AI Open Environment of Tencent
itscassie/NLP_tools
compute bleu and rouge scores
PhoebusSi/Alpaca-CoT
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台,我们欢迎开源爱好者发起任何有意义的pr!
tonyzhaozh/aloha
meta-llama/llama
Inference code for Llama models
eureka-research/Eureka
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)
rlopt/l2i
ahmetbersoz/chatgpt-prompts-for-academic-writing
This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.
VictorYXL/ReplenishmentEnv
liuzuxin/OSRL
🤖 Elegant implementations of offline safe RL algorithms in PyTorch
Farama-Foundation/D4RL
A collection of reference environments for offline reinforcement learning
microsoft/PromptCraft-Robotics
Community for applying LLMs to robotics and a robot simulator with ChatGPT integration
TeaPearce/Conditional_Diffusion_MNIST
Conditional diffusion model to generate MNIST. Minimal script. Based on 'Classifier-Free Diffusion Guidance'.
anuragajay/decision-diffuser
f/awesome-chatgpt-prompts
This repo includes ChatGPT prompt curation to use ChatGPT better.
BlackSamorez/tensor_parallel
Automatically split your PyTorch models on multiple GPUs for training & inference