Marroh

PhD student at Chinese Academic of Science, Intitute of Automation(CASIA). | CV, RL, MARL, and causel inference.

UCAS CASIABeijing, China

Marroh's Stars

chauncygu/Safe-Multi-Agent-Isaac-Gym
Safe Multi-Agent Isaac Gym benchmark for safe multi-agent reinforcement learning research.
Language:Python517
chauncygu/Safe-Multi-Agent-Mujoco
Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.
Language:Python457
abizovnuralem/go2_omniverse
Unitree Go2, Unitree G1 support for Nvidia Isaac Lab (Isaac Gym / Isaac Sim)
Language:Python27524
RayYoh/Awesome-Robot-Learning
This repo contains a curative list of robot learning (mainly for manipulation) resources.
1385
denisgriaznov/CustomMuJoCoEnviromentForRL
This is a very simple example of creating and training your own MuJoCo environment using RL algorithms through the Gymnasium.
Language:Python297
Farama-Foundation/Gymnasium-Robotics
A collection of robotics simulation environments for reinforcement learning
Language:Python52283
PaulDanielML/MuJoCo_RL_UR5
A MuJoCo/Gym environment for robot control using Reinforcement Learning. The task of agents in this environment is pixel-wise prediction of grasp success chances.
Language:Python41354
YingqingHe/Awesome-LLMs-meet-Multimodal-Generation
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
Language:HTML29316
yuqingd/ellm
Language:Python578
PegasusSimulator/PegasusSimulator
A framework built on top of NVIDIA Isaac Sim for simulating drones with PX4 support and much more
Language:Python27549
polixir/OfflineRL
A collection of offline reinforcement learning algorithms.
Language:Python15220
tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Language:Jupyter Notebook1.4k225
btx0424/OmniDrones
Language:Python14125
abdulhaim/LMRL-Gym
Language:Python659
tencent-ailab/hok_env
Honor of Kings AI Open Environment of Tencent
Language:Python61672
itscassie/NLP_tools
compute bleu and rouge scores
Language:Python8
PhoebusSi/Alpaca-CoT
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台，我们欢迎开源爱好者发起任何有意义的pr！
Language:Jupyter Notebook2.6k242
tonyzhaozh/aloha
Language:Python1.4k245
meta-llama/llama
Inference code for Llama models
Language:Python55.5k9.5k
eureka-research/Eureka
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)
Language:Jupyter Notebook2.8k251
rlopt/l2i
Language:Python10325
ahmetbersoz/chatgpt-prompts-for-academic-writing
This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.
2.8k233
VictorYXL/ReplenishmentEnv
Language:Python329
liuzuxin/OSRL
🤖 Elegant implementations of offline safe RL algorithms in PyTorch
Language:Python16112
Farama-Foundation/D4RL
A collection of reference environments for offline reinforcement learning
Language:Python1.3k278
microsoft/PromptCraft-Robotics
Community for applying LLMs to robotics and a robot simulator with ChatGPT integration
Language:Python1.8k195
TeaPearce/Conditional_Diffusion_MNIST
Conditional diffusion model to generate MNIST. Minimal script. Based on 'Classifier-Free Diffusion Guidance'.
Language:Python61067
anuragajay/decision-diffuser
Language:Python27939
f/awesome-chatgpt-prompts
This repo includes ChatGPT prompt curation to use ChatGPT better.
Language:HTML111k15.1k
BlackSamorez/tensor_parallel
Automatically split your PyTorch models on multiple GPUs for training & inference
Language:Python61438

Marroh

Marroh's Stars

chauncygu/Safe-Multi-Agent-Isaac-Gym

chauncygu/Safe-Multi-Agent-Mujoco

abizovnuralem/go2_omniverse

RayYoh/Awesome-Robot-Learning

denisgriaznov/CustomMuJoCoEnviromentForRL

Farama-Foundation/Gymnasium-Robotics

PaulDanielML/MuJoCo_RL_UR5

YingqingHe/Awesome-LLMs-meet-Multimodal-Generation

yuqingd/ellm

PegasusSimulator/PegasusSimulator

polixir/OfflineRL

tatsu-lab/alpaca_eval

btx0424/OmniDrones

abdulhaim/LMRL-Gym

tencent-ailab/hok_env

itscassie/NLP_tools

PhoebusSi/Alpaca-CoT

tonyzhaozh/aloha

meta-llama/llama

eureka-research/Eureka

rlopt/l2i

ahmetbersoz/chatgpt-prompts-for-academic-writing

VictorYXL/ReplenishmentEnv

liuzuxin/OSRL

Farama-Foundation/D4RL

microsoft/PromptCraft-Robotics

TeaPearce/Conditional_Diffusion_MNIST

anuragajay/decision-diffuser

f/awesome-chatgpt-prompts

BlackSamorez/tensor_parallel