ElcarimQAQ's Stars
ElcarimQAQ/ClothPPO
Code for ClothPPO (IJCAI 2024)
vpx-ecnu/FIND
Official code for "FIND: Fine-tuning Initial Noise Distribution with Policy Optimization for Diffusion Models" (MM 2024)
vpx-ecnu/FIND-website
Website for FIND (MM 2024)
Yifan-Song793/ETO
Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)
zhengjingwei/machine-learning-interview
Algorithm engineer: a summary of machine learning interview questions
modriczhang/DRL-Rec
Doragd/Algorithm-Practice-in-Industry
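A collection of industry practice articles on search, recommendation, advertising, user growth, and related areas (sources: Zhihu, Datafuntalk, technical WeChat official accounts)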
google-deepmind/open_x_embodiment
agilexrobotics/mobile_aloha_sim
1989Ryan/llm-mcts
[NeurIPS 2023] We use large language models as a commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling better-reasoned decision-making for daily task planning problems.
alfworld/alfworld
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
noahshinn/reflexion
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
SJTU-DMTai/MASTER
This is the official code and supplementary materials for our AAAI-2024 paper: MASTER: Market-Guided Stock Transformer for Stock Price Forecasting. MASTER is a stock transformer for stock price forecasting, which models the momentary and cross-time stock correlation and guides feature selection with market information.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
AI4Finance-Foundation/FinGPT
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
zhangchuheng123/RL4Execution
TradeMaster-NTU/TradeMaster
TradeMaster is an open-source platform for quantitative trading empowered by reinforcement learning 🔥 ⚡ 🌈
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
modelscope/data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
microsoft/qlib
Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower research, and create value using AI technologies in quantitative investment, from exploring ideas to implementing productions. Qlib supports diverse machine learning modeling paradigms, including supervised learning, market dynamics modeling, and RL.
google-deepmind/mujoco_mpc
Real-time behaviour synthesis with MuJoCo, using Predictive Control
google-deepmind/language_to_reward_2023
ok-robot/ok-robot
An open, modular framework for zero-shot, language conditioned pick-and-drop tasks in arbitrary homes.
stepjam/RLBench
A large-scale benchmark and learning environment.
Dobot-Arm/DobotLink
DobotLink
BenedictHomuth/iot4Dobot
Digital Twin Project of a Dobot M1
RishiHazra/saycanpay
Official code release of AAAI 2024 paper SayCanPay.
xiaoxiaoxh/UniFolding
[CoRL 2023] UniFolding: Towards Sample-efficient, Scalable, and Generalizable Robotic Garment Folding.
huangwl18/VoxPoser
VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models