ElcarimQAQ

Student at East China Normal University

East China Normal University, Shanghai

ElcarimQAQ's Stars

ElcarimQAQ/ClothPPO
Code for ClothPPO (IJCAI 2024)
Language: Python, 2 stars
vpx-ecnu/FIND
Official code for "FIND: Fine-tuning Initial Noise Distribution with Policy Optimization for Diffusion Models" (MM 2024)
Language: Python, 10
vpx-ecnu/FIND-website
Website for FIND (MM 2024)
Language: JavaScript, 3 stars
Yifan-Song793/ETO
Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)
Language: Python, 919
zhengjingwei/machine-learning-interview
Algorithm Engineer: Summary of Machine Learning Interview Questions
1.3k stars, 188 forks
modriczhang/DRL-Rec
Language: Python, 62
Doragd/Algorithm-Practice-in-Industry
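A collection of articles on industry practice in search, recommendation, advertising, user growth, and related areas (sources: Zhihu, DataFunTalk, technical WeChat official accounts)
Language: Python, 2.3k stars, 292 forks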
google-deepmind/open_x_embodiment
Language: Jupyter Notebook, 81056
agilexrobotics/mobile_aloha_sim
Language: C++, 4710
1989Ryan/llm-mcts
[NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling better-reasoned decision-making for daily task planning problems.
Language: Python, 17318
alfworld/alfworld
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
Language: Python, 34754
noahshinn/reflexion
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
Language: Python, 2.4k stars, 226 forks
SJTU-DMTai/MASTER
This is the official code and supplementary materials for our AAAI-2024 paper: MASTER: Market-Guided Stock Transformer for Stock Price Forecasting. MASTER is a stock transformer for stock price forecasting, which models the momentary and cross-time stock correlation and guides feature selection with market information.
Language: Python, 15132
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language: Python, 28.4k stars, 4.2k forks
AI4Finance-Foundation/FinGPT
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
Language: Jupyter Notebook, 13.8k stars, 1.9k forks
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
Language: Python, 2.1k stars, 172 forks
zhangchuheng123/RL4Execution
Language: Python, 82
TradeMaster-NTU/TradeMaster
TradeMaster is an open-source platform for quantitative trading empowered by reinforcement learning 🔥 ⚡ 🌈
Language: Jupyter Notebook, 1.4k stars, 279 forks
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Language: Python, 4.5k stars, 472 forks
modelscope/data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️ 🍸 🍹 🍷
Language: Python, 2.7k stars, 169 forks
microsoft/qlib
Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower research, and create value using AI technologies in quantitative investment, from exploring ideas to implementing productions. Qlib supports diverse machine learning modeling paradigms, including supervised learning, market dynamics modeling, and RL.
Language: Python, 15.4k stars, 2.6k forks
google-deepmind/mujoco_mpc
Real-time behaviour synthesis with MuJoCo, using Predictive Control
Language: C++, 990148
google-deepmind/language_to_reward_2023
Language: Python, 10615
ok-robot/ok-robot
An open, modular framework for zero-shot, language conditioned pick-and-drop tasks in arbitrary homes.
Language: Python, 44031
stepjam/RLBench
A large-scale benchmark and learning environment.
Language: Python, 1.1k stars, 230 forks
Dobot-Arm/DobotLink
DobotLink
Language: C++, 4 stars
BenedictHomuth/iot4Dobot
Digital Twin Project of a Dobot M1
Language: Go, 1 star
RishiHazra/saycanpay
Official code release of AAAI 2024 paper SayCanPay.
Language: Python, 344
xiaoxiaoxh/UniFolding
[CoRL 2023] UniFolding: Towards Sample-efficient, Scalable, and Generalizable Robotic Garment Folding.
Language: C#, 24
huangwl18/VoxPoser
VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models
Language: Python, 55076
