Shengqi-Kong's Stars
MorvanZhou/pytorch-A3C
Simple A3C implementation with pytorch + multiprocessing
Shengqi-Kong/PRIMAL_1
This is PRIMAL_1 repo
gsartoretti/PRIMAL
PRIMAL: Pathfinding via Reinforcement and Imitation Multi-Agent Learning -- Distributed RL/IL code for Multi-Agent Path Finding (MAPF)
123penny123/Awesome-LLM-RL
A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.
state-spaces/mamba
Mamba SSM architecture
alxndrTL/mamba.py
A simple and efficient Mamba implementation in pure PyTorch and MLX.
Starlight0798/gymRL
基于gym的pytorch深度强化学习(DRL)(PPO,PPG,DQN,SAC,DDPG,TD3等算法)
boyu-ai/Hands-on-RL
https://hrl.boyuai.com/
LLMBook-zh/LLMBook-zh.github.io
《大语言模型》作者:赵鑫,李军毅,周昆,唐天一,文继荣
Tongjilibo/build_MiniLLM_from_scratch
从0到1构建一个MiniLLM (pretrain+sft+dpo实践中)
ygjin11/r2-play
The official implementation of the paper "Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction".
histmeisah/Large-Language-Models-play-StarCraftII
TextStarCraft2,a pure language env which support llms play starcraft2
google-deepmind/pysc2
StarCraft II Learning Environment
chaoyu1999/DRUNet
基于全局和局部残差图像预测的红外目标检测
MineDojo/Voyager
An Open-Ended Embodied Agent with Large Language Models
OpenBMB/ProAgent
An LLM-based Agent for the New Automation Paradigm - Agentic Process Automation
OpenBMB/XAgent
An Autonomous LLM Agent for Complex Task Solving
yoheinakajima/babyagi
AGI-Edgerunners/Plan-and-Solve-Prompting
Code for our ACL 2023 Paper "Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models".
lafmdp/Awesome-Papers-Autonomous-Agent
A collection of recent papers on building autonomous agent. Two topics included: RL-based / LLM-based agents.
ysymyth/ReAct
[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
ShusenTang/Dive-into-DL-PyTorch
本项目将《动手学深度学习》(Dive into Deep Learning)原书中的MXNet实现改为PyTorch实现。
cqunlp/research_resources
Resources of CQU CS 1701 research, include NLP, Knowledge Graph,Cloud Computing, etc.
AccumulateMore/CV
✔(已完结)最全面的 深度学习 笔记【土堆 Pytorch】【李沐 动手学深度学习】【吴恩达 深度学习】
kaixindelele/ChatPaper
Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
Gor-Ren/gym-jsbsim
A reinforcement learning environment for aircraft control using the JSBSim flight dynamics model
sleyoar/JobHelper
Design and implementation of job recruitment information website-job Gang network based on SSM 基于ssm的招聘信息网站-职帮网的设计与实现
konglingwen94/vue-bytedanceJob
Vue仿写字节跳动招聘网站的单页面应用,仅作为学习使用。
oncestep/IndexRecruit
Spring Boot + Mybatis开发实习生招聘网站