Shengqi-Kong

Shengqi-Kong's Stars

MorvanZhou/pytorch-A3C
Simple A3C implementation with pytorch + multiprocessing
Language:Python608142
Shengqi-Kong/PRIMAL_1
This is PRIMAL_1 repo
Language:Python1
gsartoretti/PRIMAL
PRIMAL: Pathfinding via Reinforcement and Imitation Multi-Agent Learning -- Distributed RL/IL code for Multi-Agent Path Finding (MAPF)
Language:Python29778
123penny123/Awesome-LLM-RL
A comprehensive list of PAPERS, CODEBASES, and DATASETS on Decision Making using Foundation Models including LLMs and VLMs.
31117
state-spaces/mamba
Mamba SSM architecture
Language: Python, 12.6k stars, 1.1k forks
alxndrTL/mamba.py
A simple and efficient Mamba implementation in pure PyTorch and MLX.
Language:Python91584
Starlight0798/gymRL
Gym-based deep reinforcement learning (DRL) in PyTorch (PPO, PPG, DQN, SAC, DDPG, TD3, and other algorithms)
Language:Python578
boyu-ai/Hands-on-RL
https://hrl.boyuai.com/
Language: Jupyter Notebook, 2.3k stars, 519 forks
LLMBook-zh/LLMBook-zh.github.io
The book "Large Language Models" (《大语言模型》). Authors: Zhao Xin, Li Junyi, Zhou Kun, Tang Tianyi, Wen Jirong
2.2k stars, 147 forks
Tongjilibo/build_MiniLLM_from_scratch
Building a MiniLLM from scratch (pretrain + SFT + DPO, work in progress)
Language:Python29836
ygjin11/r2-play
The official implementation of the paper "Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction".
Language:Python321
histmeisah/Large-Language-Models-play-StarCraftII
TextStarCraft2, a pure-language environment that supports LLMs playing StarCraft II
Language:Python19212
google-deepmind/pysc2
StarCraft II Learning Environment
Language: Python, 8k stars, 1.2k forks
chaoyu1999/DRUNet
Infrared target detection based on global and local residual image prediction
Language:Python285
MineDojo/Voyager
An Open-Ended Embodied Agent with Large Language Models
Language: JavaScript, 5.5k stars, 508 forks
OpenBMB/ProAgent
An LLM-based Agent for the New Automation Paradigm - Agentic Process Automation
Language:Python75680
OpenBMB/XAgent
An Autonomous LLM Agent for Complex Task Solving
Language: Python, 8k stars, 825 forks
yoheinakajima/babyagi
Language: Python, 19.9k stars, 2.6k forks
AGI-Edgerunners/Plan-and-Solve-Prompting
Code for our ACL 2023 Paper "Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models".
Language:Python57851
lafmdp/Awesome-Papers-Autonomous-Agent
A collection of recent papers on building autonomous agents. Two topics are included: RL-based and LLM-based agents.
52548
ysymyth/ReAct
[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models
Language: Jupyter Notebook, 1.9k stars, 189 forks
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
6.3k stars, 379 forks
ShusenTang/Dive-into-DL-PyTorch
This project converts the MXNet implementations in the original book Dive into Deep Learning (《动手学深度学习》) to PyTorch.
Language: Jupyter Notebook, 18.2k stars, 5.4k forks
cqunlp/research_resources
Resources for CQU CS 1701 research, including NLP, Knowledge Graph, Cloud Computing, etc.
13435
AccumulateMore/CV
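✔ (Completed) The most comprehensive deep learning notes [Tudui PyTorch] [Li Mu, Dive into Deep Learning] [Andrew Ng, Deep Learning]
Language: Jupyter Notebook, 5.5k stars, 742 forks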
kaixindelele/ChatPaper
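Use ChatGPT to summarize arXiv papers. Speeds up the whole research workflow by using ChatGPT for full-paper summarization, professional translation, polishing, peer review, and review responses.
Language: Python, 18.3k stars, 1.9k forks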
Gor-Ren/gym-jsbsim
A reinforcement learning environment for aircraft control using the JSBSim flight dynamics model
Language:Python17085
sleyoar/JobHelper
Design and implementation of Zhibang (职帮网), a job recruitment information website based on SSM
Language:JavaScript132
konglingwen94/vue-bytedanceJob
A single-page application that recreates the ByteDance recruitment website in Vue, for learning purposes only.
Language:Vue14729
oncestep/IndexRecruit
An internship recruitment website built with Spring Boot + Mybatis
Language:Java27585