zhanyon

Institute of Automation, Chinese Academy of Sciences, Beijing

zhanyon's Stars

facebookresearch/BenchMARL
A collection of MARL benchmarks based on TorchRL
Language:Python26638
guosyjlu/DS-Agent
Official implementation of "DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning" in ICML'24
Language:Python11815
whoenig/libMultiRobotPlanning
Library with search algorithms for task and path planning for multi robot/agent systems
Language:C++814218
Coobiw/MPP-LLaVA
Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Train your own 8B/14B LLaVA-training-like MLLM on RTX3090/4090 24GB.
Language:Jupyter Notebook37320
BAAI-Agents/GPA-LM
This repo is a live list of papers on game playing and large multimodality model - "A Survey on Game Playing Agents and Large Models: Methods, Applications, and Challenges".
964
junyuyang7/ChatAgent_RAG
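Deploys a large model offline to build an Agent that can answer questions via RAG over an uploaded local knowledge base and call tools on its own.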
Language:Python222
Guozheng-Ma/DA-in-visualRL
Collection of papers and resources for data augmentation (DA) in visual reinforcement learning (RL).
706
sugarandgugu/Text2Image-Retrieval
Computer vision course project: an image-text retrieval system based on Chinese-CLIP
Language:Python432
BeatsLeo/ClipCap-Chinese
DIP & NLP final assignment (course project)
Language:Jupyter Notebook183
FLAIROx/JaxMARL
Multi-Agent Reinforcement Learning with JAX
Language:Python42278
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
Language:Jupyter Notebook 13.6k 1.1k
songwenas12/fjsp-drl
Language:Python20956
breezedeus/Pix2Text
An Open-Source Python3 tool for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.
Language:Jupyter Notebook 1.9k 181
danijar/dreamer
Dream to Control: Learning Behaviors by Latent Imagination
Language:Python510109
flowersteam/Grounding_LLMs_with_online_RL
We perform functional grounding of LLMs' knowledge in BabyAI-Text
Language:Python21823
hanjuku-kaso/awesome-offline-rl
An index of algorithms for offline reinforcement learning (offline-rl)
91485
microsoft/DeepSpeedExamples
Example models using DeepSpeed
Language:Python 6.1k 1k
luban-agi/Awesome-AIGC-Tutorials
Curated tutorials and resources for Large Language Models, AI Painting, and more.
3.8k 260
Mq-b/Loser-HomeWork
Homework showcase from the "losers" (卢瑟), answer walkthroughs, and some C++ knowledge
Language:C++644137
ShenDezhou/Open-Prompt-Research
Some thoughts on prompts for Large Language Models.
Language:Python9
PKUanonym/REKCARC-TSC-UHT
Guidance for courses in Department of Computer Science and Technology, Tsinghua University
Language:HTML 33.3k 7.6k
Replicable-MARL/MARLlib
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)
Language:Python924147
cloneofsimo/lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
Language:Jupyter Notebook 7k 481
yuanzhoulvpi2017/zero_nlp
Chinese NLP solutions (large models, data, models, training, inference)
Language:Jupyter Notebook 2.9k 360
LC1332/Chinese-alpaca-lora
Luotuo (骆驼): A Chinese instruction-finetuned LLaMA. Developed by 陈启源 @ Central China Normal University, 李鲁鲁 @ SenseTime, and 冷子昂 @ SenseTime
Language:Jupyter Notebook71085
Melelery/c-binance-futures-quant
low-cost, high-efficiency, easy-to-implement
Language:Python632347
feedarchive/libera-feedbot-live
Live posts of FeedBot on Libera.Chat
3811
Farama-Foundation/Miniworld
Simple and easily configurable 3D FPS-game-like environments for reinforcement learning
Language:Python700130
thomashirtz/gym-hybrid
Collection of OpenAI parametrized action-space environments.
Language:Python5810
StepNeverStop/RLs
Reinforcement Learning Algorithms Based on PyTorch
Language:Python44893
