zhanyon's Stars
facebookresearch/BenchMARL
A collection of MARL benchmarks based on TorchRL
guosyjlu/DS-Agent
Official implementation of "DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning" in ICML'24
whoenig/libMultiRobotPlanning
Library with search algorithms for task and path planning for multi robot/agent systems
Coobiw/MPP-LLaVA
Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Train your own 8B/14B LLaVA-training-like MLLM on RTX3090/4090 24GB.
BAAI-Agents/GPA-LM
This repo is a live list of papers on game playing and large multimodality model - "A Survey on Game Playing Agents and Large Models: Methods, Applications, and Challenges".
junyuyang7/ChatAgent_RAG
离线部署大模型,构建一个可以上传本地知识库进行RAG问答且可以自行调用工具的Agent。
Guozheng-Ma/DA-in-visualRL
Collection of papers and resources for data augmentation (DA) in visual reinforcement learning (RL).
sugarandgugu/Text2Image-Retrieval
计算机视觉课程设计-基于Chinese-CLIP的图文检索系统
BeatsLeo/ClipCap-Chinese
DIP & NLP期末大作业 — 课程设计
FLAIROx/JaxMARL
Multi-Agent Reinforcement Learning with JAX
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
songwenas12/fjsp-drl
breezedeus/Pix2Text
An Open-Source Python3 tool for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.
danijar/dreamer
Dream to Control: Learning Behaviors by Latent Imagination
flowersteam/Grounding_LLMs_with_online_RL
We perform functional grounding of LLMs' knowledge in BabyAI-Text
hanjuku-kaso/awesome-offline-rl
An index of algorithms for offline reinforcement learning (offline-rl)
microsoft/DeepSpeedExamples
Example models using DeepSpeed
luban-agi/Awesome-AIGC-Tutorials
Curated tutorials and resources for Large Language Models, AI Painting, and more.
Mq-b/Loser-HomeWork
卢瑟们的作业展示,答案讲解,以及一些C++知识
ShenDezhou/Open-Prompt-Research
Some thoughts on prompts for Large Language Models.
PKUanonym/REKCARC-TSC-UHT
清华大学计算机系课程攻略 Guidance for courses in Department of Computer Science and Technology, Tsinghua University
Replicable-MARL/MARLlib
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)
cloneofsimo/lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
yuanzhoulvpi2017/zero_nlp
中文nlp解决方案(大模型、数据、模型、训练、推理)
LC1332/Chinese-alpaca-lora
骆驼:A Chinese finetuned instruction LLaMA. Developed by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子昂 @ 商汤科技
Melelery/c-binance-futures-quant
low-cost, high-efficiency, easy-to-implement
feedarchive/libera-feedbot-live
Live posts of FeedBot on Libera.Chat
Farama-Foundation/Miniworld
Simple and easily configurable 3D FPS-game-like environments for reinforcement learning
thomashirtz/gym-hybrid
Collection of OpenAI parametrized action-space environments.
StepNeverStop/RLs
Reinforcement Learning Algorithms Based on PyTorch