Pinned Repositories
ChatAgent
A Python-based agent framework for large language models.
DeepFakeFace
DeepFake Face Datasets. Code accompanying the paper "Robustness and Generalizability of Deepfake Detection: A Study with Diffusion Models".
DGPO
DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization
OpenPlugin
Toolkit for managing large language model plugins
openrl
Unified Reinforcement Learning Framework
PyTorch_Tutorial
PyTorch tips and tutorials
Ray_Tutorial
Tutorial for Ray
RL_Tutorial
Reinforcement Learning Tutorial
TiZero
Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023). Football game agent.
Wandb_Tutorial
How to use wandb?
OpenRL's Repositories
OpenRL-Lab/openrl
Unified Reinforcement Learning Framework
OpenRL-Lab/Wandb_Tutorial
How to use wandb?
OpenRL-Lab/TiZero
Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023). Football game agent.
OpenRL-Lab/DeepFakeFace
DeepFake Face Datasets. Code accompanying the paper "Robustness and Generalizability of Deepfake Detection: A Study with Diffusion Models".
OpenRL-Lab/Ray_Tutorial
Tutorial for Ray
OpenRL-Lab/ChatAgent
A Python-based agent framework for large language models.
OpenRL-Lab/DGPO
DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization
OpenRL-Lab/OpenPlugin
Toolkit for managing large language model plugins
OpenRL-Lab/huggingface_tool
Tools for loading, uploading, and managing Hugging Face models and datasets
OpenRL-Lab/RL_Tutorial
Reinforcement Learning Tutorial
OpenRL-Lab/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.
OpenRL-Lab/openrl-docs
OpenRL documentation
OpenRL-Lab/LLM4Reversi
OpenRL-Lab/DiversePolicies
Code accompanying the paper "Diverse Policies Converge in Reward-free Markov Decision Processes" (PRICAI 2023)
OpenRL-Lab/VideoHub
VideoHub API
OpenRL-Lab/.github
OpenRL-Lab/CogVLM
A state-of-the-art open visual language model | Multimodal pretrained model
OpenRL-Lab/dify
An Open-Source Assistants API and GPTs alternative. Dify.AI is an LLM application development platform. It integrates the concepts of Backend as a Service and LLMOps, covering the core tech stack required for building generative AI-native applications, including a built-in RAG engine.
OpenRL-Lab/embedchain
The Open Source RAG framework
OpenRL-Lab/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
OpenRL-Lab/iHuggingfaceHub
OpenRL-Lab/letcode.ai
OpenRL-Lab/llama_index
LlamaIndex (formerly GPT Index) is a data framework for your LLM applications
OpenRL-Lab/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
OpenRL-Lab/LVBench
LVBench: An Extreme Long Video Understanding Benchmark
OpenRL-Lab/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
OpenRL-Lab/staged-recipes
A place to submit conda recipes before they become fully fledged conda-forge feedstocks
OpenRL-Lab/SwissArmyTransformer
SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.
OpenRL-Lab/text-embeddings-inference
A blazing fast inference solution for text embeddings models
OpenRL-Lab/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs