lxqpku
Ph.D. candidate in Computer Science at the University of Chinese Academy of Sciences.
Institute of Automation, Chinese Academy of Sciences, Beijing, China
lxqpku's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
binary-husky/gpt_academic
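Provides a practical interactive interface for LLMs such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other projects; PDF/LaTeX paper translation and summarization; parallel queries to multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, moss, and others.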
xtekky/gpt4free
The official gpt4free repository | a collection of various powerful language models
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
reworkd/AgentGPT
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
yoheinakajima/babyagi
kaixindelele/ChatPaper
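Use ChatGPT to summarize arXiv papers. Accelerates the whole research workflow, using ChatGPT for full-paper summarization, professional translation, polishing, peer review, and review responses.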
facebookresearch/ImageBind
ImageBind: One Embedding Space to Bind Them All
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
lucidrains/x-transformers
A concise but complete full-attention transformer with a set of promising experimental features from various papers
kzl/decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
google-research/robotics_transformer
takuseno/d3rlpy
An offline deep reinforcement learning library
facebookresearch/diplomacy_cicero
Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.
google-deepmind/open_x_embodiment
hanjuku-kaso/awesome-offline-rl
An index of algorithms for offline reinforcement learning (offline-rl)
uncbiag/Awesome-Foundation-Models
A curated list of foundation models for vision and language tasks
OpenRL-Lab/openrl
Unified Reinforcement Learning Framework
apexrl/Diff4RLSurvey
This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Reinforcement Learning: A Survey"
lucidrains/q-transformer
Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, from Google DeepMind
123penny123/Awesome-LLM-RL
A comprehensive list of papers, codebases, and datasets on decision making using foundation models, including LLMs and VLMs.
tomekkorbak/pretraining-with-human-feedback
Code accompanying the paper Pretraining Language Models with Human Preferences
lucidrains/llama-qrlhf
Implementation of the Llama architecture with RLHF + Q-learning
fuyw/RepL4RL
Representation Learning for RL
kyegomez/Lets-Verify-Step-by-Step
"Improving Mathematical Reasoning with Process Supervision" by OpenAI
YaoMarkMu/Awesome-Pretrained-RL
etaoxing/multigame-dt
Implementation of Multi-Game Decision Transformers in PyTorch
StanfordAI4HI/waypoint-transformer
luciferkonn/MDT
Multi-game Decision Transformer
goytoom/DT_eval
Evaluate decision transformers on challenging games