hch1017's Stars
hiyouga/LLaMA-Factory
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
google-deepmind/opro
Official code for "Large Language Models as Optimizers"
eugeneyan/open-llms
📋 A list of open LLMs available for commercial use.
RL4VLM/RL4VLM
Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
ysymyth/ReAct
[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models
salesforce/BOLAA
noahshinn/reflexion
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
jannerm/trajectory-transformer
Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"
flowersteam/lamorel
Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).
flowersteam/Grounding_LLMs_with_online_RL
We perform functional grounding of LLMs' knowledge in BabyAI-Text
PKU-Alignment/Safe-Policy-Optimization
NeurIPS 2023: Safe Policy Optimization: A benchmark repository for safe reinforcement learning algorithms
tinyzqh/light_mappo
Lightweight version of MAPPO to help you quickly migrate to your local environment.
amazon-science/chronos-forecasting
Chronos: Pretrained (Language) Models for Probabilistic Time Series Forecasting
DAMO-DI-ML/NeurIPS2023-One-Fits-All
The official code for "One Fits All: Power General Time Series Analysis by Pretrained LM" (NeurIPS 2023 Spotlight)
SCXsunchenxi/TEST
thuml/Time-Series-Library
A Library for Advanced Deep Time Series Models.
chauncygu/Multi-Agent-Constrained-Policy-Optimisation
Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).
KimMeen/Time-LLM
[ICLR 2024] Official implementation of "🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
chennnnnyize/LLM_PowerSystems
xinliangzhou/Survey
This repository contains materials for a future survey submission
Pyosch/vpplib
TsingZ0/PFLlib
A user-friendly algorithm library (with an integrated evaluation platform) for beginners starting to study federated learning (FL)
akocherovskiy/LLM_as_optimizer
LLM as optimizer for linear regression problem
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
snwfdhmp/awesome-gpt-prompt-engineering
A curated list of awesome resources, tools, and other shiny things for GPT prompt engineering.
dair-ai/Prompt-Engineering-Guide
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
mshumer/gpt-prompt-engineer
ngruver/llmtime
Infatoshi/fcc-intro-to-llms
nikhil3456/Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, Hado van Hasselt, Peter Sunehag, Timothy Lillicrap, Jonathan Hunt, Timothy Mann, Theophane Weber, Thomas Degris, Ben Coppin).