hch1017

hch1017's Stars

hiyouga/LLaMA-Factory
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Language: Python | Stars: 26k | Forks: 3.2k
google-deepmind/opro
official code for "Large Language Models as Optimizers"
Language:Python31731
eugeneyan/open-llms
📋 A list of open LLMs available for commercial use.
Stars: 10.6k | Forks: 661
RL4VLM/RL4VLM
Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Language:Jupyter Notebook14414
ysymyth/ReAct
[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models
Language: Jupyter Notebook | Stars: 1.7k | Forks: 179
salesforce/BOLAA
Language:Python15416
noahshinn/reflexion
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
Language: Python | Stars: 2.2k | Forks: 211
jannerm/trajectory-transformer
Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"
Language:Python44461
flowersteam/lamorel
Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).
Language:Python17414
flowersteam/Grounding_LLMs_with_online_RL
We perform functional grounding of LLMs' knowledge in BabyAI-Text
Language:Python20320
PKU-Alignment/Safe-Policy-Optimization
NeurIPS 2023: Safe Policy Optimization: A benchmark repository for safe reinforcement learning algorithms
Language:Python30844
tinyzqh/light_mappo
Lightweight version of MAPPO to help you quickly migrate to your local environment.
Language:Python42874
amazon-science/chronos-forecasting
Chronos: Pretrained (Language) Models for Probabilistic Time Series Forecasting
Language: Python | Stars: 2.1k | Forks: 244
DAMO-DI-ML/NeurIPS2023-One-Fits-All
The official code for "One Fits All: Power General Time Series Analysis by Pretrained LM (NeurIPS 2023 Spotlight)"
Language:Python40057
SCXsunchenxi/TEST
Language:Python385
thuml/Time-Series-Library
A Library for Advanced Deep Time Series Models.
Language: Python | Stars: 5.1k | Forks: 855
chauncygu/Multi-Agent-Constrained-Policy-Optimisation
Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).
Language:Python13121
KimMeen/Time-LLM
[ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
Language: Python | Stars: 1k | Forks: 180
chennnnnyize/LLM_PowerSystems
Language:Jupyter Notebook112
xinliangzhou/Survey
This repository contains materials for a future survey submission
9
Pyosch/vpplib
Language:Python2920
TsingZ0/PFLlib
We expose this user-friendly algorithm library (with an integrated evaluation platform) for beginners who intend to start federated learning (FL) study
Language: Python | Stars: 1.3k | Forks: 272
akocherovskiy/LLM_as_optimizer
LLM as optimizer for linear regression problem
Language:Jupyter Notebook71
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
Language: Python | Stars: 88.9k | Forks: 14k
snwfdhmp/awesome-gpt-prompt-engineering
A curated list of awesome resources, tools, and other shiny things for GPT prompt engineering.
Language:Python89190
dair-ai/Prompt-Engineering-Guide
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
Language: MDX | Stars: 46.2k | Forks: 4.5k
mshumer/gpt-prompt-engineer
Language: Jupyter Notebook | Stars: 8.2k | Forks: 579
ngruver/llmtime
Language:Jupyter Notebook625136
Infatoshi/fcc-intro-to-llms
Language:Jupyter Notebook550225
nikhil3456/Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, Hado van Hasselt, Peter Sunehag, Timothy Lillicrap, Jonathan Hunt, Timothy Mann, Theophane Weber, Thomas Degris, Ben Coppin).
Language:Jupyter Notebook6510
