kingdy2002's Stars
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
RLHFlow/Online-RLHF
A recipe for online RLHF and online iterative DPO.
Doriandarko/claude-engineer
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capabilities of a large language model with practical file system operations and web search functionality.
younggyoseo/CQN
Coarse-to-fine Q-Network
OpenAutoCoder/Agentless
Agentless🐱: an agentless approach to automatically solve software development problems
huiwon-jang/RSP
Visual Representation Learning with Stochastic Frame Prediction (ICML 2024)
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
unslothai/unsloth
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
All-Hands-AI/OpenHands
🙌 OpenHands: Code Less, Make More
princeton-nlp/SWE-agent
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
Paitesanshi/LLM-Agent-Survey
AIoT-MLSys-Lab/Efficient-LLMs-Survey
[TMLR 2024] Efficient Large Language Models: A Survey
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
kingdy2002/VCSE
nlpxucan/WizardLM
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
1rgs/jsonformer
A Bulletproof Way to Generate Structured JSON from Language Models
hyp1231/awesome-llm-powered-agent
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
OpenGVLab/GITM
Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
EmbraceAGI/Awesome-AGI
A curated list of awesome AGI frameworks, software and resources
X-PLUG/mPLUG-Owl
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
MineDojo/Voyager
An Open-Ended Embodied Agent with Large Language Models
joonspk-research/generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
run-llama/llama_index
LlamaIndex is a data framework for your LLM applications
wangsr126/MAE-Lite
Official implement for ICML2023 paper: "A Closer Look at Self-Supervised Lightweight Vision Transformers"
implus/mae_segmentation
reproduction of semantic segmentation using masked autoencoder (mae)
ikostrikov/rlpd
opendilab/awesome-diffusion-model-in-rl
A curated list of Diffusion Model in RL resources (continually updated)