kingdy2002

BS Student at KAIST, Research Intern, hosted by Prof. Jinwoo Shin at KAIST

kingdy2002's Stars

OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
Language:Python2.2k217
RLHFlow/Online-RLHF
A recipe for online RLHF and online iterative DPO.
Language:Python38844
Doriandarko/claude-engineer
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capabilities of a large language model with practical file system operations and web search functionality.
Language:Python9.2k986
younggyoseo/CQN
Coarse-to-fine Q-Network
Language:Python26
OpenAutoCoder/Agentless
Agentless🐱: an agentless approach to automatically solve software development problems
Language:Python68381
huiwon-jang/RSP
Visual Representation Learning with Stochastic Frame Prediction (ICML 2024)
Language:Python16
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
Language:Python6.7k1.8k
unslothai/unsloth
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
Language:Python16.7k1.1k
All-Hands-AI/OpenHands
🙌 OpenHands: Code Less, Make More
Language:Python32.9k3.8k
princeton-nlp/SWE-agent
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
Language:Python13.4k1.3k
Paitesanshi/LLM-Agent-Survey
2.5k148
AIoT-MLSys-Lab/Efficient-LLMs-Survey
[TMLR 2024] Efficient Large Language Models: A Survey
97883
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
Language:Python4.6k396
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
Language:Python2.1k170
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Language:Python16.1k1.6k
kingdy2002/VCSE
Language:Python172
nlpxucan/WizardLM
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
Language:Python9.2k717
1rgs/jsonformer
A Bulletproof Way to Generate Structured JSON from Language Models
Language:Jupyter Notebook4.4k154
hyp1231/awesome-llm-powered-agent
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
1.4k111
OpenGVLab/GITM
Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory
59519
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Language:Python167k44.2k
EmbraceAGI/Awesome-AGI
A curated list of awesome AGI frameworks, software and resources
46041
X-PLUG/mPLUG-Owl
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
Language:Python2.3k171
MineDojo/Voyager
An Open-Ended Embodied Agent with Large Language Models
Language:JavaScript5.5k514
joonspk-research/generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
16.9k2.2k
run-llama/llama_index
LlamaIndex is a data framework for your LLM applications
Language:Python36k5.1k
wangsr126/MAE-Lite
Official implement for ICML2023 paper: "A Closer Look at Self-Supervised Lightweight Vision Transformers"
Language:Python1098
implus/mae_segmentation
reproduction of semantic segmentation using masked autoencoder (mae)
Language:Python15514
ikostrikov/rlpd
Language:Python20824
opendilab/awesome-diffusion-model-in-rl
A curated list of Diffusion Model in RL resources (continually updated)
76742

kingdy2002

kingdy2002's Stars

OpenRLHF/OpenRLHF

RLHFlow/Online-RLHF

Doriandarko/claude-engineer

younggyoseo/CQN

OpenAutoCoder/Agentless

huiwon-jang/RSP

EleutherAI/lm-evaluation-harness

unslothai/unsloth

All-Hands-AI/OpenHands

princeton-nlp/SWE-agent

Paitesanshi/LLM-Agent-Survey

AIoT-MLSys-Lab/Efficient-LLMs-Survey

huggingface/alignment-handbook

eric-mitchell/direct-preference-optimization

huggingface/peft

kingdy2002/VCSE

nlpxucan/WizardLM

1rgs/jsonformer

hyp1231/awesome-llm-powered-agent

OpenGVLab/GITM

Significant-Gravitas/AutoGPT

EmbraceAGI/Awesome-AGI

X-PLUG/mPLUG-Owl

MineDojo/Voyager

joonspk-research/generative_agents

run-llama/llama_index

wangsr126/MAE-Lite

implus/mae_segmentation

ikostrikov/rlpd

opendilab/awesome-diffusion-model-in-rl