noowad93's Stars
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
karpathy/LLM101n
LLM101n: Let's build a Storyteller
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
dair-ai/ML-Papers-of-the-Week
🔥Highlighting the top ML papers every week.
BlinkDL/ChatRWKV
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
nebuly-ai/optimate
A collection of libraries to optimise AI model performances
axolotl-ai-cloud/axolotl
Go ahead and axolotl questions
lucidrains/PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
yizhongw/self-instruct
Aligning pretrained language models with instruction data generated by themselves.
promptslab/Promptify
Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research
teknium1/GPTeacher
A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer
openai/prm800k
800,000 step-level correctness labels on LLM solutions to MATH problems
argilla-io/distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
karpathy/randomfun
Notebooks and various random fun
llm-jp/awesome-japanese-llm
日本語LLMまとめ - Overview of Japanese LLMs
opensouls/soul-engine-sdk
Soul Engine SDK
NomaDamas/KICE_slayer_AI_Korean
수능 국어 1등급에 도전하는 AI
allenai/WildBench
Benchmarking LLMs with Challenging Tasks from Real Users
xrsrke/instructGOOSE
Implementation of Reinforcement Learning from Human Feedback (RLHF)
lcw99/evolve-instruct
evolve llm training instruction, from english instruction to any language.
1never/open2ch-dialogue-corpus
おーぷん2ちゃんねるをクロールして作成した対話コーパス
oshizo/japanese-llm-roleplay-benchmark
PygmalionAI/data-toolbox
Our data munging code.
chai-research/lmgym
Code base for internal reward models and PPO training
sb-jang/kodialogbench
Code and data for "KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark" (LREC-COLING 2024)