kajyuuen's Stars
ryokamoi/llm-self-correction-papers
List of papers on Self-Correction of LLMs.
wasiahmad/Awesome-LLM-Synthetic-Data
A reading list on LLM based Synthetic Data Generation 🔥
RLHFlow/RLHF-Reward-Modeling
Recipes for training reward models for RLHF.
teacherpeterpan/self-correction-llm-papers
This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.
jackdewinter/pymarkdown
yoshi389111/kinokobooks
An unofficial e-book conversion of the "Kinoko-bon" (きのこ本), made on the author's own initiative.
Delgan/loguru
Python logging made (stupidly) simple
twpayne/chezmoi
Manage your dotfiles across multiple diverse machines, securely.
imbushuo/mac-precision-touchpad
Windows Precision Touchpad Driver Implementation for Apple MacBook / Magic Trackpad
NousResearch/Hermes-Function-Calling
chujiezheng/LLM-Safeguard
Official repository for ICML 2024 paper "On Prompt-Driven Safeguarding for Large Language Models"
Oxen-AI/Self-Rewarding-Language-Models
This is work done by the Oxen.ai Community, trying to reproduce the Self-Rewarding Language Model paper from MetaAI.
Spico197/Humback
🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.
arcee-ai/mergekit
Tools for merging pretrained large language models.
HumanSignal/awesome-human-in-the-loop
Awesome List of Human in the Loop resources and references for retraining models.
salesforce/AuditNLG
AuditNLG: Auditing Generative AI Language Modeling for Trustworthiness
NVIDIA/NeMo-Aligner
Scalable toolkit for efficient model alignment
lmmlzn/Awesome-LLMs-Datasets
A summary of existing representative LLM text datasets.
Clipy/Clipy
Clipboard extension app for macOS.
hkust-nlp/deita
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
uclaml/SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
axolotl-ai-cloud/axolotl
Go ahead and axolotl questions
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
HumanSignal/RLHF
Collection of links, tutorials, and best practices on how to collect data and build an end-to-end RLHF system to fine-tune generative AI models
IBM/Dromedary
Dromedary: towards helpful, ethical and reliable LLMs.
HillZhang1999/llm-hallucination-survey
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"
kislyuk/argcomplete
Python and tab completion, better together.
argilla-io/distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
yaodongC/awesome-instruction-dataset
A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca)
HqWu-HITCS/Awesome-LLM-Survey
An Awesome Collection for LLM Survey