kajyuuen

LINE CorporationJapan

kajyuuen's Stars

ryokamoi/llm-self-correction-papers
List of papers on Self-Correction of LLMs.
692
wasiahmad/Awesome-LLM-Synthetic-Data
A reading list on LLM based Synthetic Data Generation 🔥
89049
RLHFlow/RLHF-Reward-Modeling
Recipes to train reward model for RLHF.
Language:Python1.1k75
teacherpeterpan/self-correction-llm-papers
This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.
47126
jackdewinter/pymarkdown
Language:Python8017
yoshi389111/kinokobooks
「きのこ本」を勝手に電子書籍化
Language:Markdown765
Delgan/loguru
Python logging made (stupidly) simple
Language:Python20.4k707
twpayne/chezmoi
Manage your dotfiles across multiple diverse machines, securely.
Language:Go13.7k498
imbushuo/mac-precision-touchpad
Windows Precision Touchpad Driver Implementation for Apple MacBook / Magic Trackpad
Language:C9.2k582
NousResearch/Hermes-Function-Calling
Language:Jupyter Notebook76198
chujiezheng/LLM-Safeguard
Official repository for ICML 2024 paper "On Prompt-Driven Safeguarding for Large Language Models"
Language:Python798
Oxen-AI/Self-Rewarding-Language-Models
This is work done by the Oxen.ai Community, trying to reproduce the Self-Rewarding Language Model paper from MetaAI.
Language:Python1149
Spico197/Humback
🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.
Language:Python1369
arcee-ai/mergekit
Tools for merging pretrained large language models.
Language:Python5k459
HumanSignal/awesome-human-in-the-loop
Awesome List of Human in the Loop resources and references for retraining models.
231
salesforce/AuditNLG
AuditNLG: Auditing Generative AI Language Modeling for Trustworthiness
Language:Python976
NVIDIA/NeMo-Aligner
Scalable toolkit for efficient model alignment
Language:Python65580
lmmlzn/Awesome-LLMs-Datasets
Summarize existing representative LLMs text datasets.
1.1k112
Clipy/Clipy
Clipboard extension app for macOS.
Language:Swift7.8k645
hkust-nlp/deita
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
Language:Python51628
uclaml/SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
Language:Python1.1k92
axolotl-ai-cloud/axolotl
Go ahead and axolotl questions
Language:Python8.2k899
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
Language:Jupyter Notebook15.8k2.3k
HumanSignal/RLHF
Collection of links, tutorials and best practices of how to collect the data and build end-to-end RLHF system to finetune Generative AI models
Language:Jupyter Notebook20342
IBM/Dromedary
Dromedary: towards helpful, ethical and reliable LLMs.
Language:Python1.1k87
HillZhang1999/llm-hallucination-survey
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"
95952
kislyuk/argcomplete
Python and tab completion, better together.
Language:Python1.4k137
argilla-io/distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
Language:Python1.8k144
yaodongC/awesome-instruction-dataset
A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)
1.1k60
HqWu-HITCS/Awesome-LLM-Survey
An Awesome Collection for LLM Survey
31731

kajyuuen

kajyuuen's Stars

ryokamoi/llm-self-correction-papers

wasiahmad/Awesome-LLM-Synthetic-Data

RLHFlow/RLHF-Reward-Modeling

teacherpeterpan/self-correction-llm-papers

jackdewinter/pymarkdown

yoshi389111/kinokobooks

Delgan/loguru

twpayne/chezmoi

imbushuo/mac-precision-touchpad

NousResearch/Hermes-Function-Calling

chujiezheng/LLM-Safeguard

Oxen-AI/Self-Rewarding-Language-Models

Spico197/Humback

arcee-ai/mergekit

HumanSignal/awesome-human-in-the-loop

salesforce/AuditNLG

NVIDIA/NeMo-Aligner

lmmlzn/Awesome-LLMs-Datasets

Clipy/Clipy

hkust-nlp/deita

uclaml/SPIN

axolotl-ai-cloud/axolotl

meta-llama/llama-recipes

HumanSignal/RLHF

IBM/Dromedary

HillZhang1999/llm-hallucination-survey

kislyuk/argcomplete

argilla-io/distilabel

yaodongC/awesome-instruction-dataset

HqWu-HITCS/Awesome-LLM-Survey