kouyakamada's Stars
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
mendableai/firecrawl
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
VikParuchuri/surya
OCR, layout analysis, reading order, table recognition in 90+ languages
facebookresearch/segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
h2oai/h2o-llmstudio
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
meta-llama/llama-agentic-system
Agentic components of the Llama Stack APIs
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
kyegomez/BitNet
Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch
argilla-io/distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
MeetKai/functionary
Chat language model that can use tools and interpret the results
AnswerDotAI/fsdp_qlora
Training LLMs with QLoRA + FSDP
lucidrains/self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
huggingface/nanotron
Minimalistic large language model 3D-parallelism training
ibm-granite/granite-code-models
Granite Code Models: A Family of Open Foundation Models for Code Intelligence
unitaryai/detoxify
Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ Pytorch Lightning and 🤗 Transformers. For access to our API, please email us at contact@unitary.ai.
RLHFlow/RLHF-Reward-Modeling
Recipes to train reward model for RLHF.
allenai/reward-bench
RewardBench: the first evaluation tool for reward models.
p-lambda/dsir
DSIR large-scale data selection framework for language model training
arcee-ai/PruneMe
Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models
EveripediaNetwork/fastc
Unattended Lightweight Text Classifiers with LLM Embeddings
limcheekin/open-text-embeddings
Open Source Text Embedding Models with OpenAI Compatible API
jshuadvd/LongRoPE
Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper
euclaise/SlimTrainer
Full finetuning of large language models without large memory requirements
UNITES-Lab/MC-SMoE
[ICLR 2024 Spotlight] Code for the paper "Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy"
cli99/flops-profiler
pytorch-profiler
llm-jp/llm-jp-corpus
oshizo/japanese-contextual-qa-chat
phymhan/llm-dpo
RLSNLP/SimpleBART