kouyakamada

kouyakamada's Stars

langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
Language:TypeScript53.5k 379 5.1k7.8k
mendableai/firecrawl
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Language:TypeScript19.3k 108 4031.5k
VikParuchuri/surya
OCR, layout analysis, reading order, table recognition in 90+ languages
Language:Python14.4k 106 167908
facebookresearch/segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language:Jupyter Notebook11.1k 64 259944
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
Language:Python7.8k 47 1.1k568
h2oai/h2o-llmstudio
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
Language:Python4k 81 401417
meta-llama/llama-agentic-system
Agentic components of the Llama Stack APIs
Language:Python3.2k 38 35308
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
Language:Python2.1k 47 134152
kyegomez/BitNet
Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch
Language:Python1.7k 42 40155
argilla-io/distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
Language:Python1.7k 17 443132
MeetKai/functionary
Chat language model that can use tools and interpret the results
Language:Python1.4k 20 126112
AnswerDotAI/fsdp_qlora
Training LLMs with QLoRA + FSDP
Language:Jupyter Notebook1.4k 23 38188
lucidrains/self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
Language:Python1.3k 23 1773
huggingface/nanotron
Minimalistic large language model 3D-parallelism training
Language:Python1.3k 42 83127
ibm-granite/granite-code-models
Granite Code Models: A Family of Open Foundation Models for Code Intelligence
1.2k 22 1482
unitaryai/detoxify
Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ Pytorch Lightning and 🤗 Transformers. For access to our API, please email us at contact@unitary.ai.
Language:Python969 15 63114
RLHFlow/RLHF-Reward-Modeling
Recipes to train reward model for RLHF.
Language:Python956 20 3171
allenai/reward-bench
RewardBench: the first evaluation tool for reward models.
Language:Python444 5 6953
p-lambda/dsir
DSIR large-scale data selection framework for language model training
Language:Python232 21 819
arcee-ai/PruneMe
Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models
Language:Python198 3 1226
EveripediaNetwork/fastc
Unattended Lightweight Text Classifiers with LLM Embeddings
Language:Python174 6 210
limcheekin/open-text-embeddings
Open Source Text Embedding Models with OpenAI Compatible API
Language:Python133 5 1519
jshuadvd/LongRoPE
Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper
Language:Python125 6 314
euclaise/SlimTrainer
Full finetuning of large language models without large memory requirements
Language:Python93 7 23
UNITES-Lab/MC-SMoE
[ICLR 2024 Spotlight] Code for the paper "Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy"
Language:Python64 4 29
cli99/flops-profiler
pytorch-profiler
Language:Python50 3 88
llm-jp/llm-jp-corpus
Language:Python41 4 234
oshizo/japanese-contextual-qa-chat
Language:Jupyter Notebook7 1 01
phymhan/llm-dpo
Language:Python7 1 03
RLSNLP/SimpleBART
Language:Python4 2 00