kaisugi's Stars
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
unslothai/unsloth
Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory
joelparkerhenderson/architecture-decision-record
Architecture decision record (ADR) examples for software planning, IT leadership, and template documentation
huggingface/chat-ui
Open source codebase powering the HuggingChat app
ragapp/ragapp
The easiest way to use Agentic RAG in any enterprise
rom1504/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
pytorch/torchtitan
A native PyTorch Library for large model training
mlfoundations/dclm
DataComp for Language Models
illuin-tech/colpali
The code used to train and run inference with the ColPali architecture.
huggingface/evaluation-guidebook
Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!
wasiahmad/Awesome-LLM-Synthetic-Data
A reading list on LLM based Synthetic Data Generation 🔥
fmaclen/hollama
A minimal web-UI for talking to Ollama servers
ibm-aur-nlp/PubTabNet
joeraut/latex2image-web
LaTeX to image converter with web UI using Node.js / Docker
py-pdf/benchmarks
Benchmarking PDF libraries
minsing-jin/Korean-SAT-LLM-Leaderboard
Korean SAT leader board
takahashim/md2review
a converter from Markdown into Re:VIEW, using redcarpet
esrille/ibus-hiragana
ひらがなIME for IBus
34j/best-of-lean4
A list of awesome lean4 projects. Feel free to add your project.
DeNA/dify-google-cloud-terraform
Terraform configuration for deploying Dify on Google Cloud with scalability, high availability, and production-level readiness.
llm-jp/text2dataset
Easily turn large English text datasets into Japanese text datasets using open LLMs.
embeddings-benchmark/leaderboard
Code for the MTEB leaderboard
Aratako/Japanese-RP-Bench
dahlia/fedify-microblog-tutorial-ja
『自分だけのフェディバースのマイクロブログを作ろう!』のAsciiDocのソースコード
hodanov/stable-diffusion-cli-on-modal
This is a script for running Stable Diffusion on Modal.
MMMU-Japanese-Benchmark/JMMMU
The official repository for the website scripts of "JMMMU: A Japanese Massive Multi-discipline Multimodal Understanding Benchmark"
nii-nlp/med-eval
Evaluation Pipeline for medical tasks.
fjm2u/assetGenie
sociocom/TNM-Classifier-Tester