metterian's Stars
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
wandb/llm-kr-eval
Psycoy/MixEval
The official evaluation suite and dynamic data release for MixEval.
chujiezheng/chat_templates
Chat Templates for 🤗 HuggingFace Large Language Models
HeegyuKim/korouge
Adapts Google's official ROUGE implementation so that it can be used with Korean text
huggingface/optimum-benchmark
🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes.
Xnhyacinth/Awesome-LLM-Long-Context-Modeling
📰 Must-read papers and blogs on LLM-based Long Context Modeling 🔥
instructkr/LogicKor
A multi-domain reasoning benchmark for Korean language models
lmarena/arena-hard-auto
Arena-Hard-Auto: An automatic LLM benchmark.
teknium1/LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
microsoft/rho
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
Stability-AI/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
pytorch/torchtune
PyTorch native post-training library
McGill-NLP/llm2vec
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
FranxYao/Long-Context-Data-Engineering
Implementation of the paper "Data Engineering for Scaling Language Models to 128K Context"
facebookresearch/schedule_free
Schedule-Free Optimization in PyTorch
google-research/metricx
MicrosoftTranslator/GEMBA
GEMBA: GPT Estimation Metric Based Assessment
microsoft/gpt-MT
mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
MLGroupJLU/LLM-eval-survey
The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
confident-ai/deepeval
The LLM Evaluation Framework
NomaDamas/KICE_slayer_AI_Korean
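An AI that takes on the challenge of scoring Grade 1 (the top grade) on the Korean language section of the CSAT (Suneung)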
ruixiangcui/AGIEval
XueFuzhao/OpenMoE
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
arcee-ai/mergekit
Tools for merging pretrained large language models.
huggingface/cookbook
Open-source AI cookbook
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
UpstageAI/dataverse
The Universe of Data. All about data, data science, and data engineering
tunib-ai/large-scale-lm-tutorials
Large-scale language modeling tutorials with PyTorch