metterian

NLP Engineer 42dot | Korea University

42dotSeoul

metterian's Stars

jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Language:Python8.1k484
wandb/llm-kr-eval
Language:Python185
Psycoy/MixEval
The official evaluation suite and dynamic data release for MixEval.
Language:Python23037
chujiezheng/chat_templates
Chat Templates for 🤗 HuggingFace Large Language Models
Language:Jinja56552
HeegyuKim/korouge
Google 공식 Rouge Implementation을 한국어에서 사용할 수 있도록 처리
Language:Python14
huggingface/optimum-benchmark
🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes.
Language:Python27853
Xnhyacinth/Awesome-LLM-Long-Context-Modeling
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
1.1k41
instructkr/LogicKor
한국어 언어모델 다분야 사고력 벤치마크
Language:Python17831
lmarena/arena-hard-auto
Arena-Hard-Auto: An automatic LLM benchmark.
Language:Jupyter Notebook68981
teknium1/LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
1162
microsoft/rho
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
37011
Stability-AI/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
Language:Python14848
pytorch/torchtune
PyTorch native post-training library
Language:Python4.5k468
McGill-NLP/llm2vec
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
Language:Python1.4k105
FranxYao/Long-Context-Data-Engineering
Implementation of paper Data Engineering for Scaling Language Models to 128K Context
Language:Python44729
facebookresearch/schedule_free
Schedule-Free Optimization in PyTorch
Language:Python2k69
google-research/metricx
Language:Python7712
MicrosoftTranslator/GEMBA
GEMBA — GPT Estimation Metric Based Assessment
Language:Python10520
microsoft/gpt-MT
Language:Ruby848
mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
Language:Python19.5k1.6k
MLGroupJLU/LLM-eval-survey
The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
1.5k92
confident-ai/deepeval
The LLM Evaluation Framework
Language:Python4.1k334
NomaDamas/KICE_slayer_AI_Korean
수능 국어 1등급에 도전하는 AI
Language:Python52039
ruixiangcui/AGIEval
Language:Python71648
XueFuzhao/OpenMoE
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
Language:Python1.4k74
arcee-ai/mergekit
Tools for merging pretrained large language models.
Language:Python5k458
huggingface/cookbook
Open-source AI cookbook
Language:Jupyter Notebook1.7k256
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language:C++9k1k
UpstageAI/dataverse
The Universe of Data. All about data, data science, and data engineering
Language:Python52352
tunib-ai/large-scale-lm-tutorials
Large-scale language modeling tutorials with PyTorch
Language:Jupyter Notebook28954

metterian

metterian's Stars

jzhang38/TinyLlama

wandb/llm-kr-eval

Psycoy/MixEval

chujiezheng/chat_templates

HeegyuKim/korouge

huggingface/optimum-benchmark

Xnhyacinth/Awesome-LLM-Long-Context-Modeling

instructkr/LogicKor

lmarena/arena-hard-auto

teknium1/LLM-Benchmark-Logs

microsoft/rho

Stability-AI/lm-evaluation-harness

pytorch/torchtune

McGill-NLP/llm2vec

FranxYao/Long-Context-Data-Engineering

facebookresearch/schedule_free

google-research/metricx

MicrosoftTranslator/GEMBA

microsoft/gpt-MT

mlc-ai/mlc-llm

MLGroupJLU/LLM-eval-survey

confident-ai/deepeval

NomaDamas/KICE_slayer_AI_Korean

ruixiangcui/AGIEval

XueFuzhao/OpenMoE

arcee-ai/mergekit

huggingface/cookbook

NVIDIA/TensorRT-LLM

UpstageAI/dataverse

tunib-ai/large-scale-lm-tutorials