Pinned Repositories
AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
alignment-handbook
Robust recipes to align language models with human and AI preferences
bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
EasyContext
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
FIN-bench
Evaluation of Finnish generative models
fmengine
Utilities for Training Very Large Models
llama-recipes
Examples and recipes for Llama 2 model
multimodal
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
Qualsimu
Taishi-N324's Repositories
Taishi-N324/fmengine
Utilities for Training Very Large Models
Taishi-N324/llama-recipes
Examples and recipes for Llama 2 model
Taishi-N324/multimodal
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
Taishi-N324/Qualsimu
Taishi-N324/AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Taishi-N324/alignment-handbook
Robust recipes to align language models with human and AI preferences
Taishi-N324/bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
Taishi-N324/EasyContext
Taishi-N324/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Taishi-N324/FIN-bench
Evaluation of Finnish generative models
Taishi-N324/hpsc-2024
Taishi-N324/llm-jp-sakura-ansible
Taishi-N324/llm-jp-sft
Taishi-N324/lm-evaluation-harness
Taishi-N324/megablocks
Taishi-N324/Megatron-DeepSpeed
Taishi-N324/Megatron-LLM
distributed trainer for LLMs
Taishi-N324/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Taishi-N324/llm-leaderboard
Project of llm evaluation to Japanese tasks
Taishi-N324/long-context
YaRN: Efficient Context Window Extension of Large Language Models
Taishi-N324/Megatron-LM-LUMI
Ongoing research training transformer models at scale
Taishi-N324/mlmm-evaluation
Multilingual Large Language Models Evaluation Benchmark
Taishi-N324/nccl-tests
NCCL Tests
Taishi-N324/Robin
Taishi-N324/rome
Locating and editing factual associations in GPT (NeurIPS 2022)
Taishi-N324/SEED
Empowers LLMs with the ability to see and draw.
Taishi-N324/t5x
Taishi-N324/Taishi-N324.github.io
Taishi-N324/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Taishi-N324/VMLU