Taishi-N324

Institute of Science TokyoJapan

Pinned Repositories

AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Language:Python00
alignment-handbook
Robust recipes to align language models with human and AI preferences
Language:Python0 0 00
bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
Language:Python0 0 00
EasyContext
Language:Python00
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python0 0 00
FIN-bench
Evaluation of Finnish generative models
Language:Python0 0 00
fmengine
Utilities for Training Very Large Models
Language:Python1 0 00
llama-recipes
Examples and recipes for Llama 2 model
Language:Jupyter Notebook1 0 00
multimodal
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
Language:Python1 0 00
Qualsimu
Language:HTML10

Taishi-N324's Repositories

Taishi-N324/fmengine
Utilities for Training Very Large Models
Language:Python1 0 00
Taishi-N324/llama-recipes
Examples and recipes for Llama 2 model
Language:Jupyter Notebook1 0 00
Taishi-N324/multimodal
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
Language:Python1 0 00
Taishi-N324/Qualsimu
Language:HTML10
Taishi-N324/AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Language:Python00
Taishi-N324/alignment-handbook
Robust recipes to align language models with human and AI preferences
Language:Python0 0 00
Taishi-N324/bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
Language:Python0 0 00
Taishi-N324/EasyContext
Language:Python00
Taishi-N324/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python0 0 00
Taishi-N324/FIN-bench
Evaluation of Finnish generative models
Language:Python0 0 00
Taishi-N324/hpsc-2024
Language:Shell00
Taishi-N324/llm-jp-sakura-ansible
Language:Jinja0 0 00
Taishi-N324/llm-jp-sft
Language:Shell0 0 00
Taishi-N324/lm-evaluation-harness
Language:Python0 0 00
Taishi-N324/megablocks
Language:Python00
Taishi-N324/Megatron-DeepSpeed
Language:Python00
Taishi-N324/Megatron-LLM
distributed trainer for LLMs
Language:Python00
Taishi-N324/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python0 0
Taishi-N324/llm-leaderboard
Project of llm evaluation to Japanese tasks
Language:Python
Taishi-N324/long-context
YaRN: Efficient Context Window Extension of Large Language Models
Language:Python
Taishi-N324/Megatron-LM-LUMI
Ongoing research training transformer models at scale
Taishi-N324/mlmm-evaluation
Multilingual Large Language Models Evaluation Benchmark
Language:Python0 0
Taishi-N324/nccl-tests
NCCL Tests
Language:Cuda0 0
Taishi-N324/Robin
Language:Python0 0
Taishi-N324/rome
Locating and editing factual associations in GPT (NeurIPS 2022)
Language:Python0 0
Taishi-N324/SEED
Empowers LLMs with the ability to see and draw.
Language:Python0 0
Taishi-N324/t5x
Language:Python0 0
Taishi-N324/Taishi-N324.github.io
Language:HTML1 0
Taishi-N324/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python0 0
Taishi-N324/VMLU
Language:Python0 0