okoge-kaz
Master (Computer Science) student at Tokyo Institute of Technology
Tokyo Institute of TechnologyTokyo Japan
Pinned Repositories
Compiler_Construction
Tokyo Institute of Technology 2022-2Q CSC. T372
Creative-Programming-Project
2021-1Q,2Q プログラミング創造演習 (Tokyo Tech)
document
Documentation for Computer Science and Blog for Software Engineer Career
golang-todo-application
システム設計演習 Tokyo Institute of Technology
llm-jp-sakura-ansible
llm-recipes
Ongoing Research Project for continaual pre-training LLM(dense mode)
megatron-deepspeed-turing-techblog
Turing Tech Blog Repository
moe-recipes
Ongoing research training Mixture of Expert models.
turing-techblog-megatron-deepspeed
環境構築方法の詳細は以下のLinkから
wandb_watcher
ABCI 大規模言語モデル構築支援にてwandbのジョブを監視するためのツール
okoge-kaz's Repositories
okoge-kaz/llm-recipes
Ongoing Research Project for continaual pre-training LLM(dense mode)
okoge-kaz/moe-recipes
Ongoing research training Mixture of Expert models.
okoge-kaz/Megatron-LM
Ongoing research training transformer models at scale
okoge-kaz/swallow-tuning
okoge-kaz/cluster-toolkit
Cluster Toolkit is an open-source software offered by Google Cloud which makes it easy for customers to deploy AI/ML and HPC environments on Google Cloud.
okoge-kaz/cuda-samples
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
okoge-kaz/DALI
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
okoge-kaz/deploymentmanager-samples
Deployment Manager samples and templates.
okoge-kaz/distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
okoge-kaz/eleutherai-cookbook
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
okoge-kaz/grouped_gemm
PyTorch bindings for CUTLASS grouped GEMM.
okoge-kaz/hpsc-2024
okoge-kaz/levanter
Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax
okoge-kaz/llama-recipes
Examples and recipes for Llama 2 model
okoge-kaz/llama3v
A SOTA vision model built on top of llama3 8B.
okoge-kaz/llm-jp-Megatron-DeepSpeed
okoge-kaz/llm-node-tests
okoge-kaz/mistral-hackathon
okoge-kaz/ml-engineering
Machine Learning Engineering Open Book
okoge-kaz/nanotron
Minimalistic large language model 3D-parallelism training
okoge-kaz/NeMo-Aligner
Scalable toolkit for efficient model alignment
okoge-kaz/nvidia-resiliency-ext
NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to failures and interruptions.
okoge-kaz/okoge-kaz
okoge-kaz/ppcomp24
okoge-kaz/swallow-project-parper-graph
okoge-kaz/Tensor-Puzzles
Solve puzzles. Improve your pytorch.
okoge-kaz/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
okoge-kaz/torchtitan
A native PyTorch Library for large model training
okoge-kaz/torchtune
A Native-PyTorch Library for LLM Fine-tuning
okoge-kaz/TSUBAME-4.0-hands-on