Pinned Repositories
apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
CoherentDualCore
gem5
This is a read-only mirror of the gem5 simulator. The upstream repository is stored at https://gem5.googlesource.com, and code reviews should be submitted to https://gem5-review.googlesource.com/. The mirrors are synchronized every 15 minutes.
gem5-mcpat-parser
A parser to convert the output of gem5 to a format for McPAT.
Megatron-LM
Ongoing research training transformer models at scale
NeMo
NeMo: a toolkit for conversational AI
NeMo-Aligner
Scalable toolkit for efficient model alignment
NeMo-Megatron-Launcher
NeMo Megatron launcher and tools
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper GPUs, to provide better performance with lower memory utilization in both training and inference.
VerilogDNN
JimmyZhang12's Repositories
JimmyZhang12/NeMo-Aligner
Scalable toolkit for efficient model alignment
JimmyZhang12/VerilogDNN
JimmyZhang12/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
JimmyZhang12/CoherentDualCore
JimmyZhang12/gem5
This is a read-only mirror of the gem5 simulator. The upstream repository is stored at https://gem5.googlesource.com, and code reviews should be submitted to https://gem5-review.googlesource.com/. The mirrors are synchronized every 15 minutes.
JimmyZhang12/gem5-mcpat-parser
A parser to convert the output of gem5 to a format for McPAT.
JimmyZhang12/Megatron-LM
Ongoing research training transformer models at scale
JimmyZhang12/NeMo
NeMo: a toolkit for conversational AI
JimmyZhang12/NeMo-Megatron-Launcher
NeMo Megatron launcher and tools
JimmyZhang12/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper GPUs, to provide better performance with lower memory utilization in both training and inference.
JimmyZhang12/gpu-algorithms-labs
IMPACT GPU Algorithms Teaching Labs
JimmyZhang12/LowPoly
15-618 Final Project: Transforming an image to low poly style
JimmyZhang12/markdown-cheatsheet
Markdown Cheatsheet for GitHub Readme.md
JimmyZhang12/mcpat
An integrated power, area, and timing modeling framework for multicore and manycore architectures
JimmyZhang12/OutofOrderCore
Functional out-of-order core, written in Verilog
JimmyZhang12/PrairieLearn
Online problem-driven learning system
JimmyZhang12/predict-T
JimmyZhang12/raytracer
JimmyZhang12/riscv-boom
SonicBOOM: The Berkeley Out-of-Order Machine
JimmyZhang12/scons-example
An example C/C++ project showing how to build a shared library and tests in nested subdirectories using SCons.
JimmyZhang12/SLIC_CUDA
JimmyZhang12/testbin