ziyang-arch

University of California, Riverside

Pinned Repositories

Compiler
Language:C++0 1 00
CourseProject_C
Solves the SAT problem with the _basic and _improve methods.
Language:C0 0 00
CS-211-Lab3
Language:C00
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python00
Efficient-Tuning-LLMs
Easy and Efficient Finetuning of QLoRA LLMs. (Supports LLama, LLama2, bloom, Baichuan, GLM, Falcon.) Efficient quantized training and deployment of large models.
Language:Python00
GEMMOptimization
Language:Jupyter Notebook0 1 00
HolisticTraceAnalysis
A library to analyze PyTorch traces.
Language:Python00
Hybrid-Cooling-For-Data-Center
Language:MATLAB3 0 00
kineto
A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.
Language:HTML00
llama2
Inference code for LLaMA models
Language:Python00

ziyang-arch's Repositories

ziyang-arch/Hybrid-Cooling-For-Data-Center
Language:MATLAB3 0 00
ziyang-arch/Compiler
Language:C++0 1 00
ziyang-arch/CourseProject_C
Solves the SAT problem with the _basic and _improve methods.
Language:C0 0 00
ziyang-arch/CS-211-Lab3
Language:C00
ziyang-arch/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python00
ziyang-arch/Efficient-Tuning-LLMs
Easy and Efficient Finetuning of QLoRA LLMs. (Supports LLama, LLama2, bloom, Baichuan, GLM, Falcon.) Efficient quantized training and deployment of large models.
Language:Python00
ziyang-arch/GEMMOptimization
Language:Jupyter Notebook0 1 00
ziyang-arch/HolisticTraceAnalysis
A library to analyze PyTorch traces.
Language:Python00
ziyang-arch/kineto
A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.
Language:HTML00
ziyang-arch/llama2
Inference code for LLaMA models
Language:Python00
ziyang-arch/LoadPredict
Language:Python
ziyang-arch/MachineLearning_Ng
Study code for Andrew Ng's Machine Learning course on Coursera (Fall 2017). The Stanford Coursera course on MachineLearning with Andrew Ng
Language:Jupyter Notebook
ziyang-arch/matrixprofiler
This is the core functions needed by the `tsmp` package. The low level and carefully checked mathematical functions are here. These are implementations of the Matrix Profile concept that was created by CS-UCR <http://www.cs.ucr.edu/~eamonn/MatrixProfile.html>.
Language:C++0 0
ziyang-arch/MLinference
Reference implementations of MLPerf™ inference benchmarks
Language:Python0 0
ziyang-arch/nccl
Optimized primitives for collective multi-GPU communication
Language:C++0 0
ziyang-arch/nccl-tests-power
NCCL Tests
Language:Cuda
ziyang-arch/OpenBLAS
OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.
Language:Fortran0 0
ziyang-arch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Language:Python0 0
ziyang-arch/pytorch-OpCounter
Count the MACs / FLOPs of your PyTorch model.
Language:Python0 0
ziyang-arch/ziyang-arch.github.io
Language:HTML1 0
